![]() |
Siwei Wang (汪思为)Phd Student |
I am now a PhD student in IIIS, Tsinghua University, and my thesis advisor is Longbo Huang.
I received my B.S. degree in IIIS, Tsinghua University in 2015.
I focus on online learning problems, especially the Multi-armed Bandits (MAB) model and other kinds of online reinforcement learning problems.
Theory Group, Microsoft Research Asia, Beijing, June 2016 - September 2016
Department of Computer Science and Engineering (CSE), The Chinese University of Hong Kong, Hong Kong, June 2019 - September 2019
Yihan Du, Siwei Wang, Longbo Huang, “Dueling Bandits: From Two-dueling to Multi-dueling”, International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS), May 2020.
Siwei Wang and Longbo Huang, “Multi-armed Bandits with Compensation”, Proceedings of the Thirty-second Conference on Neural Information Processing Systems (NIPS), December 2018. (pdf)
Siwei Wang and Wei Chen, “Thompson Sampling for Combinatorial Semi-Bandits”, Proceedings of the 35th International Conference on Machine Learning (ICML), July 2018. (pdf)
Yifeng Teng, Shenghao Yang, Siwei Wang and Mingfei Zhao, “Tight Bound on Randomness for Violating the CHSH Inequality”, IEEE Transactions on Information Theory, April 2016. (pdf)