December Update from Morgan & Claypool Publishers
Communication Networks Multi-Armed Bandits: Theory and Applications to Online Learning in Networks Author: Qing Zhao, Cornell University Keywords: multi-armed bandit, machine learning, online learning, reinforcement learning, Markov decision processes Abstract: Multi-armed bandit problems pertain to optimal sequential decision making and learning in … Continue reading