Reinforcement Learning machine learning 기법 중 하나. B) For RS A Survey on Reinforcement Learning for Recommender Systems C) Related Markov Decision Process, dynamic programming, Monte Carlo Method, temporal difference N-step Bootstrapping Policy Gradient Multi-Armed Bandit