Keio Univ. Yagami-campus, Building 14, Room 631A
3-14-1 Kouhoku-ku, Hiyoshi, Yokohama 223-8522, JAPAN
abstract:
I'd like to briefly review mathematical basics of reinforcement learning including the setup, the Markov Decision Process, and three training schemes, value-based algorithm, policy-based algorithm, and hybrid of them so-called actor-critic algorithm.
Public events of RIKEN Center for Advanced Intelligence Project (AIP)
Join community