澳门太阳集团(9728·VIP网城)网站入口-欢迎您

学术科研

Graduate Student Colloquium

Impulse control via reinforcement learning

演讲者：陈子豪（南科大）
时间：2023-03-03 21:10-21:40
地点：理学院大楼M5024讨论间

摘要：This lecture mainly introduces how to use reinforcement learning to solve a pulse control problem. First, the impulse control problem is converted into the optimal stopping time problem. Then the reinforcement learning method is used to solve the optimal stopping time problem, and then the verification theorem is used to obtain the optimal impulse control.