ADPRL 2013

2013 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning

Adaptive (or approximate) dynamic programming (ADP) is a general and effective approach for solving optimal control problems by adapting to uncertain environments over time. ADP optimizes a user-defined cost function with respect to an adaptive control law, conditioned on prior knowledge of the system and its state, in the presence of system uncertainties. A numerical search over the present value of the control minimizes a nonlinear cost function forward in time, providing a basis for real-time, approximate optimal control. The ability to improve performance over time, subject to new or unexplored objectives or dynamics, has made ADP attractive in a number of application domains, including optimal control and estimation, operations research, and computational intelligence. ADP can be viewed as a form of reinforcement learning based on an actor-critic architecture that optimizes a user-prescribed value online and obtains the resulting optimal control policy.
Reinforcement learning (RL) algorithms optimize an agent's behavior by letting it interact with an environment and learn from the feedback it receives. The goal of the agent is to maximize its accumulated reward over time; to do so, it estimates value functions that predict its future reward intake when executing a particular policy. Reinforcement learning techniques can be combined with many different function approximators and do not assume any a priori knowledge about the environment. An important aspect of RL is that an agent has to explore parts of the environment it does not know well while at the same time exploiting its knowledge to maximize its reward intake. RL techniques have already been applied successfully to many problems, such as robot control, game playing, elevator control, network routing, and traffic light optimization.


The symposium topics include, but are not limited to:

Keynote, Tutorial and Panel Sessions

Please forward your proposals, with a detailed abstract and bio-sketches of the speakers, to the Symposium Co-Chairs and the SSCI Keynote-Tutorial Chair, Dr. S. Das.

Accepted Special Sessions

Special Session 1: Online planning
Organizers: Lucian Busoniu and Rémi Munos
(To submit a paper to this session, please select "01s1" as the main research topic)



Special Session 2: Evolutionary Algorithms for ADPRL
Organizers: Hisashi Handa and Kazuhiro Ohkura
(To submit a paper to this session, please select "01s2" as the main research topic)


Special Session 3: Finite-Approximate-Error Based Adaptive Dynamic Programming: Algorithms and Applications
Organizers: Yanhong Luo, Qinglai Wei and Zengguang Hou


Special Session 4: Data-driven Adaptive Dynamic Programming and Its Applications in Complex Systems
Organizers: Derong Liu, Haibo He and Dongbin Zhao
(To submit a paper to this session, please select "01s4" as the main research topic)

Special Session 5: ADP and RL in Real-Time Feedback Systems
Organizers: Xin Xu and Haibo He
(To submit a paper to this session, please select "01s5" as the main research topic)


Special Sessions

Please forward your special session proposals to the Symposium Co-Chairs.

