CQUniversity
Browse

File(s) not publicly available

A biologically inspired hierarchical reinforcement learning system

journal contribution
posted on 2017-12-06, 00:00 authored by Weidong Zhou, R Coggins
To increase the adaptivity of hierarchical reinforcement learning (HRL) and accelerate the learning process in environments with multiple sources of reward, we propose an emotion-based HRL algorithm inspired by neurobiology. In the algorithm, each reward source defines a subtask and each subtask is assigned an artificial emotion indication (AEI) that predicts the reward component associated with the subtask. The AEIs are simultaneously learned along with the top-level policy and used to interrupt subtask execution when the AEIs change significantly. The algorithm is tested in a simulated gridworld which has two sources of reward and is partially observable. Experimental results show that the inclusion of an artificial emotion mechanism that adaptively terminates subtasks makes reuse of the subtask policies efficient for multigoal environments. The use of the artificial emotion variables significantly accelerates the learning process by 60% and achieves higher long-term reward compared to a human-designed policy and a restricted form of the MAXQ algorithm.

Funding

Category 1 - Australian Competitive Grants (this includes ARC, NHMRC)

History

Volume

36

Start Page

1

End Page

44

Number of Pages

44

eISSN

1087-6553

ISSN

0196-9722

Location

United States

Publisher

Taylor & Francis Inc.

Language

en-aus

Peer Reviewed

  • Yes

Open Access

  • No

External Author Affiliations

University of Sydney;

Era Eligible

  • Yes

Journal

Cybernetics and systems.