Reinforcement Learning for Low Probability High Impact Risks

Hunt, Gareth David

Hunt G 2019.pdf (7.151Mb)

Access Status

Open access

Authors

Hunt, Gareth David

Date

2019

Supervisor

Mihai Lazarescu

Type

Thesis

Award

MPhil

Metadata

Show full item record

Faculty

Science and Engineering

School

School of Electrical Engineering, Computing and Mathematical Science

URI

http://hdl.handle.net/20.500.11937/77106

Collection

Curtin Theses

Abstract

We demonstrate a method of reinforcement learning that uses training in simulation. Our system generates an estimate of the potential reward and danger of each action as well as a measure of the uncertainty present in both. The system generates this by seeking out not only rewarding actions but also dangerous ones in the simulated training. During runtime our system is able to use this knowledge to avoid risks while accomplishing its tasks.