Autonomous Two-Stage Object Retrieval Using Supervised and Reinforcement Learning
dc.contributor.author | Rouillard, T. | |
dc.contributor.author | Howard, Ian | |
dc.contributor.author | Cui, Lei | |
dc.date.accessioned | 2020-08-17T03:36:22Z | |
dc.date.available | 2020-08-17T03:36:22Z | |
dc.date.issued | 2019 | |
dc.identifier.citation | Rouillard, T. and Howard, I. and Cui, L. 2019. Autonomous Two-Stage Object Retrieval Using Supervised and Reinforcement Learning, in 16th IEEE International Conference on Mechatronics and Automation (ICMA), Aug 4-7 2019. Tianjin, Peoples Republic of China: IEEE. | |
dc.identifier.uri | http://hdl.handle.net/20.500.11937/80592 | |
dc.identifier.doi | 10.1109/ICMA.2019.8816290 | |
dc.description.abstract |
Humans have been sending tele-operated robots into hazardous areas in an attempt to preserve life for many years. The task they are presented with is often challenging and requires cognitive abilities, that is, the ability to process information in order to apply it to a different situation. In this work, we proposed an autonomous approach employing both supervised and reinforcement learning for hidden-object retrieval in two stages. Stage 1 used both learning methods to find a hidden object whereas stage 2 only used reinforcement learning to isolate it. The method is targeted towards field robots with reduced computational power hence we explored the viability of tabular reinforcement learning algorithms. We used a convolutional neural network (CNN) to interpret the state of the scene from images and a reinforcement learning agent for each stage of the task. The robot was presented with a workspace containing piles of rubble under one of which an object was buried. The robot must learn to find and isolate it. We compared the performance of four reinforcement learning algorithms over 500 episodes and found that Sarsa (λ) and RMax were most appropriate for stage 1 and 2 respectively. The approach allowed a robot to learn to complete a search and retrieval task by interpreting images. This could lead to the deployment of such robots in disaster areas eliminating the need for tele-operated platforms. | |
dc.language | English | |
dc.publisher | IEEE | |
dc.relation.sponsoredby | http://purl.org/au-research/grants/arc/DE170101062 | |
dc.subject | Science & Technology | |
dc.subject | Technology | |
dc.subject | Automation & Control Systems | |
dc.subject | Engineering, Electrical & Electronic | |
dc.subject | Robotics | |
dc.subject | Engineering | |
dc.subject | Intelligent robots | |
dc.subject | Reinforcement learning | |
dc.subject | Computer vision | |
dc.subject | Robotic Manipulation | |
dc.subject | ALGORITHM | |
dc.title | Autonomous Two-Stage Object Retrieval Using Supervised and Reinforcement Learning | |
dc.type | Conference Paper | |
dcterms.source.startPage | 780 | |
dcterms.source.endPage | 786 | |
dcterms.source.title | Proceedings of 2019 IEEE International Conference on Mechatronics and Automation, ICMA 2019 | |
dcterms.source.isbn | 9781728116983 | |
dcterms.source.conference | 16th IEEE International Conference on Mechatronics and Automation (IEEE ICMA) | |
dcterms.source.conference-start-date | 4 Aug 2019 | |
dcterms.source.conferencelocation | Tianjin, PEOPLES R CHINA | |
dc.date.updated | 2020-08-17T03:36:21Z | |
curtin.department | School of Civil and Mechanical Engineering | |
curtin.accessStatus | Fulltext not available | |
curtin.faculty | Faculty of Science and Engineering | |
curtin.contributor.orcid | Howard, Ian [0000-0003-3999-9184] | |
curtin.contributor.orcid | Cui, Lei [0000-0003-2283-5079] | |
dcterms.source.conference-end-date | 7 Aug 2019 | |
curtin.contributor.scopusauthorid | Howard, Ian [12808325800] | |
curtin.contributor.scopusauthorid | Cui, Lei [35168967600] |