Curtin University Homepage
  • Library
  • Help
    • Admin

    espace - Curtin’s institutional repository

    JavaScript is disabled for your browser. Some features of this site may not work without it.
    View Item 
    • espace Home
    • espace
    • Curtin Research Publications
    • View Item
    • espace Home
    • espace
    • Curtin Research Publications
    • View Item

    Autonomous Two-Stage Object Retrieval Using Supervised and Reinforcement Learning

    Access Status
    Fulltext not available
    Authors
    Rouillard, T.
    Howard, Ian
    Cui, Lei
    Date
    2019
    Type
    Conference Paper
    
    Metadata
    Show full item record
    Citation
    Rouillard, T. and Howard, I. and Cui, L. 2019. Autonomous Two-Stage Object Retrieval Using Supervised and Reinforcement Learning, in 16th IEEE International Conference on Mechatronics and Automation (ICMA), Aug 4-7 2019. Tianjin, Peoples Republic of China: IEEE.
    Source Title
    Proceedings of 2019 IEEE International Conference on Mechatronics and Automation, ICMA 2019
    Source Conference
    16th IEEE International Conference on Mechatronics and Automation (IEEE ICMA)
    DOI
    10.1109/ICMA.2019.8816290
    ISBN
    9781728116983
    Faculty
    Faculty of Science and Engineering
    School
    School of Civil and Mechanical Engineering
    Funding and Sponsorship
    http://purl.org/au-research/grants/arc/DE170101062
    URI
    http://hdl.handle.net/20.500.11937/80592
    Collection
    • Curtin Research Publications
    Abstract

    Humans have been sending tele-operated robots into hazardous areas in an attempt to preserve life for many years. The task they are presented with is often challenging and requires cognitive abilities, that is, the ability to process information in order to apply it to a different situation. In this work, we proposed an autonomous approach employing both supervised and reinforcement learning for hidden-object retrieval in two stages. Stage 1 used both learning methods to find a hidden object whereas stage 2 only used reinforcement learning to isolate it. The method is targeted towards field robots with reduced computational power hence we explored the viability of tabular reinforcement learning algorithms. We used a convolutional neural network (CNN) to interpret the state of the scene from images and a reinforcement learning agent for each stage of the task. The robot was presented with a workspace containing piles of rubble under one of which an object was buried. The robot must learn to find and isolate it. We compared the performance of four reinforcement learning algorithms over 500 episodes and found that Sarsa (λ) and RMax were most appropriate for stage 1 and 2 respectively. The approach allowed a robot to learn to complete a search and retrieval task by interpreting images. This could lead to the deployment of such robots in disaster areas eliminating the need for tele-operated platforms.

    Related items

    Showing items related by title, author, creator and subject.

    • Virtual laboratories for education in science, technology, and engineering: A review
      Potkonjak, V.; Gardner, M.; Callaghan, V.; Mattila, P.; Guetl, Christian; Petrović, V.; Jovanovic, K. (2016)
      Within education, concepts such as distance learning, and open universities, are now becoming more widely used for teaching and learning. However, due to the nature of the subject domain, the teaching of Science, Technology, ...
    • Robotics in primary school in the opinion of prospective and in-service teachers. A comparison study
      Morze, N.; Smyrnova-Trybulska, E.; Zuziak, W.; Kommers, Piet; Gladun, M. (2017)
      Copyright © 2017 Inderscience Enterprises Ltd. This article Describes Some Issues Concerning The Use Of Robots In primary school as an object and subject of learning. The authors conclude, based on an analysis of earlier ...
    • Developing completion criteria for rehabilitation areas on arid and semi-arid mine sites in Western Australia
      Brearley, Darren (2003)
      Continued expansion of the gold and nickel mining industry in Western Australia during recent years has led to disturbance of larger areas and the generation of increasing volumes of waste rock. Mine operators are obligated ...
    Advanced search

    Browse

    Communities & CollectionsIssue DateAuthorTitleSubjectDocument TypeThis CollectionIssue DateAuthorTitleSubjectDocument Type

    My Account

    Admin

    Statistics

    Most Popular ItemsStatistics by CountryMost Popular Authors

    Follow Curtin

    • 
    • 
    • 
    • 
    • 

    CRICOS Provider Code: 00301JABN: 99 143 842 569TEQSA: PRV12158

    Copyright | Disclaimer | Privacy statement | Accessibility

    Curtin would like to pay respect to the Aboriginal and Torres Strait Islander members of our community by acknowledging the traditional owners of the land on which the Perth campus is located, the Whadjuk people of the Nyungar Nation; and on our Kalgoorlie campus, the Wongutha people of the North-Eastern Goldfields.