Curtin University Homepage
  • Library
  • Help
    • Admin

    espace - Curtin’s institutional repository

    JavaScript is disabled for your browser. Some features of this site may not work without it.
    View Item 
    • espace Home
    • espace
    • Curtin Research Publications
    • View Item
    • espace Home
    • espace
    • Curtin Research Publications
    • View Item

    Multivariate Image Processing in Minerals Engineering with Vision Transformers

    94158.pdf (7.884Mb)
    Access Status
    Open access
    Authors
    Liu, Xiu
    Aldrich, Chris
    Date
    2024
    Type
    Journal Article
    
    Metadata
    Show full item record
    Citation
    Liu, X. and Aldrich, C. 2024. Multivariate Image Processing in Minerals Engineering with Vision Transformers. Minerals Engineering. 208: 108599.
    Source Title
    Minerals Engineering
    DOI
    10.1016/j.mineng.2024.108599
    ISSN
    0892-6875
    Faculty
    Faculty of Science and Engineering
    School
    WASM: Minerals, Energy and Chemical Engineering
    Funding and Sponsorship
    http://purl.org/au-research/grants/arc/CE200100009
    URI
    http://hdl.handle.net/20.500.11937/94374
    Collection
    • Curtin Research Publications
    Abstract

    Vision transformers (ViTs) are a new class of deep learning algorithms that have recently emerged as a competitive alternative to convolutional neural networks. In this investigation, their application to two operations previously studied in the mineral processing industry is considered. These are image recognition of fines in coal particles on conveyor belts and characterisation of the particle size in the underflow of a hydrocyclone. Promising results were achieved by use of vision transformers, as they performed as well as, or better than convolutional neural networks in these image recognition problems. In addition, features extracted from the best ViT model could be used to visualise its performance and these features could also serve as a basis for nonlinear process monitoring models. Furthermore, explainability techniques such as attention maps for ViTs were implemented to better understand the ViT models, similar to techniques such as occlusion sensitivity maps used with convolutional neural networks.

    Advanced search

    Browse

    Communities & CollectionsIssue DateAuthorTitleSubjectDocument TypeThis CollectionIssue DateAuthorTitleSubjectDocument Type

    My Account

    Admin

    Statistics

    Most Popular ItemsStatistics by CountryMost Popular Authors

    Follow Curtin

    • 
    • 
    • 
    • 
    • 

    CRICOS Provider Code: 00301JABN: 99 143 842 569TEQSA: PRV12158

    Copyright | Disclaimer | Privacy statement | Accessibility

    Curtin would like to pay respect to the Aboriginal and Torres Strait Islander members of our community by acknowledging the traditional owners of the land on which the Perth campus is located, the Whadjuk people of the Nyungar Nation; and on our Kalgoorlie campus, the Wongutha people of the North-Eastern Goldfields.