Curtin University Homepage
  • Library
  • Help
    • Admin

    espace - Curtin’s institutional repository

    JavaScript is disabled for your browser. Some features of this site may not work without it.
    View Item 
    • espace Home
    • espace
    • Curtin Research Publications
    • View Item
    • espace Home
    • espace
    • Curtin Research Publications
    • View Item

    Seqenv: Linking sequences to environments through text mining

    Access Status
    Open access via publisher
    Authors
    Sinclair, L.
    Ijaz, U.
    Jensen, L.
    Coolen, Marco
    Gubry-Rangin, C.
    Chronáková, A.
    Oulas, A.
    Pavloudi, C.
    Schnetzer, J.
    Weimann, A.
    Ijaz, A.
    Eiler, A.
    Quince, C.
    Pafilis, E.
    Date
    2016
    Type
    Journal Article
    
    Metadata
    Show full item record
    Citation
    Sinclair, L. and Ijaz, U. and Jensen, L. and Coolen, M. and Gubry-Rangin, C. and Chronáková, A. and Oulas, A. et al. 2016. Seqenv: Linking sequences to environments through text mining. PeerJ. 2016 (12).
    Source Title
    PeerJ
    DOI
    10.7717/peerj.2690
    ISSN
    2167-8359
    URI
    http://hdl.handle.net/20.500.11937/50929
    Collection
    • Curtin Research Publications
    Abstract

    © 2016 Sinclair et al.Understanding the distribution of taxa and associated traits across different environments is one of the central questions in microbial ecology. High-throughput sequencing (HTS) studies are presently generating huge volumes of data to address this biogeographical topic. However, these studies are often focused on specific environment types or processes leading to the production of individual, unconnected datasets. The large amounts of legacy sequence data with associated metadata that exist can be harnessed to better place the genetic information found in these surveys into a wider environmental context. Here we introduce a software program, seqenv, to carry out precisely such a task. It automatically performs similarity searches of short sequences against the ``nt'' nucleotide database provided by NCBI and, out of every hit, extracts-if it is available-the textual metadata field. After collecting all the isolation sources from all the search results, we run a text mining algorithm to identify and parse words that are associated with the Environmental Ontology (EnvO) controlled vocabulary. This, in turn, enables us to determine both in which environments individual sequences or taxa have previously been observed and, by weighted summation of those results, to summarize complete samples. We present two demonstrative applications of seqenv to a survey of ammonia oxidizing archaea as well as to a plankton paleome dataset from the Black Sea. These demonstrate the ability of the tool to reveal novel patterns in HTS and its utility in the fields of environmental source tracking, paleontology, and studies of microbial biogeography. To install seqenv, go to: https://github.com/xapple/seqenv.

    Related items

    Showing items related by title, author, creator and subject.

    • Molecular analysis of genes encoding resistance to Cationic Biocides in staphylococci
      Morgan, Dale (2007)
      Bacterial resistance to non-antibiotic agents is being increasingly studied. Plasmid-mediated resistance to cationic agents, which are important biocides, has been described in antibiotic-resistant Staphylococcus aureus. ...
    • The sedimentary succession of the last ~ 3.50 Myr in the western South Yellow Sea: Paleoenvironmental and tectonic implications
      Liu, J.; Zhang, X.; Mei, X.; Zhao, Q.; Guo, X.; Zhao, W.; Liu, Jian; Saito, Y.; Wu, Z.; Li, J.; Zhu, X.; Chu, H. (2018)
      © 2017 The South Yellow Sea (SYS) Basin, which is part of the West Pacific Continental Margin, began to form in the late Mesozoic to early Cenozoic as a result of regional tectonic activity in Asia. The pre-Middle Pleistocene ...
    • Growth history and facies evolution of the subis limestone - A carbonate platform exposed onshore Borneo Island, Malaysia
      Dedeche, A.; Pierson, B.; Hunter, Aaron (2013)
      The Subis Limestone is a Member of the Tangap Formation. It is Upper Oligocene to Lower Miocene in age. The Subis Limestone is divided into two sequences which are different in depositional setting and architecture. The ...
    Advanced search

    Browse

    Communities & CollectionsIssue DateAuthorTitleSubjectDocument TypeThis CollectionIssue DateAuthorTitleSubjectDocument Type

    My Account

    Admin

    Statistics

    Most Popular ItemsStatistics by CountryMost Popular Authors

    Follow Curtin

    • 
    • 
    • 
    • 
    • 

    CRICOS Provider Code: 00301JABN: 99 143 842 569TEQSA: PRV12158

    Copyright | Disclaimer | Privacy statement | Accessibility

    Curtin would like to pay respect to the Aboriginal and Torres Strait Islander members of our community by acknowledging the traditional owners of the land on which the Perth campus is located, the Whadjuk people of the Nyungar Nation; and on our Kalgoorlie campus, the Wongutha people of the North-Eastern Goldfields.