Curtin University Homepage
  • Library
  • Help
    • Admin

    espace - Curtin’s institutional repository

    JavaScript is disabled for your browser. Some features of this site may not work without it.
    View Item 
    • espace Home
    • espace
    • Curtin Research Publications
    • View Item
    • espace Home
    • espace
    • Curtin Research Publications
    • View Item

    Mining substructures in protein data

    20226_downloaded_stream_214.pdf (330.9Kb)
    Access Status
    Open access
    Authors
    Hadzic, Fedja
    Dillon, Tharam S.
    Sidhu, Amandeep
    Chang, Elizabeth
    Tan, H.
    Date
    2006
    Type
    Conference Paper
    
    Metadata
    Show full item record
    Citation
    Hadzic, Fedja and Dillon, Tharam and Sidhu, Amandeep and Chang, Elizabeth and Tan, Henry. 2006. : Mining substructures in protein data, in Tsumoto, Shusaku (ed), IEEE International Conference on Data Mining Workshops, Dec 18 2006, pp. 213-217. Hong Kong: IEEE.
    Source Title
    Proceedings of the Sixth IEEE International Conference on Data Mining - Workshops
    Source Conference
    IEEE International Conference on Data Mining Workshops
    Faculty
    Curtin Business School
    School
    Centre for Extended Enterprises and Business Intelligence
    Remarks

    Copyright 2006 IEEE

    This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.

    URI
    http://hdl.handle.net/20.500.11937/7278
    Collection
    • Curtin Research Publications
    Abstract

    In this paper we consider the 'Prions' database that describes protein instances stored for Human Prion Proteins. The Prions database can be viewed as a database of rooted ordered labeled subtrees. Mining frequent substructures from tree databases is an important task and it has gained a considerable amount of interest in areas such as XML mining, Bioinformatics, Web mining etc. This has given rise to the development of many tree mining algorithms which can aid in structural comparisons, association rule discovery and in general mining of tree structured knowledge representations. Previously we have developed the MB3 tree mining algorithm, which given a minimum support threshold, efficiently discovers all frequent embedded subtrees from a database of rooted ordered labeled subtrees. In this work we apply the algorithm to the Prions database in order to extract the frequently occurring patterns, which in this case are of induced subtree type. Obtaining the set of frequent induced subtrees from the Prions database can potentially reveal some useful knowledge. This aspect will be demonstrated by providing an analysis of the extracted frequent subtrees with respect to discovering interesting protein information. Furthermore, the minimum support threshold can be used as the controlling factor for answering specific queries posed on the Prions dataset. This approach is shown to be a viable technique for mining protein data.

    Related items

    Showing items related by title, author, creator and subject.

    • Mining Induced/Embedded Subtrees using the Level of Embedding Constraint
      Tan, H.; Hadzic, Fedja; Dillon, T. (2012)
      The increasing need for representing information through more complex structures where semantics and relationships among data objects can be more easily expressed has resulted in many semi-structured data sources. Structure ...
    • Quality and interestingness of association rules derived from data mining of relational and semi-structured data
      Mohd Shaharanee, Izwan Nizal (2012)
      Deriving useful and interesting rules from a data mining system are essential and important tasks. Problems such as the discovery of random and coincidental patterns or patterns with no significant values, and the generation ...
    • Razor: Mining distance-constrained embedded subtrees
      Tan, H.; Dillon, Tharam S.; Hadzic, Fedja; Chang, Elizabeth (2006)
      Our work is focused on the task of mining frequent subtrees from a database of rooted ordered labelled subtrees. Previously we have developed an efficient algorithm, MB3 [12], for mining frequent embedded subtrees from a ...
    Advanced search

    Browse

    Communities & CollectionsIssue DateAuthorTitleSubjectDocument TypeThis CollectionIssue DateAuthorTitleSubjectDocument Type

    My Account

    Admin

    Statistics

    Most Popular ItemsStatistics by CountryMost Popular Authors

    Follow Curtin

    • 
    • 
    • 
    • 
    • 

    CRICOS Provider Code: 00301JABN: 99 143 842 569TEQSA: PRV12158

    Copyright | Disclaimer | Privacy statement | Accessibility

    Curtin would like to pay respect to the Aboriginal and Torres Strait Islander members of our community by acknowledging the traditional owners of the land on which the Perth campus is located, the Whadjuk people of the Nyungar Nation; and on our Kalgoorlie campus, the Wongutha people of the North-Eastern Goldfields.