
dc.contributor.author: Nip, K.M.
dc.contributor.author: Hafezqorani, S.
dc.contributor.author: Gagalova, Kristina
dc.contributor.author: Chiu, R.
dc.contributor.author: Yang, C.
dc.contributor.author: Warren, R.L.
dc.contributor.author: Birol, I.
dc.date.accessioned: 2025-01-15T04:21:24Z
dc.date.available: 2025-01-15T04:21:24Z
dc.date.issued: 2023
dc.identifier.citation: Nip, K.M. and Hafezqorani, S. and Gagalova, K.K. and Chiu, R. and Yang, C. and Warren, R.L. and Birol, I. 2023. Reference-free assembly of long-read transcriptome sequencing data with RNA-Bloom2. Nature Communications. 14 (1): 2940.
dc.identifier.uri: http://hdl.handle.net/20.500.11937/96867
dc.identifier.doi: 10.1038/s41467-023-38553-y
dc.description.abstract:

Long-read sequencing technologies have improved significantly since their emergence. Their read lengths, potentially spanning entire transcripts, are advantageous for reconstructing transcriptomes. Existing long-read transcriptome assembly methods are primarily reference-based, and to date there has been little focus on reference-free transcriptome assembly. We introduce RNA-Bloom2 (https://github.com/bcgsc/RNA-Bloom), a reference-free assembly method for long-read transcriptome sequencing data. Using simulated datasets and spike-in control data, we show that the transcriptome assembly quality of RNA-Bloom2 is competitive with that of reference-based methods. Furthermore, we find that RNA-Bloom2 requires 27.0 to 80.6% of the peak memory and 3.6 to 10.8% of the total wall-clock runtime of a competing reference-free method. Finally, we showcase RNA-Bloom2 in assembling a transcriptome sample of Picea sitchensis (Sitka spruce). Since our method does not rely on a reference, it further sets the groundwork for large-scale comparative transcriptomics where high-quality draft genome assemblies are not readily available.

dc.language: eng
dc.rights.uri: http://creativecommons.org/licenses/by/4.0/
dc.subject: Transcriptome
dc.subject: RNA
dc.subject: High-Throughput Nucleotide Sequencing
dc.subject: Gene Expression Profiling
dc.subject: Sequence Analysis, RNA
dc.title: Reference-free assembly of long-read transcriptome sequencing data with RNA-Bloom2
dc.type: Journal Article
dcterms.source.volume: 14
dcterms.source.number: 1
dcterms.source.issn: 2041-1723
dcterms.source.title: Nature Communications
dc.date.updated: 2025-01-15T04:21:24Z
curtin.department: School of Molecular and Life Sciences (MLS)
curtin.accessStatus: Open access
curtin.faculty: Faculty of Science and Engineering
curtin.contributor.orcid: Gagalova, Kristina [0000-0002-5975-0805]
curtin.identifier.article-number: 2940
dcterms.source.eissn: 2041-1723
curtin.contributor.scopusauthorid: Gagalova, Kristina [55969284500]
curtin.repositoryagreement: V3


Except where otherwise noted, this item's license is described as http://creativecommons.org/licenses/by/4.0/