Reference-free assembly of long-read transcriptome sequencing data with RNA-Bloom2
dc.contributor.author | Nip, K.M. | |
dc.contributor.author | Hafezqorani, S. | |
dc.contributor.author | Gagalova, Kristina | |
dc.contributor.author | Chiu, R. | |
dc.contributor.author | Yang, C. | |
dc.contributor.author | Warren, R.L. | |
dc.contributor.author | Birol, I. | |
dc.date.accessioned | 2025-01-15T04:21:24Z | |
dc.date.available | 2025-01-15T04:21:24Z | |
dc.date.issued | 2023 | |
dc.identifier.citation | Nip, K.M. and Hafezqorani, S. and Gagalova, K.K. and Chiu, R. and Yang, C. and Warren, R.L. and Birol, I. 2023. Reference-free assembly of long-read transcriptome sequencing data with RNA-Bloom2. Nature Communications. 14 (1): 2940. | |
dc.identifier.uri | http://hdl.handle.net/20.500.11937/96867 | |
dc.identifier.doi | 10.1038/s41467-023-38553-y | |
dc.description.abstract |
Long-read sequencing technologies have improved significantly since their emergence. Their read lengths, potentially spanning entire transcripts, is advantageous for reconstructing transcriptomes. Existing long-read transcriptome assembly methods are primarily reference-based and to date, there is little focus on reference-free transcriptome assembly. We introduce “RNA-Bloom2 [https://github.com/bcgsc/RNA-Bloom]”, a reference-free assembly method for long-read transcriptome sequencing data. Using simulated datasets and spike-in control data, we show that the transcriptome assembly quality of RNA-Bloom2 is competitive to those of reference-based methods. Furthermore, we find that RNA-Bloom2 requires 27.0 to 80.6% of the peak memory and 3.6 to 10.8% of the total wall-clock runtime of a competing reference-free method. Finally, we showcase RNA-Bloom2 in assembling a transcriptome sample of Picea sitchensis (Sitka spruce). Since our method does not rely on a reference, it further sets the groundwork for large-scale comparative transcriptomics where high-quality draft genome assemblies are not readily available. | |
dc.language | eng | |
dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | |
dc.subject | Transcriptome | |
dc.subject | RNA | |
dc.subject | High-Throughput Nucleotide Sequencing | |
dc.subject | Gene Expression Profiling | |
dc.subject | Sequence Analysis, RNA | |
dc.subject | RNA | |
dc.subject | Gene Expression Profiling | |
dc.subject | Sequence Analysis, RNA | |
dc.subject | High-Throughput Nucleotide Sequencing | |
dc.subject | Transcriptome | |
dc.title | Reference-free assembly of long-read transcriptome sequencing data with RNA-Bloom2 | |
dc.type | Journal Article | |
dcterms.source.volume | 14 | |
dcterms.source.number | 1 | |
dcterms.source.issn | 2041-1723 | |
dcterms.source.title | Nature Communications | |
dc.date.updated | 2025-01-15T04:21:24Z | |
curtin.department | School of Molecular and Life Sciences (MLS) | |
curtin.accessStatus | Open access | |
curtin.faculty | Faculty of Science and Engineering | |
curtin.contributor.orcid | Gagalova, Kristina [0000-0002-5975-0805] | |
curtin.identifier.article-number | 2940 | |
dcterms.source.eissn | 2041-1723 | |
curtin.contributor.scopusauthorid | Gagalova, Kristina [55969284500] | |
curtin.repositoryagreement | V3 |