A1 Journal article (refereed)
Teaching transposon classification as a means to crowd source the curation of repeat annotation : a tardigrade perspective (2024)


Peona, V., Martelossi, J., Almojil, D., Bocharkina, J., Brännström, I., Brown, M., Cang, A., Carrasco-Valenzuela, T., DeVries, J., Doellman, M., Elsner, D., Espíndola-Hernández, P., Montoya, G. F., Gaspar, B., Zagorski, D., Hałakuc, P., Ivanovska, B., Laumer, C., Lehmann, R., . . . Suh, A. (2024). Teaching transposon classification as a means to crowd source the curation of repeat annotation : a tardigrade perspective. Mobile DNA, 15, Article 10. https://doi.org/10.1186/s13100-024-00319-8


JYU authors or editors


Publication details

All authors or editorsPeona, Valentina; Martelossi, Jacopo; Almojil, Dareen; Bocharkina, Julia; Brännström, Ioana; Brown, Max; Cang, Alice; Carrasco-Valenzuela, Tomàs; DeVries, Jon; Doellman, Meredith; et al.

Journal or seriesMobile DNA

eISSN1759-8753

Publication year2024

Publication date06/05/2024

Volume15

Article number10

PublisherBioMed Central

Publication countryUnited Kingdom

Publication languageEnglish

DOIhttps://doi.org/10.1186/s13100-024-00319-8

Publication open accessOpenly available

Publication channel open accessOpen Access channel

Publication is parallel published (JYX)https://jyx.jyu.fi/handle/123456789/95043

Publication is parallel publishedhttps://doi.org/10.3929/ethz-b-000672225

Web address of parallel published publication (pre-print) https://doi.org/10.1101/2023.11.06.565293


Abstract

Background
The advancement of sequencing technologies results in the rapid release of hundreds of new genome assemblies a year providing unprecedented resources for the study of genome evolution. Within this context, the significance of in-depth analyses of repetitive elements, transposable elements (TEs) in particular, is increasingly recognized in understanding genome evolution. Despite the plethora of available bioinformatic tools for identifying and annotating TEs, the phylogenetic distance of the target species from a curated and classified database of repetitive element sequences constrains any automated annotation effort. Moreover, manual curation of raw repeat libraries is deemed essential due to the frequent incompleteness of automatically generated consensus sequences.

Results
Here, we present an example of a crowd-sourcing effort aimed at curating and annotating TE libraries of two non-model species built around a collaborative, peer-reviewed teaching process. Manual curation and classification are time-consuming processes that offer limited short-term academic rewards and are typically confined to a few research groups where methods are taught through hands-on experience. Crowd-sourcing efforts could therefore offer a significant opportunity to bridge the gap between learning the methods of curation effectively and empowering the scientific community with high-quality, reusable repeat libraries.

Conclusions
The collaborative manual curation of TEs from two tardigrade species, for which there were no TE libraries available, resulted in the successful characterization of hundreds of new and diverse TEs in a reasonable time frame. Our crowd-sourcing setting can be used as a teaching reference guide for similar projects: A hidden treasure awaits discovery within non-model organisms.


Keywordsgenomicsmolecular geneticsTardigradasequencingtransposonsannotationcuration (collections)librariesbioinformaticscrowdsourcing

Free keywordstransposable elements; manual curation; library; annotation; non-model organism; genome assembly


Contributing organizations


Ministry reportingYes

Preliminary JUFO rating1


Last updated on 2024-22-05 at 10:18