| dc.contributor.author | HUANG, Yu‑Ning | |
| dc.contributor.author | JAISWAL, Pooja Vinod | |
| dc.contributor.author | RAJES, Anushka | |
| dc.contributor.author | YADAV, Anushka | |
| dc.contributor.author | YU, Dottie | |
| dc.contributor.author | LIU, Fangyun | |
| dc.contributor.author | SCHEG, Grace | |
| dc.contributor.author | SHIH, Emma | |
| dc.contributor.author | BOLDIREV, Grigore | |
| dc.contributor.author | NAKASHIDZE, Irina | |
| dc.contributor.author | SARKAR, Aditya | |
| dc.contributor.author | MEHTA, Jay Himanshu | |
| dc.contributor.author | WANG, Ke | |
| dc.contributor.author | PATEL, Khooshbu Kantibhai | |
| dc.contributor.author | MIRZA, Mustafa Ali Baig | |
| dc.contributor.author | HAPANI, Kunali Chetan | |
| dc.contributor.author | PENG, Qiushi | |
| dc.contributor.author | AYYALA, Ram | |
| dc.contributor.author | GUO, Ruiwei | |
| dc.contributor.author | KAPUR, Shaunak | |
| dc.contributor.author | RAMESH, Tejasvene | |
| dc.contributor.author | CIORBĂ, Dumitru | |
| dc.contributor.author | MUNTEANU, Viorel | |
| dc.contributor.author | BOSTAN, Viorel | |
| dc.contributor.author | DIMIAN, Mihai | |
| dc.contributor.author | ABEDALTHAGAFI, Malak S. | |
| dc.contributor.author | MANGUL, Serghei | |
| dc.date.accessioned | 2026-02-23T18:35:13Z | |
| dc.date.available | 2026-02-23T18:35:13Z | |
| dc.date.issued | 2025 | |
| dc.identifier.citation | HUANG, Yu‑Ning; Pooja Vinod JAISWAL; Anushka RAJES; Anushka YADAV; Dottie YU; Fangyun LIU et al. The systematic assessment of completeness of public metadata accompanying omics studies in the Gene Expression Omnibus data repository. Genome Biology. 2025, vol. 26, nr. 1, art. nr. 274. ISSN 1474-7596. | en_US |
| dc.identifier.issn | 1474-7596 | |
| dc.identifier.uri | https://doi.org/10.1186/s13059-025-03725-0 | |
| dc.identifier.uri | https://repository.utm.md/handle/5014/35412 | |
| dc.description | Access full text: https://doi.org/10.1186/s13059-025-03725-0 | en_US |
| dc.description.abstract | Background: Recent advances in high-throughput sequencing technologies have enabled the collection and sharing of a massive amount of omics data, along with its associated metadata—descriptive information that contextualizes the data, including phenotypic traits and experimental design. Enhancing metadata availability is critical to ensure data reusability and reproducibility and to facilitate novel biomedical discoveries through effective data reuse. Yet, incomplete metadata accompanying public omics data may hinder reproducibility and reusability and limit secondary analyses. Results: Our study assesses the completeness of metadata in over 253 scientific studies, covering more than 164,000 samples from both human and non-human mammalian studies. We find that over 25% of critical metadata are omitted, with only 74.8% of relevant phenotypes available in publications or public repositories. Notably, public repositories alone contain 62% of the phenotypes, surpassing the textual content of publications by 3.5%. Only 11.5% of studies completely shared all phenotypes, while 37.9% shared less than 40% of the phenotypes. Additionally, studies with non-human samples are more likely to include complete metadata compared to human studies. Similar trends are observed in an extended dataset comprising 61,000 studies and 2.1 million samples from the Gene Expression Omnibus (GEO) data repository. Conclusions: These findings highlight significant gaps in metadata sharing, underscoring the need for standardized practices to improve metadata availability. Enhanced metadata reporting would foster data reusability, support better-informed decision-making, and promote reproducible research across the biomedical field. | en_US |
| dc.language.iso | en | en_US |
| dc.publisher | BioMed Central | en_US |
| dc.rights | Attribution-NonCommercial-NoDerivs 3.0 United States | * |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/us/ | * |
| dc.subject | public metadata | en_US |
| dc.subject | gene expression | en_US |
| dc.subject | data repository | en_US |
| dc.title | The systematic assessment of completeness of public metadata accompanying omics studies in the Gene Expression Omnibus data repository | en_US |
| dc.type | Article | en_US |
The following license files are associated with this item: