Exploring open data in biomedicine: A practical introduction to using GEO and GEO2R
Keywords:
GEO, accessible bioinformatics, open biomedical data
Abstract
The NCBI Gene Expression Omnibus (GEO) is a public repository containing thousands of gene expression studies and other molecular profiles. Reusing these open data makes it possible to generate hypotheses, validate findings, and perform exploratory in silico analyses without laboratory costs. In this article, we explain in simple language what GEO is, what information it contains, and how to use accessible tools such as GEO2R to perform basic reproducible analyses, even in institutions with limited resources. We also present an illustrative example of the workflow and potential results, without establishing definitive biological or clinical conclusions.
License
Copyright (c) 2026 Revista de divulgación científica iBIO

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Self-archiving or deposit of works in their post-publication version (editorial version) is permitted in any personal, institutional, or thematic repository and on social or scientific networks. This applies from the moment the article is published on the website of the Revista de divulgación científica iBIO.