Exploring open data in biomedicine: A practical introduction to using GEO and GEO2R

Authors

  • portugués portugués portugués
  • portugués portugués Internal Medicine Department, Ribeirão Preto Medical School, São Paulo State University, Ribeirão Preto, Brasil. Research Laboratory on Metabolism, Physiology and Physical Exercise (LAPEMFE), Minas Gerais State University, Divinópolis, Brasil.
  • José de Jesús Martínez-González Universidad Autónoma del Estado de Morelos, CIICAp, Av. Universidad 1001, Col. Chamilpa 62209, Cuernavaca, Morelos, México.

Keywords:

GEO, accessible bioinformatics, open biomedical data

Abstract

The NCBI Gene Expression Omnibus (GEO) is a public repository containing thousands of gene expression studies and other molecular profiles. Reusing these open data makes it possible to generate hypotheses, validate findings, and perform exploratory in silico analyses without laboratory costs. In this article, we explain in simple language what GEO is, what information it contains, and how to use accessible tools such as GEO2R to perform basic reproducible analyses, even in institutions with limited resources. We also present an illustrative example of the workflow and potential results, without establishing definitive biological or clinical conclusions.

References

Clough, E., Barrett, T., Wilhite, S. E., Ledoux, P., et al. (2024). NCBI GEO: archive for gene expression and epigenomics data sets: 23-year update. Nucleic Acids Research, 52(D1), D138–D144. https://doi.org/10.1093/nar/gkad965

Kesh, S., & Raghupathi, W. (2004). Critical issues in bioinformatics and computing. Perspectives in health information management/AHIMA, American Health Information Management Association, 1, 9.

Wang, Z., Lachmann, A., & Ma’ayan, A. (2019). Mining data and metadata from the Gene Expression Omnibus. Biophysical Reviews, 11(1), 103–110. https://doi.org/10.1007/s12551-018-0490-8

Kułak, K., Sztybór, I., & Kamińska, K. (2024). Obesity-an epidemic of the 21st century–literature review. Journal of Education, Health and Sport, 70, 49557-49557.

Marchant, E., Singh, E., Kureel, S., Blair, B., Maroto, R., Sheetz, M., & Rasmussen, B. (2024). Abstract 2197 Whole-body ultrasound treatment increases insulin production and lowers fasting blood glucose in aged insulin-resistant mice. Journal of Biological Chemistry, 300(3). https://doi.org/10.1016/j.jbc.2024.106486

Alsulami S, Nyakotey DA, Dudek K, Bawah AM,Lovegrove JA, Annan RA, Ellahi B, Vimaleswaran KS. Interaction between metabolic genetic risk score anddietaryfatty acid intake on centralobesity in a Ghanian population. Nutrients 12: 1906, 2020. https://doi.org/10.3390/nu12071906

Sahoo, K., & Sundararajan, V. (2024). Methods in DNA methylation array dataset analysis: A review. Computational and Structural Biotechnology Journal, 23, 2304-2325. https://doi.org/10.1016/j.csbj.2024.05.015

Chen, J. Q., Salas, L. A., Wiencke, J. K., Koestler, D. C., Molinaro, A. M., Andrew, A. S., & Christensen, B. C. (2022). Immune profiles and DNA methylation alterations related with non-muscle-invasive bladder cancer outcomes. Clinical Epigenetics, 14(1), 14. https://doi.org/10.1186/s13148-022-01234-6

Han, M. R., Jeong, J. H., Kim, Y. G., Yang, H. H., Seo, C. O., Kim, Y., ... & Choi, J. I. (2024). Epigenetic regulation on left atrial function and disease recurrence after catheter ablation in atrial fibrillation. Clinical Epigenetics, 16(1), 183. https://doi.org/10.1186/s13148-024-01794-9

Chen, E. Y., Tan, C. M., Kou, Y., Duan, Q., Wang, Z., et al. (2013). Enrichr: interactive and collaborative HTML5 gene list enrichment analysis tool. BMC Bioinformatics, 14, 128. https://doi.org/10.1186/1471-2105-14-128

Published

2026-04-30

How to Cite

portugués, portugués, portugués, portugués, & Martínez-González, J. de J. (2026). Exploring open data in biomedicine: A practical introduction to using GEO and GEO2R. Revista De divulgación científica IBIO, 8(2), 312. Retrieved from http://revistaibio.com/ojs33/index.php/main/article/view/312

Issue

Section

How does it work?