Subscribe to RSS

DOI: 10.1055/s-0045-1804890
Analysis of TCGA (The Cancer Genome Atlas) Data for Prognosis, Risk Categorization, and Survival of AML Patients Using Bioinformatics
Funding None.
Abstract
Introduction Acute myeloid leukemia (AML) is a leading cause of mortality among Indian children and adults, driven by diverse genetic and epigenetic abnormalities. Limited access to genomic sequencing in India due to resource constraints has hindered a comprehensive understanding of prognostic factors specific to this population.
Objectives This study aims to analyze publicly available genomic data using statistical and bioinformatics tools to identify key prognostic markers relevant to Indian AML patients.
Materials and Methods The study utilized tumor/normal pair data from 200 adult de novo AML patients, obtained from The Cancer Genome Atlas (TCGA) and the Genotype-Tissue Expression database, analyzed using cBioPortal. Statistical and bioinformatics tools were employed to assess the impact of existing prognostic targets on disease response and to identify variables with clinical relevance and practical testing feasibility.
Results Analysis of the TCGA-AML data set identified high-frequency gene mutations (≥ 10%) and well-defined cytogenetic subtypes, including t(8;21)(q22;q22), NPM1 mutations, and CEBPA mutations as key factors for future prognostic evaluation. These findings will contribute to the development of a prognostic scoring system using R programming in future.
Conclusion This study offers insights into the cytogenetic and mutational landscape of AML in the Indian population, identifying critical genetic and cytogenetic markers with the potential to enhance prognostication, guide treatment strategies, and inform transplant decisions. Using R tools like limma and edgeR, differential expression analysis identified five key genes—NPM1, FLT3, IDH2, RUNX1, and STAG2—as significantly upregulated in AML. Notably, STAG2 emerges as a novel marker with potential prognostic significance, warranting validation in larger Indian cohorts. These findings may help uncover novel therapeutic targets not currently recognized in Western populations, paving the way for a more tailored and personalized approach to treatment in India. By leveraging global genomic databases, this research addresses regional gaps in knowledge. Future work should focus on validating these findings through large-scale studies in Indian cohorts to ensure their broader applicability and impact.
Note
The manuscript has been read and approved by all the authors and that each author believes that the manuscript represents honest work.
* Joint First authors.
# Joint corresponding authors.
Publication History
Article published online:
24 February 2025
© 2025. The Author(s). This is an open access article published by Thieme under the terms of the Creative Commons Attribution License, permitting unrestricted use, distribution, and reproduction so long as the original work is properly cited. (https://creativecommons.org/licenses/by/4.0/)
Thieme Medical and Scientific Publishers Pvt. Ltd.
A-12, 2nd Floor, Sector 2, Noida-201301 UP, India
-
References
- 1 cBioPortal. Accessed March 23, 2023 at: https://www.cbioportal.org/study/summary?id=laml_tcga_pub
- 2 GTEx portal. Accessed March 31, 2023 at: https://gtexportal.org/home/
- 3 Welsh KJ, Nedelcu E, Wahed A, Bai Y, Dasgupta A, Nguyen A. Bioinformatics analysis to determine prognostic mutations of 72 de novo acute myeloid leukemia cases from the cancer genome atlas (TCGA) with 23 most common mutations and no abnormal cytogenetics. Ann Clin Lab Sci 2015; 45 (05) 515-521
- 4 Bloodspot. Accessed April 3, 2023 at: https://servers.binf.ku.dk/bloodspot/
- 5 Ley TJ, Miller C, Ding L. et al; Cancer Genome Atlas Research Network. Genomic and epigenomic landscapes of adult de novo acute myeloid leukemia. N Engl J Med 2013; 368 (22) 2059-2074
- 6 Xu C, Jackson SA. Machine learning and complex biological data. Genome Biol 2019; 20 (01) 76
- 7 Cerami E, Gao J, Dogrusoz U. et al. The cBio cancer genomics portal: an open platform for exploring multidimensional cancer genomics data. Cancer Discov 2012; 2 (05) 401-404
- 8 Yan H, Qu J, Cao W. et al. Identification of prognostic genes in the acute myeloid leukemia immune microenvironment based on TCGA data analysis. Cancer Immunol Immunother 2019; 68 (12) 1971-1978
- 9 Leung MK, Delong A, Alipanahi B. et al. Machine learning in genomic medicine: a review of computational problems and data sets. Proc IEEE 2015; 104 (01) 176-197
- 10 Karczewski KJ, Snyder MP. Integrative omics for health and disease. Nat Rev Genet 2018; 19 (05) 299-310
- 11 Fernandez-Lozano C, Gestal M, Munteanu CR, Dorado J, Pazos A. A methodology for the design of experiments in computational intelligence with multiple regression models. PeerJ 2016; 4: e2721
- 12 Chen H, Li C, Peng X, Zhou Z, Weinstein JN, Liang H. Cancer Genome Atlas Research Network. A pan-cancer analysis of enhancer expression in nearly 9000 patient samples. Cell 2018; 173 (02) 386-399.e12
- 13 Shen R, Olshen AB, Ladanyi M. Integrative clustering of multiple genomic data types using a joint latent variable model with application to breast and lung cancer subtype analysis. Bioinformatics 2009; 25 (22) 2906-2912
- 14 Liu J, Lichtenberg T, Hoadley KA. et al; Cancer Genome Atlas Research Network. An integrated TCGA pan-cancer clinical data resource to drive high-quality survival outcome analytics. Cell 2018; 173 (02) 400-416.e11
- 15 Ni J, Wu Y, Qi F. et al. Screening the cancer genome atlas database for genes of prognostic value in acute myeloid leukemia. Front Oncol 2020; 9: 1509
- 16 Wang J, Dao F-T, Yang L, Qin Y-Z. Characterization of somatic mutation-associated microenvironment signatures in acute myeloid leukemia patients based on TCGA analysis. Sci Rep 2020; 10 (01) 19037
- 17 Papaemmanuil E, Gerstung M, Bullinger L. et al. Genomic classification and prognosis in acute myeloid leukemia. New England Journal of Medicine 2016; Jun 9; 374 (23) 2209-21
- 18 Grimwade D, Ivey A, Huntly BJ. Molecular landscape of acute myeloid leukemia in younger adults and its clinical relevance. Blood, The Journal of the American Society of Hematology 2016; Jan 7; 127 (1) 29-41
- 19 Bullinger L, Döhner K, Döhner H. Genomics of acute myeloid leukemia diagnosis and pathways. Journal of clinical oncology 2017; Mar 20; 35 (9) 934-46
- 20 Paschka P, Schlenk RF, Gaidzik VI. et al. ASXL1 mutations in younger adult patients with acute myeloid leukemia: a study by the German-Austrian Acute Myeloid Leukemia Study Group. Haematologica 2015; 100 (3) 324-330
- 21 Bhatnagar B, Eisfeld AK, Kohlschmidt J. et al. Clinical and molecular characterization of patients with acute myeloid leukemia and sole trisomies of chromosomes 4, 8, 11, 13 or 21. Leukemia 2020; Feb; 34 (2) 358-68
- 22 Santos FP, Jones D, Qiao W. et al. Prognostic value of FLT3 mutations among different cytogenetic subgroups in acute myeloid leukemia. Cancer 2011; 117 (10) 2145-2155