Methods Inf Med 2013; 52(02): 137-147
DOI: 10.3414/ME12-01-0046
Original Articles
Schattauer GmbH

Supporting Translational Research on Inherited Cardiomyopathies through Information Technology

C. Larizza
1   Dipartimento di Ingegneria Industriale e dell’Informazione, Università di Pavia, Pavia, Italy
,
M. Gabetta
1   Dipartimento di Ingegneria Industriale e dell’Informazione, Università di Pavia, Pavia, Italy
,
G. Milani
1   Dipartimento di Ingegneria Industriale e dell’Informazione, Università di Pavia, Pavia, Italy
,
M. Bucalo
1   Dipartimento di Ingegneria Industriale e dell’Informazione, Università di Pavia, Pavia, Italy
,
F. Mulas
2   Bioinformatics Unit, Centre for Tissue Engineering, University of Pavia, Pavia, Italy
,
A. Nuzzo
2   Bioinformatics Unit, Centre for Tissue Engineering, University of Pavia, Pavia, Italy
3   Department of Biotechnology, BU Oncology, Nerviano Medical Sciences, Viagrande (CT), Italy
,
V. Favalli
4   IRCCS Fondazione Policlinico S. Matteo, Pavia, Italy
,
E. Arbustini
4   IRCCS Fondazione Policlinico S. Matteo, Pavia, Italy
,
R. Bellazzi
1   Dipartimento di Ingegneria Industriale e dell’Informazione, Università di Pavia, Pavia, Italy
› Author Affiliations
Further Information

Publication History

received: 22 May 2012

accepted: 05 February 2012

Publication Date:
24 January 2018 (online)

Summary

Objectives: The INHERITANCE project, funded by the European Commission, is aimed at studying genetic or inherited Dilated cardiomyopathies (DCM) and at understanding the impact and management of the disease within families that suffer from heart conditions that are caused by DCMs. The biomedical informatics research activity of the project aims at implementing information technology solutions to support the project team in the different phases of their research, in particular in genes screening prioritization and new gene-disease association discovery.

Methods: In order to manage the huge quantity of scientific, clinical and patient data generated by the project several advanced biomedical informatics tools have been developed. The paper describes a layer of software instruments to support translation of the results of the project in clinical practice as well as to support the scientific discovery process. This layer includes data warehousing, intelligent querying of the phenotype data, integrated search of biological data and knowledge repositories, text mining of the relevant literature, and case based reasoning.

Results: At the moment, a set of 1,394 patients and 9,784 observations has been stored into the INHERITANCE data warehouse. The literature database contains more than 1,100,000 articles retrieved from the Pubmed and generically related to cardiac diseases, already analyzed for extracting medical concepts and genes.

Conclusions: After two years of project the data warehouse has been completely set up and the text mining tools for automatic literature analysis have been implemented and tested. A first prototype of the decision support tool for knowledge discovery and gene prioritization is available, but a more complete release is still under development.

 
  • References

  • 1 Elliott P, Andersson B, Arbustini E, Bilinska Z, Cecchi F, Charron P, Dubourg O, Kühl U, Maisch B, McKenna WJ, Monserrat L, Pankuweit S, Rapezzi C, Seferovic P, Tavazzi L, Keren A. Classification of the cardiomyopathies: a position statement from the European Society Of Cardiology Working Group on Myocardial and Pericardial Diseases. Eur Heart J 2008; 29: 270-276.
  • 2 Ahamad F, Seidman JG, Seidman CE. The genetic basis for cardiac remodelling. Annu Rev Genomics. Hum Genet 2005; 6: 185-216.
  • 3 Meune C, Van Berlo JH, Anselme F, Bonne G, Pinto YM, Duboc D. Primary prevention of sudden death in patients with lamin A/C gene mutations. N Engl J Med 2006; 354: 209-210.
  • 4 Murphy SN, Weber G, Mendis M, Gainer V, Chueh HC, Churchill S, Kohane I. Serving the enterprise and beyond with informatics for integrating biology and the bedside (i2b2). J Am Med Inform Assoc 2010; 17 (02) 124-130.
  • 5 Szalma S, Koka V, Khasanova T, Perakslis ED. Effective knowledge management in translational medicine. J Transl Med 2010; 8: 68
  • 6 Segagni D, Tibollo V, Dagliati A, Zambelli A, Priori SG, Bellazzi R. An ICT infrastructure to integrate clinical and molecular data in oncology research. BMC Bioinformatics 2012; 13 Suppl (Suppl. 04) S5
  • 7 Kimball R, Ross M. The data warehouse toolkit (second edition). New York: Wiley; 2002.
  • 8 Roos M, Marshall MS, Gibson AP, Schuemie M, Meij E, Katrenko S, Van Hage WR, Krommydas K, Adriaans PW. Structuring and extracting knowledge for the support of hypothesis generation in molecular biology. BMC Bioinformatics 2009; 10: S9
  • 9 Lindberg DA, Humphreys BL, McCray AT. The Unified Medical Language System. Methods Inf Med 1993; 32: 281-291.
  • 10 Nuzzo A, Mulas F, Gabetta M, Arbustini E, Zupan B, Larizza C, Bellazzi R. Text Mining approaches for automated literature knowledge extraction and representation. Stud Health Technol Inform 2010; 160 Pt 2 954-958.
  • 11 Mulas F, Curk T, Bellazzi R, Zupan B. On Quality of Different Annotation Sources for Gene Expression Analysis. Artif Intell Med, Lecture Notes in Computer Science 2009; 5651: 421-425.
  • 12 NCBI Entrez Utilities Web Services. U.S. National Center for Biotechnology Information. (Updated Apr 17, 2009; cited Aug 30, 2012.) Available from. http://eutils.ncbi.nlm.nih.gov/entrez/query/static/esoap_help.html.
  • 13 Cunningham H, Maynard D, Bontcheva K, Tablan V. GATE: A framework and graphical development environment for robust NLP tools and applications. In Proceedings of the 40th Annual Meeting of the ACL. 2002.
  • 14 Lindberg DA, Humphreys BL, McCray AT. The Unified Medical Language System. Methods Inf Med 1993; 32: 281-291.
  • 15 Maglott D, Ostell J, Pruitt KD, Tatusova T. Entrez Gene: gene-centered information at NCBI. Nucleic Acids Res 2005; 33: D54-58.
  • 16 Begum S, Ahmed MU, Funk P, Xiong N, Folke M. Case-Based Reasoning Systems in the Health Sciences: A Survey of Recent Trends and Developments. IEEE Trans Syst Man Cybern 2011; 41 (04) 421-434.
  • 17 Gabetta M, Larizza C, Milani G, Favalli V, Arbustini E, Bellazzi R. A flexible approach to case based reasoning in medicine. In Proceedings of the GNB Third National Congress of Bioengineering. 2012.
  • 18 Melton G, Parsons S, Morrison F, Rothschild A, Markatou M, Hripcsak G. Inter-patient distance metrics using SNOMED CT defining relationships. J Biomed Inform 2006; 39 (06) 697-705.
  • 19 Caviedes JE, Cimino JJ. Towards the development of a conceptual distance metric for the umls. Journal of Biomedical Informatics 2004; 37 (02) 77-85.
  • 20 Newman MEJ. Mathematics of networks. The New Palgrave Encyclopedia of Economics. 2008.
  • 21 Pasotti M, Klersy C, Pilotto A, Marziliano N, Rapezzi C, Serio A, Mannarino S, Gambarin F, Favalli V, Grasso M, Agozzino M, Campana C, Gavazzi A, Febo O, Marini M, Landolina M, Mortara A, Piccolo G, Viganò M, Tavazzi L, Arbustini E. Long-term outcome and risk stratification in dilated cardiolaminopathies. J Am Coll Cardiol 2008; 52: 1250-1260.