Subscribe to RSS
A Novel Application of a Generation Model in Foreseeing ‘Future’ ReactionsThis project was supported by the National Natural Science Foundation of China, (No.81903438) and Natural Science Foundation of Zhejiang Province (LD22H300004).
Deep learning is widely used in chemistry and can rival human chemists in certain scenarios. Inspired by molecule generation in new drug discovery, we present a deep-learning-based approach to reaction generation with the Trans-VAE model. To examine how exploratory and innovative the model is in reaction generation, we constructed the dataset by time splitting. We used the Michael addition reaction as a generation vehicle and took these reactions reported before a certain date as the training set and explored whether the model could generate reactions that were reported after that date. We took 2010 and 2015 as time points for splitting the reported Michael addition reaction; among the generated reactions, 911 and 487 reactions were applied in the experiments after the respective split time points, accounting for 12.75% and 16.29% of all reported reactions after each time point. The generated results were in line with expectations and a large number of new, chemically feasible, Michael addition reactions were generated, which further demonstrated the ability of the Trans-VAE model to learn reaction rules. Our research provides a reference for the future discovery of novel reactions by using deep learning.
Key wordsdeep learning - artificial intelligence - reaction generation - Michael reaction - synthesis design
Received: 14 May 2022
Accepted after revision: 06 September 2022
Accepted Manuscript online:
06 September 2022
Article published online:
07 October 2022
© 2022. Thieme. All rights reserved
Georg Thieme Verlag KG
Rüdigerstraße 14, 70469 Stuttgart, Germany
References and Notes
- 1 Davies IW. Nature 2019; 570: 175
- 2 Blakemore DC, Castro L, Churcher I, Rees DC, Thomas AW, Wilson DM, Wood A. Nat. Chem. 2018; 10: 383
- 3 Diels O, Alder K. Justus Liebigs Ann. Chem. 1928; 460: 98
- 4 Herges R. Tetrahedron Comput. Methodol. 1988; 1: 15
- 5 Ley SV, Fitzpatrick DE, Ingham RJ, Myers RM. Angew. Chem. Int. Ed. 2015; 54: 3449
- 6 Boström J, Brown DG, Young RJ, Keserü GM. Nat. Rev. Drug. Discovery 2018; 17: 709
- 7 Maryasin B, Marquetand P, Maulide N. Angew. Chem. Int. Ed. 2018; 57: 6978
- 8 Matlock MK, Hoffman M, Dang NL, Folmsbee DL, Langkamp LA, Hutchison GR, Kumar N, Sarullo K, Swanmidass SJ. J. Phys. Chem. A 2021; 125: 8978
- 9 Wu Z, Ramsundar B, Feinberg EN, Gomes J, Geniesse C, Pappu AS, Leswing K, Pande V. Chem. Sci. 2018; 9: 513
- 10 Debus B, Parastar H, Harrington P, Kirsanov D. Trends Anal. Chem. 2021; 145: 116459
- 11 Graziano G. Nat. Rev. Chem. 2020; 4: 564
- 12 Satoh H, Funatsu K. J. Chem. Inf. Comput. Sci. 1995; 35: 34
- 13 Zhang J, Norinder U, Svensson F. J. Chem. Inf. Model. 2021; 61: 2648
- 14 Ting K.-LH, Lee RC. T, Milne GW. A, Shapiro M, Guarino AM. Science 1973; 180: 417
- 15 Gong Y, Xue D, Chuai G, Yu J, Liu Q. Chem. Sci. 2021; 12: 14459
- 16 He H, Yan S, Lyu D, Xu MX, Ye RQ, Zheng P, Lu XY, Wang L, Ren B. Anal. Chem. 2021; 93: 3653
- 17 Fooshee D, Mood A, Gutman E, Tavakoli M, Urban G, Liu F, Huynh N, Van Vranken D, Baldi P. Mol. Syst. Des. Eng. 2018; 3: 442
- 18 Baylon JL, Cilfone NA, Gulcher JR, Chittenden TW. J. Chem. Inf. Model. 2019; 59: 673
- 19 Segler MH. S, Preuss M, Waller MP. Nature 2018; 555: 604
- 20 Dong J, Zhao M, Liu Y, Su Y, Zeng X. Briefings Bioinf. 2021; 23; bbab391
- 21 Kim HW, Lee SW, Na GS, Han SJ, Kim SK, Shin JH, Chang H, Kim YT. React. Chem. Eng. 2021; 6: 235
- 22 Xu X, Gu H, Wang Y, Wang J, Qin P. Front. Genet. 2019; online DOI: 10.3389/fgene.2019.00233.
- 23 Jin W, Coley CW, Barzilay R, Jaakkola T. arXiv 2017; 1709.04555 DOI: 10.48550/arXiv.1709.04555.
- 24 Coley CW, Jin W, Rogers L, Jamison TF, Jaakkola TS, Green WH, Barzilay R, Jensen KF. Chem. Sci. 2019; 10: 370
- 25 Schwaller P, Laino T, Gaudin T, Bolgar P, Hunter CA, Bekas C, Lee AA. ACS Cent. Sci. 2019; 5: 1572
- 26 Cortes-Ciriano I, Bender A. J. Chem. Inf. Model. 2015; 55: 2682
- 27 Shi T, Huang S, Chen L, Heng Y, Kuang ZY, Xu L, Mei H. Chemom. Intell. Lab. Syst. 2020; 205: 104122
- 28 Bort W, Baskin II, Gimadiev T, Mukanov A, Nugmanov R, Sidorov P, Marcou G, Horvath D, Klimchuk O, Madzhidov T, Varnek A. Sci. Rep. 2021; 11: 3178
- 29 Wang X, Yao C, Zhang Y, Yu J, Qiao H, Zhang C, Wu Y, Bai R, Duan H. ChemRxiv 2021; preprint, DOI DOI: 10.26434/chemrxiv-2021-c192z-v2.
- 30 Dollar O, Joshi N, Beck DA. C, Pfaendtner J. Chem. Sci. 2021; 12: 8362
- 31 Payra S, Saha A, Banerjee S. RSC Adv. 2016; 6: 95951
- 32 Gorde AB, Ramapanicker R. Eur. J. Org. Chem. 2019; 4745
- 33 Wang A, Lv K, Tao Z, Gu J, Liu MJ, Wan BJ, Franzblau SG, Ma C, Ma X, Han B, Wang A, Xu S, Lu Y. Eur. J. Med. Chem. 2019; 181: 111595
- 34 van der Maaten L, Hinton G. J. Mach. Learn. Res. 2008; 9: 2579
- 35 Rogers D, Hahn M. J. Chem. Inf. Model. 2010; 50: 742
- 36 Tennekes M. J. Stat. Software 2018; 84 (06) 1 DOI: 10.18637/jss.v084.i06.
- 37 Liu B, Ramsundar B, Kawthekar P, Shi J, Gomes J, Nguyen QL, Ho S, Sloane J, Wender P, Pande V. ACS Cent. Sci. 2017; 3: 1103
- 38 McInnes L, Healy J, Melville J. arXiv 2018; 1802.03426 DOI: 10.48550/arXiv.1802.03426.
- 39 Landrum, G. RDKit: Open-source cheminformatics (accessed Sept. 28, 2022): http://www.rdkit.org