Abstract
This Perspective provides examples of current and future applications of deep learning in pharmacogenomics, including: identification of novel regulatory variants located in noncoding domains of the genome and their function as applied to pharmacoepigenomics; patient stratification from medical records; and the mechanistic prediction of drug response, targets and their interactions. Deep learning encapsulates a family of machine learning algorithms that has transformed many important subfields of artificial intelligence over the last decade, and has demonstrated breakthrough performance improvements on a wide range of tasks in biomedicine. We anticipate that in the future, deep learning will be widely used to predict personalized drug response and optimize medication selection and dosing, using knowledge extracted from large and complex molecular, epidemiological, clinical and demographic datasets.
References
- 1 . Predicting the future – big data, machine learning, and clinical medicine. N. Engl. J. Med. 375(13), 1216–1219 (2016).
- 2 . Methodological challenges and analytic opportunities for modeling and interpreting big healthcare data. Gigascience 5, 12 (2016).
- 3 . Big data analytics in healthcare. Biomed Res. Int. 2015, 370194 (2015).
- 4 . The unreasonable effectiveness of data. IEEE Intell. Syst. 24(2), 8–12 (2009).
- 5 . Deep learning. Nature 521(7553), 436–444 (2015).
- 6 . Machine learning in genomic medicine: a review of computational problems and datasets. Proc. IEEE 104(1), 176–197 (2016).
- 7 . Deep learning for computational biology. Mol. Syst. Biol. 12(7), 878 (2016).
- 8 Opportunities and obstacles for deep learning in biology and medicine. J. R. Soc. Interface 15(141), pii:20170387 (2018).
- 9 . Cancer pharmacogenomics, challenges in implementation, and patient-focused perspectives. Pharmgenomics Pers. Med. 9, 65–77 (2016).
- 10 . Advancing psychiatric pharmacogenomics using drug development paradigms. Pharmacogenomics 18(15), 1459–1467 (2017).
- 11 The pharmacogenomics of severe traumatic brain injury. Pharmacogenomics 18(15), 1413–1425 (2017).
- 12 . Pharmacogenomics in cardiology – genetics and drug response: 10 years of progress. Future Cardiol. 11(3), 281–286 (2015).
- 13 . Dosing recommendations for pharmacogenetic interactions related to drug metabolism. Pharmacogenet. Genomics 26(7), 334–339 (2016).
- 14 . Genomics and transcriptomics in drug discovery. Drug Discov. Today 19(2), 126–132 (2014).
- 15 . Patient-centric trials for therapeutic development in precision oncology. Nature 526(7573), 361–370 (2015).
- 16 . Does pharmacogenomic testing improve clinical outcomes for major depressive disorder? A systematic review of clinical trials and cost–effectiveness studies. J. Clin. Psychiatry 78(6), 720–729 (2017).
- 17 Roadmap Epigenomics Consortium; Integrative analysis of 111 reference human epigenomes. Nature 518(7539), 317–330 (2015).
- 18 . The epigenome, 4D nucleome and next-generation neuropsychiatric pharmacogenomics. Pharmacogenomics 16(14), 1649–1669 (2015).
- 19 . Pharmacogenomics in clinical practice and drug development. Nat. Biotechnol. 30(11), 1117–1124 (2012).
- 20 . 18O-assisted dynamic metabolomics for individualized diagnostics and treatment of human diseases. Croat. Med. J. 53(6), 529–534 (2012).
- 21 . SOCR data dashboard: an integrated big data archive mashing medicare, labor, census and econometric information. J. Big Data 2, pii:13 (2015).
- 22 . SOCRAT platform design: a web architecture for interactive visual analytics applications. In: Proceedings of the 2nd Workshop on Human-In-the-Loop Data Analytics. ACM New York, NY, USA, 1–6 (2017).
- 23 . Statistical analysis of big data on pharmacogenomics. Adv. Drug Deliv. Rev. 65(7), 987–1000 (2013).
- 24 . Methods to analyze big data in pharmacogenomics research. Pharmacogenomics 18(8), 807–820 (2017).
- 25 . Identifying predictive features in drug response using machine learning: opportunities and challenges. Annu. Rev. Pharmacol. 55, 15–34 (2015).
- 26 . Points of significance: classification evaluation. Nat. Methods 13(8), 603–604 (2016).
- 27 . Applications of deep learning in biomedicine. Mol. Pharm. 13(5), 1445–1454 (2016).
- 28 . A renaissance of neural networks in drug discovery. Expert Opin. Drug Discov. 11(8), 785–795 (2016).
- 29 . Deep learning in drug discovery. Mol. Inform. 35(1), 3–14 (2016).
- 30 . Deep learning for computational chemistry. J. Comput. Chem. 38(16), 1291–1307 (2017).
- 31 Is multitask deep learning practical for pharma? J. Chem. Inf. Model. 57(8), 2068–2076 (2017).
- 32 . Virtual screening: a challenge for deep learning. In: 10th International Conference on Practical Applications of Computational Biology & Bioinformatics. Mohamad MS, Rocha MP, Fdez-Riverola F, Domínguez-Mayo FJ, De Paz JF (Eds). Springer International Publishing, Cham, Switzerland, 13–22 (2016).
- 33 A survey on deep learning in medical image analysis. Med. Image Anal.
doi:10.1016/j.media.2017.07.005 (2017). - 34 . Pediatric bone age assessment using deep convolutional neural networks. arXiv 1712.05053 (2017).
- 35 . Deep convolutional neural networks for breast cancer histology image analysis. arXiv 1802.00752 (2018).
- 36 . Automatic instrument segmentation in robot-assisted surgery using deep learning. bioRxiv
doi:10.1101/275867 (2018). - 37 . Integrative data analysis of multi-platform cancer data with a multimodal deep learning approach. IEEE/ACM Trans. Comput. Biol. Bioinform. 12(4), 928–937 (2015).
- 38 . Deep learning based multi-omics integration robustly predicts survival in liver cancer. Clin. Cancer Res.
doi:10.1158/1078-0432.CCR-17-0853 (2017). - 39 . Deep learning applications for predicting pharmacological properties of drugs and drug repurposing using transcriptomic data. Mol. Pharm. 13(7), 2524–2530 (2016).
- 40 ; Pooled Resource Open-Access ALSCTC. Semi-supervised learning of the electronic health record for phenotype stratification. J. Biomed. Inform. 64, 168–178 (2016).
- 41 . Basset: learning the regulatory code of the accessible genome with deep convolutional neural networks. Genome Res. 26(7), 990–999 (2016).
- 42 MoleculeNet: a benchmark for molecular machine learning. Chem. Sci. 9(2), 513–530 (2018).
- 43 . Points of significance: model selection and overfitting. Nat. Methods 13(9), 703–704 (2016).
- 44 . The precision-recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets. PLoS ONE 10(3), e0118432 (2015).
- 45 . miRFinder: an improved approach and software implementation for genome-wide fast microRNA precursor scans. BMC Bioinformatics 8, 341 (2007).
- 46 . Improving palliative care with deep learning. arXiv 1711.06402 (2017).
- 47 . Predicting effects of noncoding variants with deep learning-based sequence model. Nat. Methods 12(10), 931–934 (2015).
- 48 . Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning. Nat. Biotechnol. 33(8), 831–838 (2015).
- 49 . Recognition of prokaryotic and eukaryotic promoters using convolutional deep learning neural networks. PLoS ONE 12(2), e0171410 (2017).
- 50 . Learning important features through propagating activation differences. Proceedings of the 34th International Conference on Machine Learning. Sydney, Australia, 6–11 August 2017.
- 51 . Deep motif dashboard: visualizing and understanding genomic sequences using deep neural networks. Pac. Symp. Biocomput. 22, 254–265 (2016).
- 52 . Genetic architect: discovering genomic structure with learned neural architectures. arXiv 1605.07156 (2016).
- 53 . Deep learning for drug-induced liver injury. J. Chem. Inf. Model. 55(10), 2085–2093 (2015).
- 54 . DanQ: a hybrid convolutional and recurrent deep neural network for quantifying the function of DNA sequences. Nucleic Acids Res. 44(11), e107 (2016).
- 55 . DeepChrome: deep-learning for predicting gene expression from histone modifications. Bioinformatics 32(17), i639–i648 (2016).
- 56 . Imputation for transcription factor binding predictions based on deep learning. PLoS Comput. Biol. 13(2), e1005403 (2017).
- 57 . Nucleotide sequence and DNaseI sensitivity are predictive of 3D chromatin architecture. bioRxiv
doi:10.1101/103614 (2017). - 58 . Predicting the impact of non-coding variants on DNA methylation. Nucleic Acids Res. 45(11), e99 (2017).
- 59 . DeepCpG: accurate prediction of single-cell DNA methylation states using deep learning. Genome Biol. 18(1), 67 (2017).
- 60 . RNA-protein binding motifs mining with a new hybrid deep learning based cross-domain knowledge integration approach. BMC Bioinformatics 18(1), 136 (2017).
- 61 . Prediction of RNA-protein sequence and structure binding preferences using deep convolutional and recurrent neural networks. bioRxiv
doi:10.1101/146175 (2017). - 62 . FactorNet: a deep learning framework for predicting cell type specific transcription factor binding from nucleotide-resolution sequential data. bioRxiv
doi:10.1101/151274 (2017). - 63 . Sequential regulatory activity prediction across chromosomes with convolutional neural networks. bioRxiv
doi:10.1101/161851 (2017). - 64 . Modeling positional effects of regulatory sequences with spline transformations increases prediction accuracy of deep neural networks. bioRxiv
doi:10.1101/165183 (2017). - 65 . DeepATAC: a deep-learning method to predict regulatory factor binding activity from ATAC-seq signals. bioRxiv
doi:10.1101/172767 (2017). - 66 . Epigenomic mapping and effect sizes of noncoding variants associated with psychotropic drug response. Pharmacogenomics 16(14), 1565–1583 (2015).
- 67 . A glutamatergic network mediates lithium response in bipolar disorder as defined by epigenome pathway analysis. Pharmacogenomics 16(14), 1547–1563 (2015).
- 68 Network reconstruction reveals that valproic acid activates neurogenic transcriptional programs in adult brain following traumatic injury. Pharm. Res. 34(8), 1658–1672 (2017).
- 69 . The 3D genome as moderator of chromosomal communication. Cell 164(6), 1110–1121 (2016).
- 70 3D cell nuclear morphology: microscopy imaging dataset and voxel-based morphometry classification results. bioRxiv
doi:10.1101/208207 (2017). - 71 . Rotational 3D mechanogenomic Turing patterns of human colon Caco-2 cells during differentiation. bioRxiv
doi:10.1101/272096 (2018). - 72 . Mining the topography and dynamics of the 4D nucleome to identify novel CNS drug pathways. Methods 123, 102–118 (2017).
- 73 High-resolution interrogation of functional elements in the noncoding genome. Science 353(6307), 1545–1549 (2016).
- 74 . Patterns of treatment response in newly diagnosed epilepsy. Neurology 78(20), 1548–1554 (2012).
- 75 . Mining the unknown: assigning function to noncoding single nucleotide polymorphisms. Trends Genet. 33(1), 34–45 (2017).
- 76 . Deep learning for regulatory genomics. Nat. Biotechnol. 33(8), 825–826 (2015).
- 77 . Chromatin accessibility prediction via convolutional long short-term memory networks with k-mer embedding. Bioinformatics 33(14), I92–I101 (2017).
- 78 . gkm-DNN: efficient prediction using gapped k-mer features and deep neural networks. bioRxiv
doi:10.1101/170761 (2017). - 79 . An integrated encyclopedia of DNA elements in the human genome. Nature 489 (7414), 57–74 (2012).
- 80 . EP-DNN: a deep neural network-based global enhancer prediction algorithm. Sci. Rep. 6, 38433 (2016).
- 81 . DeepEnhancer: predicting enhancers by convolutional neural networks. Presented at: 2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM). Shenzen, China, 15–18 December 2016.
- 82 . PEDLA: predicting enhancers with a deep learning-based algorithmic framework. Sci. Rep. 6, 28517 (2016).
- 83 . Predicting enhancer–promoter interaction from genomic sequence with deep neural networks. bioRxiv
doi:10.1101/085241 (2016). - 84 . CNNsite: prediction of DNA-binding residues in proteins using convolutional neural network with sequence features. Presented at: 2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM). Shenzen, China, 15–18 December 2016.
- 85 . FIDDLE: an integrative deep learning framework for functional genomic data inference. bioRxiv
doi:10.1101/081380 (2016). - 86 Creating a universal SNP and small indel variant caller with deep neural networks. bioRxiv
doi:10.1101/092890 (2018). - 87 . Evaluating DeepVariant: a new deep learning variant caller from the google brain team (2017). https://blog.dnanexus.com/2017-12-05-evaluating-deepvariant-googles-machine-learning-variant-caller/.
- 88 . Improving Strategy for Discovering Interacting Genetic Variants in Association Studies. Springer, Cham, Switzerland, 461–469 (2016).
- 89 . A deep learning approach to detect SNP interactions. J. Software 11(10), 965–975 (2016).
- 90 Food and Drug Administration. Clinical pharmacogenomics: premarket evaluation in early-phase clinical studies and recommendations for labeling. US Department of Health and Human Services, Silver Spring, MD, USA (2013).www.fda.gov/downloads/Drugs/GuidanceComplianceRegulatoryInformation/Guidances/UCM337169.pdf.
- 91 . Deep EHR: a survey of recent advances on deep learning techniques for electronic health record (EHR) analysis. arXiv 1706.03446 (2017).
- 92 A global reference for human genetic variation. Nature 526(7571), 68–74 (2015).
- 93 . Case-based reasoning using electronic health records efficiently identifies eligible patients for clinical trials. J. Am. Med. Inform. Assoc. 22(e1), e141–e150 (2015).
- 94 . Extracting research-quality phenotypes from electronic health records to support precision medicine. Genome Med. 7(1), 41 (2015).
- 95 . Integration of genomics into the electronic health record: mapping terra incognita. Genet. Med. 15(10), 757–760 (2013).
- 96 . Deep patient: an unsupervised representation to predict the future of patients from the electronic health records. Sci. Rep. 6, 26094 (2016).
- 97 . Disease prediction from electronic health records using generative adversarial networks. arXiv 1711.04126 (2017).
- 98 . Boosting deep learning risk prediction with generative adversarial networks for electronic health records. arXiv 1709.01648 (2017).
- 99 . DeepCare: a deep dynamic memory model for predictive medicine. In: Advances in Knowledge Discovery and Data Mining: 20th Pacific-Asia Conference, PAKDD 2016, Auckland, New Zealand, April 19–22, 2016, Proceedings, Part II. Bailey J, Khan L, Washio T, Dobbie G, Huang JZ, Wang R (Eds). Springer International Publishing, Cham, Switzerland, 30–41 (2016).
- 100 . Doctor AI: predicting clinical events via recurrent neural networks. In: Proceedings of the 1st Machine Learning for Healthcare Conference. Doshi-Velez F, Fackler J, Kale D, Wallace B, Wiens J (Eds). PMLR Children's Hospital LA, CA, USA, 301–318 (2016).
- 101 . Mapping patient trajectories using longitudinal extraction and deep learning in the MIMIC-III critical care database. In: Biocomputing 2018. Altman RB, Dunker AK, Hunter L, Ritchie MD, Murray TA, Klein TE (Eds). World Scientific, Singapore, 123–132 (2018).
- 102 . Causal phenotype discovery via deep networks. AMIA Annu. Symp. Proc. 2015, 677–686 (2015).
- 103 . Optimal medication dosing from suboptimal clinical examples: a deep reinforcement learning approach. Conf. Proc. IEEE Eng. Med. Biol. Soc. 2016, 2978–2981 (2016).
- 104 . Cardiologist-level arrhythmia detection with convolutional neural networks. arXiv 1707.01836 (2017).
- 105 ResearchKit. http://researchkit.org/.
- 106 . Sage bionetworks in collaboration with The Michael J. Fox Foundation announce winners in the DREAM Parkinson's Disease Digital Biomarker Challenge (2018). www.businesswire.com/news/home/20180117006187/en/Sage-Bionetworks-Collaboration-Michael-J.-Fox-Foundation.
- 107 The Mood Challenge (2017). www.moodchallenge.com/.
- 108 Models of human core transcriptional regulatory circuitries. Genome Res. 26(3), 385–396 (2016).
- 109 . In-solution hybrid capture of bisulfite-converted DNA for targeted bisulfite sequencing of 174 ADME genes. Nucleic Acids Res. 41(6), e72 (2013).
- 110 A vision and strategy for the virtual physiological human in 2010 and beyond. Philos. Trans. A Math. Phys. Eng. Sci. 368(1920), 2595–2614 (2010).
- 111 . Clinical success of drug targets prospectively predicted by in silico study. Trends Pharmacol. Sci. 339(3), 229–231 (2018).
- 112 The cornucopia of meaningful leads: applying deep adversarial autoencoders for new molecule development in oncology. Oncotarget 8(7), 10883–10890 (2017).
- 113 . druGAN: an advanced generative adversarial autoencoder model for de novo generation of new molecules with desired molecular properties in silico. Mol. Pharm. 14(9), 3098–3104 (2017).
- 114 . Application of generative autoencoder in de novo molecular design. Mol. Inform. 37(1–2), 1700123 (2017).
- 115 . Low data drug discovery with one-shot learning. ACS Cent. Sci. 3(4), 283–293 (2017).
- 116 Deep-learning-based drug-target interaction prediction. J. Proteome Res. 16(4), 1401–1409 (2017).
- 117 . DeepSynergy: predicting anticancer drug synergy with deep learning. Bioinformatics 34(9), 1538–1546 (2018).
- 118 . DeepTox: toxicity prediction using deep learning. Front. Environ. Sci.
doi.org/10.3389/fenvs.2015.00080 (2016). - 119 . Modeling industrial ADMET data with multitask networks. arXiv 1606.08793 (2016).
- 120 . Deep learning based regression and multiclass models for acute oral toxicity prediction with automatic chemical feature extraction. J. Chem. Inf. Model. 57(11), 2672–2685 (2017).
- 121 . ToxAlerts: a web server of structural alerts for toxic chemicals and compounds with potential adverse reactions. J. Chem. Inf. Model. 52(8), 2310–2316 (2012).
- 122 . DeepMetabolism: a deep learning system to predict phenotype from genome sequencing. bioRxiv
doi:10.1101/135574 (2017). - 123 Deep learning and association rule mining for predicting drug response in cancer. A personalised medicine approach. bioRxiv
doi:10.1101/070490 (2017). - 124 . Machine learning-based prediction of adverse drug effects: an example of seizure-inducing compounds. J. Pharmacol. Sci. 133(2), 70–78 (2017).
- 125 . DL-ADR: a novel deep learning model for classifying genomic variants into adverse drug reactions. BMC Med. Genomics 9(Suppl. 2), 48 (2016).
- 126 Artificial intelligence the next digital frontier? McKinsey and Company Global Institute (2017).www.mckinsey.com/∼/media/McKinsey/Industries/Advanced Electronics/Our Insights/How artificial intelligence can deliver real value to companies/MGI-Artificial-Intelligence-Discussion-paper.ashx.
- 127 . Bioinformatics and advanced analytics powering drug discovery (2017). www.researchandmarkets.com/reports/4308287/bioinformatics-and-advanced-analytics-powering.
- 128 . Outsourcing AI for drug discovery: independent expertise is key to avoid overhyped claims (2017). www.biopharmatrend.com/post/49-research-in-ai-for-drug-discovery-is-overhyped-and-what-to-do-about-it/.

