Developing and Validating Predictive Models for Adverse Drug Reactions using Electronic Health Records (EHRs): A Narrative Review
Epidemiology and Health Data Insights, 1(6), 2025, ehdi019, https://doi.org/10.63946/ehdi/17369
Publication date: Nov 03, 2025
ABSTRACT
Adverse drug reactions (ADRs) remain a major global challenge, contributing substantially to patient morbidity, mortality, and healthcare costs. Traditional pharmacovigilance approaches—spontaneous reporting and post-marketing surveillance—are hampered by underreporting, delays, and limited contextual data. The growing availability of electronic health records (EHRs), which capture longitudinal structured and unstructured patient information, presents an unprecedented opportunity to advance ADR prediction. This narrative review synthesizes recent progress in developing and validating predictive models that leverage EHRs, highlighting methodological approaches, challenges, and future directions. Predictive strategies range from traditional regression models to advanced machine learning and deep learning architectures, with multimodal frameworks increasingly integrating structured fields (demographics, labs, prescriptions) and unstructured clinical text through natural language processing. While ensemble and deep learning methods demonstrate superior performance, issues of data quality, missingness, bias, and interpretability persist. Robust validation frameworks—spanning internal cross-validation to multi-center external testing—are critical to ensure generalizability and clinical trustworthiness. Ethical considerations, including fairness, privacy, and transparency, remain central to safe deployment. Looking forward, promising avenues include federated learning across institutions, integration of multi-omics and pharmacogenomic data, explainable AI tailored for clinical use, and real-time monitoring through digital twin frameworks. These trajectories, combined with robust governance and clinician–data scientist collaboration, have the potential to transform ADR detection from a reactive process to proactive, personalized prevention. By synthesizing the existing evidence, this review provides insights into the development of more effective predictive models for ADRs and informs strategies for improving pharmacovigilance. This study will contribute to the ongoing efforts to leverage EHRs and predictive models for improving patient outcomes and reducing the burden of ADRs.
KEYWORDS
Adverse Drug Reactions (ADRs) Electronic Health Records (EHRs) Predictive Model-ing Machine Learning Pharmacovigilance Artificial Intelligence (AI)
CITATION (Vancouver)
Nwokedi VU, Odedele MK, Yusuff TA, Anderson IA, Akanbi OO, Agu CP, et al. Developing and Validating Predictive Models for Adverse Drug Reactions using Electronic Health Records (EHRs): A Narrative Review. Epidemiology and Health Data Insights. 2025;1(6):ehdi019. https://doi.org/10.63946/ehdi/17369
APA
Nwokedi, V. U., Odedele, M. K., Yusuff, T. A., Anderson, I. A., Akanbi, O. O., Agu, C. P., & Omoike, A. O. (2025). Developing and Validating Predictive Models for Adverse Drug Reactions using Electronic Health Records (EHRs): A Narrative Review. Epidemiology and Health Data Insights, 1(6), ehdi019. https://doi.org/10.63946/ehdi/17369
Harvard
Nwokedi, V. U., Odedele, M. K., Yusuff, T. A., Anderson, I. A., Akanbi, O. O., Agu, C. P., and Omoike, A. O. (2025). Developing and Validating Predictive Models for Adverse Drug Reactions using Electronic Health Records (EHRs): A Narrative Review. Epidemiology and Health Data Insights, 1(6), ehdi019. https://doi.org/10.63946/ehdi/17369
AMA
Nwokedi VU, Odedele MK, Yusuff TA, et al. Developing and Validating Predictive Models for Adverse Drug Reactions using Electronic Health Records (EHRs): A Narrative Review. Epidemiology and Health Data Insights. 2025;1(6), ehdi019. https://doi.org/10.63946/ehdi/17369
Chicago
Nwokedi, Vivian Ukamaka, Michael Kayode Odedele, Taofeek Adeshina Yusuff, Irene Adjoa Anderson, Olukunle O. Akanbi, Chiamaka Pamela Agu, and Amber Otibhor Omoike. "Developing and Validating Predictive Models for Adverse Drug Reactions using Electronic Health Records (EHRs): A Narrative Review". Epidemiology and Health Data Insights 2025 1 no. 6 (2025): ehdi019. https://doi.org/10.63946/ehdi/17369
MLA
Nwokedi, Vivian Ukamaka et al. "Developing and Validating Predictive Models for Adverse Drug Reactions using Electronic Health Records (EHRs): A Narrative Review". Epidemiology and Health Data Insights, vol. 1, no. 6, 2025, ehdi019. https://doi.org/10.63946/ehdi/17369
REFERENCES
- Wasylewicz A, van de Burgt B, Weterings A, Jessurun N, Korsten E, Egberts T, et al. Identifying adverse drug reactions from free‐text electronic hospital health record notes. Br J Clin Pharmacol. 2022 Mar;88(3):1235-45. https://doi.org/10.1111/bcp.15068
- Salas M, Petracek J, Yalamanchili P, Aimer O, Kasthuril D, Dhingra S, et al. The use of artificial intelligence in pharmacovigilance: a systematic review of the literature. Pharm Med. 2022 Oct;36(5):295-306. https://doi.org/10.1007/s40290-022-00441-z
- Ayalew MB, Tegegn HG, Abdela OA. Drug related hospital admissions; a systematic review of the recent literatures. Bull Emerg Trauma. 2019 Oct;7(4):339. https://doi.org/10.29252/beat-070401
- Davies EC, Green CF, Taylor S, Williamson PR, Mottram DR, Pirmohamed M. Adverse drug reactions in hospital in-patients: a prospective analysis of 3695 patient-episodes. PLoS One. 2009 Feb 11;4(2):e4439. https://doi.org/10.1371/journal.pone.0004439
- Mouton JP, Mehta U, Parrish AG, Wilson DP, Stewart A, Njuguna CW, et al. Mortality from adverse drug reactions in adult medical inpatients at four hospitals in South Africa: a cross‐sectional survey. Br J Clin Pharmacol. 2015 Oct;80(4):818-26. https://doi.org/10.1111/bcp.12567
- Alhawassi TM, Krass I, Bajorek BV, Pont LG. A systematic review of the prevalence and risk factors for adverse drug reactions in the elderly in the acute care setting. Clin Interv Aging. 2014 Dec 1:2079-86. https://doi.org/10.2147/CIA.S71178
- Pedrós C, Formiga F, Corbella X, Arnau JM. Adverse drug reactions leading to urgent hospital admission in an elderly population: prevalence and main features. Eur J Clin Pharmacol. 2016 Feb;72(2):219-26. https://doi.org/10.1007/s00228-015-1974-0
- Choudhury R. Integrating EHRs for proactive pharmacovigilance. J Pharmacovigil. 2025 Apr 2;13(1):502. https://doi.org/10.35248/2329-6887.25.13.502
- Davis SE, Zabotka L, Desai RJ, Wang SV, Maro JC, Coughlin K, et al. Use of electronic health record data for drug safety signal identification: a scoping review. Drug Saf. 2023 Aug;46(8):725-42. https://doi.org/10.1007/s40264-023-01325-0
- Zhang F, Sun B, Diao X, Zhao W, Shu T. Prediction of adverse drug reactions based on knowledge graph embedding. BMC Med Inform Decis Mak. 2021 Feb 4;21(1):38. https://doi.org/10.1186/s12911-021-01402-3
- Yang J, Hu Z, Zhang L, Peng B. Predicting Drugs Suspected of Causing Adverse Drug Reactions Using Graph Features and Attention Mechanisms. Pharmaceuticals. 2024 Jun 22;17(7):822. https://doi.org/10.3390/ph17070822
- Abdeldjouad FZ, Brahami M, Sabri M. Evaluating the effectiveness of artificial intelligence in predicting adverse drug reactions among cancer patients: a systematic review and meta-analysis. In: 2023 IEEE International Conference Challenges and Innovations on TIC (I2CIT); 2023 Dec 21. p. 1-8. https://doi.org/10.48550/arXiv.2404.05762
- Yalçın N, Kaşıkcı M, Çelik HT, Allegaert K, Demirkan K, Yiğit Ş, et al. An artificial intelligence approach to support detection of neonatal adverse drug reactions based on severity and probability scores: a new risk score as web-tool. Children. 2022 Nov 26;9(12):1826. https://doi.org/10.3390/children9121826
- Wang W, Ferrari D, Haddon-Hill G, Curcin V. Electronic health records as source of research data. In: Machine Learning for Brain Disorders. 2023 Jul 23. p. 331-54. https://doi.org/10.1007/978-1-0716-3195-9_11
- Hossain E, Rana R, Higgins N, Soar J, Barua PD, Pisani AR, et al. Natural language processing in electronic health records in relation to healthcare decision-making: a systematic review. Comput Biol Med. 2023 Mar 1;155:106649. https://doi.org/10.1016/j.compbiomed.2023.106649
- Golder S, O’Connor K, Lopez-Garcia G, Tatonetti N, Gonzalez-Hernandez G. Leveraging Unstructured Data in Electronic Health Records to Detect Adverse Events from Pediatric Drug Use-A Scoping Review. medRxiv. 2025 Mar 20. https://doi.org/10.1101/2025.03.20.25324320
- Tabari P, Costagliola G, De Rosa M, Boeker M. State-of-the-art fast healthcare interoperability resources (fhir)–based data model and structure implementations: Systematic scoping review. JMIR Med Inform. 2024 Sep 24;12(1):e58445. https://doi.org/10.2196/58445
- Zhang F, Sun B, Diao X, Zhao W, Shu T. Prediction of adverse drug reactions based on knowledge graph embedding. BMC Med Inform Decis Mak. 2021 Feb 4;21(1):38. https://doi.org/10.1186/s12911-021-01402-3
- Hu Q, Chen Y, Zou D, He Z, Xu T. Predicting adverse drug event using machine learning based on electronic health records: a systematic review and meta-analysis. Front Pharmacol. 2024 Nov 13;15:1497397. https://doi.org/10.3389/fphar.2024.1497397
- Zhao YX, Yuan H, Wu Y. Prediction of adverse drug reaction using machine learning and deep learning based on an imbalanced electronic medical records dataset. In: Proceedings of the 5th international conference on medical and health informatics; 2021 May 14. p. 17-21. http://dx.doi.org/10.1145/3472813.3472817
- Mohsen A, Tripathi LP, Mizuguchi K. Deep learning prediction of adverse drug reactions in drug discovery using open TG–GATEs and FAERS databases. Front Drug Discov. 2020 Oct 27;1:768792. https://doi.org/10.48550/arXiv.2010.05411
- He M, Shi Y, Han F, Cai Y. Prediction of adverse drug reactions based on pharmacogenomics combination features: a preliminary study. Front Pharmacol. 2025 Mar 10;16:1448106. https://doi.org/10.3389/fphar.2025.1448106
- Li H, Hu T, Xiong Z, Zheng C, Feng F, He X, et al. ADRNet: A generalized collaborative filtering framework combining clinical and non-clinical data for adverse drug reaction prediction. In: Proceedings of the 17th ACM Conference on Recommender Systems; 2023 Sep 14. p. 682-7. https://doi.org/10.1145/3604915.3608813
- Kwak H, Lee M, Yoon S, Chang J, Park S, Jung K. Drug-disease graph: predicting adverse drug reaction signals via graph neural network with clinical data. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining; 2020 May 6. p. 633-44. https://doi.org/10.48550/arXiv.2004.00407
- Zhang J, Kowsari K, Harrison JH, Lobo JM, Barnes LE. Patient2vec: A personalized interpretable deep representation of the longitudinal electronic health record. IEEE Access. 2018 Oct 12;6:65333-46. https://doi.org/10.48550/arXiv.1810.04793
- Collins GS, Dhiman P, Ma J, Schlussel MM, Archer L, Van Calster B, et al. Evaluation of clinical prediction models (part 1): from development to external validation. BMJ. 2024 Jan 8;384. https://doi.org/10.1136/bmj-2023-074819
- Steyerberg EW, Harrell Jr FE. Prediction models need appropriate internal, internal-external, and external validation. J Clin Epidemiol. 2015 Apr 18;69:245. https://doi.org/10.1016/j.jclinepi.2015.04.005
- Van Calster B, Steyerberg EW, Wynants L, Van Smeden M. There is no such thing as a validated prediction model. BMC Med. 2023 Feb 24;21(1):70. https://doi.org/10.1186/s12916-023-02779-w
- Sperrin M, Riley RD, Collins GS, Martin GP. Targeted validation: validating clinical prediction models in their intended population and setting. Diagn Progn Res. 2022 Dec 22;6(1):24. https://doi.org/10.1186/s41512-022-00136-8
- Ramspek CL, Jager KJ, Dekker FW, Zoccali C, van Diepen M. External validation of prognostic models: what, why, how, when and where?. Clin Kidney J. 2021 Jan 1;14(1):49-58. https://doi.org/10.1093/ckj/sfaa188
- la Roi-Teeuw HM, van Royen FS, de Hond A, Zahra A, de Vries S, Bartels R, et al. Don't be misled: 3 misconceptions about external validation of clinical prediction models. J Clin Epidemiol. 2024 Aug 1;172:111387. https://doi.org/10.1016/j.jclinepi.2024.111387
- Van Calster B, McLernon DJ, Van Smeden M, Wynants L, Steyerberg EW, Bossuyt P, et al. Calibration: the Achilles heel of predictive analytics. BMC Med. 2019 Dec 16;17(1):230. https://doi.org/10.1186/s12916-019-1466-7
- Luijken K, Groenwold RH, Van Calster B, Steyerberg EW, van Smeden M. Impact of predictor measurement heterogeneity across settings on the performance of prediction models: A measurement error perspective. Stat Med. 2019 Aug 15;38(18):3444-59. https://doi.org/10.48550/arXiv.1806.10495
- Navarro CL, Damen JA, van Smeden M, Takada T, Nijman SW, Dhiman P, et al. Systematic review identifies the design and methodological conduct of studies on machine learning-based prediction models. J Clin Epidemiol. 2023 Feb 1;154:8-22. https://doi.org/10.1016/j.jclinepi.2022.11.015
- Riley RD, Ensor J, Snell KI, Debray TP, Altman DG, Moons KG, et al. External validation of clinical prediction models using big datasets from e-health records or IPD meta-analysis: opportunities and challenges. BMJ. 2016 Jun 22;353. https://doi.org/10.1136/bmj.i3140
- Saelmans A, Seinen T, Pera V, Markus AF, Fridgeirsson E, John LH, et al. Implementation and Updating of Clinical Prediction Models: A Systematic Review. Mayo Clin Proc Digit Health. 2025 May 23:100228. https://doi.org/10.1016/j.mcpdig.2025.100228
- Ferri P, Romero-Garcia N, Badenes R, Lora-Pablos D, Morales TG, De La Camara AG, et al. Extremely missing numerical data in Electronic Health Records for machine learning can be managed through simple imputation methods considering informative missingness: A comparative of solutions in a COVID-19 mortality case study. Comput Methods Programs Biomed. 2023 Dec 1;242:107803. https://doi.org/10.1016/j.cmpb.2023.107803
- Ren W, Liu Z, Wu Y, Missing Data in Electronic health Record s (MINDER) Group. Moving beyond medical statistics: a systematic review on missing data handling in electronic health records. Health Data Sci. 2024;4:0176. https://doi.org/10.34133/hds.0176
- Cross JL, Choma MA, Onofrey JA. Bias in medical AI: Implications for clinical decision-making. PLOS Digit Health. 2024 Nov 7;3(11):e0000651. https://doi.org/10.1371/journal.pdig.0000651
- Hanna MG, Pantanowitz L, Jackson B, Palmer O, Visweswaran S, Pantanowitz J, et al. Ethical and bias considerations in artificial intelligence/machine learning. Mod Pathol. 2025 Mar 1;38(3):100686. https://doi.org/10.1016/j.modpat.2024.100686
- Obermeyer Z, Powers B, Vogeli C, Mullainathan S. Dissecting racial bias in an algorithm used to manage the health of populations. Science. 2019 Oct 25;366(6464):447-53. https://doi.org/10.1126/science.aax2342
- Cheng F, Liu D, Du F, Lin Y, Zytek A, Li H, et al. Vbridge: Connecting the dots between features and data to explain healthcare models. IEEE Trans Vis Comput Graph. 2021 Oct 1;28(1):378-88. https://doi.org/10.48550/arXiv.2108.02550
- Hofmann B. Biases in AI: acknowledging and addressing the inevitable ethical issues. Front Digit Health. 2025 Aug 20;7:1614105. https://doi.org/10.3389/fdgth.2025.1614105
- Liaw ST, Liyanage H, Kuziemsky C, Terry AL, Schreiber R, Jonnagaddala J, et al. Ethical use of electronic health record data and artificial intelligence: recommendations of the primary care informatics working group of the international medical informatics association. Yearb Med Inform. 2020 Aug;29(01):051-7. https://doi.org/10.1055/s-0040-1701980
- Abdullah TA, Zahid MS, Ali W. A review of interpretable ML in healthcare: taxonomy, applications, challenges, and future directions. Symmetry. 2021 Dec 17;13(12):2439. https://doi.org/10.3390/sym13122439
- Choi E, Bahadori MT, Sun J, Kulas J, Schuetz A, Stewart W. Retain: An interpretable predictive model for healthcare using reverse time attention mechanism. Adv Neural Inf Process Syst. 2016;29. https://doi.org/10.48550/arXiv.1608.05745
- Algarvio RC, Conceição J, Rodrigues PP, Ribeiro I, Ferreira-da-Silva R. Artificial intelligence in pharmacovigilance: a narrative review and practical experience with an expert-defined Bayesian network tool. Int J Clin Pharm. 2025 Jul 30:1-3. https://doi.org/10.1007/s11096-025-01975-3
- Dsouza VS, Leyens L, Kurian JR, Brand A, Brand H. Artificial Intelligence in Pharmacovigilance: A Systematic Review on Predicting Adverse Drug Reactions in Hospitalized Patients. Res Social Adm Pharm. 2025 Feb 12. https://doi.org/10.1016/j.sapharm.2025.02.008
- De Abreu Ferreira R, Zhong S, Moureaud C, Le MT, Rothstein A, Li X, et al. A pilot, predictive surveillance model in pharmacovigilance using machine learning approaches. Adv Ther. 2024 Jun;41(6):2435-45. https://doi.org/10.1007/s12325-024-02870-5
- Dimitsaki S, Natsiavas P, Jaulent MC. Applying AI to structured real-world data for pharmacovigilance purposes: Scoping review. J Med Internet Res. 2024 Dec 30;26:e57824. https://doi.org/10.2196/57824
- Guo D, Choo KK. Applications of Federated Large Language Model for Adverse Drug Reactions Prediction: Scoping Review. J Med Internet Res. 2025 Sep 8;27:e68291. https://doi.org/10.2196/68291
- Choudhury O, Park Y, Salonidis T, Gkoulalas-Divanis A, Sylla I, k Das A. Predicting adverse drug reactions on distributed health data using federated learning. AMIA Annu Symp Proc. 2020 Mar 4;2019:313.
- Ali MS, Ahsan MM, Tasnim L, Afrin S, Biswas K, Hossain MM, et al. Federated Learning in Healthcare: Model Misconducts, Security, Challenges, Applications, and Future Research Directions--A Systematic Review. arXiv preprint arXiv:2405.13832. 2024 May 22. https://doi.org/10.48550/arXiv.2405.13832
- Gao J, Li Y. Fedmetamed: Federated meta-learning for personalized medication in distributed healthcare systems. In: 2024 IEEE International Conference on Bioinformatics and Biomedicine (BIBM); 2024 Dec 3. p. 6384-91. https://doi.org/10.48550/arXiv.2412.03851
- Zack M, Stupichev DN, Moore AJ, Slobodchikov ID, Sokolov DG, Trifonov IF, et al. AI and Multi-Omics in Pharmacogenomics: A New Era of Precision Medicine. Mayo Clin Proc Digit Health. 2025 Jun 26:100246. https://doi.org/10.1016/j.mcpdig.2025.100246
- Alkhanbouli R, Matar Abdulla Almadhaani H, Alhosani F, Simsekler MC. The role of explainable artificial intelligence in disease prediction: a systematic literature review and future research directions. BMC Med Inform Decis Mak. 2025 Mar 4;25(1):110. https://doi.org/10.1186/s12911-025-02944-6
- Sun Q, Akman A, Schuller BW. Explainable artificial intelligence for medical applications: A review. ACM Trans Comput Healthc. 2025 Feb 17;6(2):1-31. https://doi.org/10.48550/arXiv.2412.01829
- Li X, Loscalzo J, Mahmud AF, Aly DM, Rzhetsky A, Zitnik M, et al. Digital twins as global learning health and disease models for preventive and personalized medicine. Genome Med. 2025 Feb 7;17(1):11. https://doi.org/10.1186/s13073-025-01435-7
- Sadée C, Testa S, Barba T, Hartmann K, Schuessler M, Thieme A, et al. Medical digital twins: enabling precision medicine and medical artificial intelligence. Lancet Digit Health. 2025 Jun 14. https://doi.org/10.1016/j.landig.2025.02.004
LICENSE
This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.