Pharmacoepidemiology and Big Data Analytics: Challenges and Opportunities when Moving towards Precision Medicine


  • Andrea M. Burden, ETH Zurich, Institute for Pharmaceutical Sciences, Department of Chemistry and Applied Biosciences, HCI H 407, Vladimir-Prelog-Weg 4/10, CH-8093 Zurich





Keywords: Big data, Machine learning, Medicine, Pharmacoepidemiology


Pharmacoepidemiology is the study of the safety and effectiveness of medications following market approval. The increased availability and size of healthcare utilization databases allow for the study of rare adverse events, sub-group analyses, and long-term follow-up. These datasets are large, including thousands of patient records spanning multiple years of observation, and are representative of real-world clinical practice. Thus, one of their main advantages is the possibility of studying the real-world safety and effectiveness of medications in uncontrolled environments. Due to the large size (volume), structure (variety), and availability (velocity) of observational healthcare databases, there is considerable interest in the application of natural language processing and machine learning, including the development of novel models to detect drug–drug interactions, identify patient phenotypes, and predict outcomes. This report provides an overview of the current challenges in pharmacoepidemiology and of where machine learning applications may help to fill the gaps.



