Drug-Drug Interaction Prediction: Databases, Models, and Future Directions

December 18, 2024 Off By admin

Table of Contents

Predicting Drug-Drug Interactions: Key Computational Models, Databases, and Future Directions

Introduction

Drug-drug interactions (DDIs) are a major concern in clinical pharmacology, as they can significantly affect the safety and efficacy of treatment regimens. The simultaneous use of multiple drugs, whether for enhancing therapeutic efficacy or minimizing side effects, is common practice. However, inappropriate drug combinations may lead to adverse drug reactions, complicating treatment and increasing healthcare costs. Predicting DDIs accurately is crucial for minimizing these risks. This blog post provides a detailed exploration of computational models for predicting DDIs, focusing on machine learning, deep learning, and score function-based approaches. It also reviews key databases and web servers that aid in DDI prediction, discusses the limitations of current methods, and outlines future research directions.

What Are Drug-Drug Interactions?

A drug is a chemical substance that induces biological effects when administered to a living organism. While drugs are typically used to treat or prevent diseases, they can also have unintended effects when combined with other drugs. Drug-drug interactions (DDIs) occur when the effect of one drug is altered by the presence of another. These interactions can be beneficial, enhancing the therapeutic effects, or detrimental, causing adverse effects or diminishing the efficacy of one or more drugs.

Given the complexity of drug interactions, accurate prediction models are crucial for ensuring patient safety and optimizing treatment outcomes. Computational models have become essential tools in predicting potential DDIs before clinical testing.

Key Databases and Web Servers for DDI Research

Over the years, several databases and web servers have been developed to facilitate the study of DDIs. These resources offer valuable information about known and predicted DDIs, including their mechanisms, risk levels, and recommended management strategies. Below are some of the prominent databases and web servers for DDI research:

Databases:

DrugBank: Contains information on over 15,000 drugs, including approved small molecules, biologics, and experimental drugs. It provides data on drug structures, pharmacological properties, drug targets, and over 1.3 million DDIs.
DDInter: A comprehensive database with 236,834 recorded DDIs involving 1,833 drugs. It includes detailed information on mechanisms, risk levels, and drug adjustment recommendations.
SuperDRUG2: A knowledge base for 4,587 approved drugs, providing data on chemical structures, pharmacokinetics, and potential DDIs.
INXBASE: Contains over 20,000 DDIs and is designed for easy integration into health information systems to assist healthcare professionals.
OncoRx: Focuses on DDIs related to oncology drugs.
DrugComb: A portal for drug combination screening, including 739,964 combinations involving 8,397 drugs.
DailyMed: Contains FDA-approved drug labels, including information on indications, dosages, adverse reactions, and DDIs.

Web Servers:

PolySearch2: A text-mining system that provides relationships between biomedical entities such as drugs, diseases, and genes.
DDI-CPI: A server that predicts DDIs based on the chemical-protein interactome, presenting interaction probabilities for over 2,500 drugs.
vNN-ADMET: A web server that predicts the absorption, distribution, metabolism, excretion, and toxicity (ADMET) properties of drugs, including DDIs.

These databases and web servers provide essential tools for researchers and healthcare professionals to predict and manage DDIs, ultimately improving patient safety.

The table summarizing the function and URL of the databases and web servers related to drug interactions and drug data:

Databases or Web Servers	Function	URL
DrugBank	Recording 15,451 drugs and providing over 200 data fields for each drug, covering chemical, pharmacological, pharmaceutical aspects, and information on drug targets.	https://www.drugbank.ca/
DDInter	Recording 236,834 DDIs involving 1,833 drugs and documenting mechanisms, risk levels, and recommendations for drug adjustments.	http://ddinter.scbdd.com
SuperDRUG2	Recording drug annotations including regulatory details, chemical structures (2D and 3D), dosage, biological targets, physicochemical properties, side effects, pharmacokinetic data, and DDIs.	http://cheminfo.charite.de/superdrug2
INXBASE	Recording more than 20,000 DDIs.	https://www.medbase.fi/en/professionals/inxbase/
OncoRx	Documenting 943 DDIs between 117 anticancer drugs (ACDs) and 166 complementary and alternative medicines (CAMs).	http://www.onco-informatics.com/oncorx/index.php
DIDB	Recording drug interaction results derived from drug–drug, drug–food, and drug–herb interaction studies.	https://www.druginteractionsolutions.org/
DrugComb	Documenting the standardized results of drug combination screening studies, involving 739,964 combinations of 8,397 drugs.	https://drugcomb.fimm.fi/
DailyMed	Recording essential scientific information for the safe and effective use of drugs, including indications, dosage, administration, adverse reactions, and DDIs.	https://dailymed.nlm.nih.gov/dailymed/about-dailymed.cfm
PolySearch2	Predicting relationships between biomedical entities, such as human diseases, genes, SNPs, proteins, drugs, metabolites, and more.	http://polysearch.ca
DDI-CPI	Presenting predicted probabilities of interactions between a given drug and 2,515 drugs in the DDI-CPI library.	http://cpi.bio-x.cn/ddi/
vNN-ADMET	Predicting the ADMET properties of drugs.	https://vnnadmet.bhsai.org/vnnadmet/login.xhtml

This table provides an overview of each resource, including its primary function and URL for access.

The table summarizing the significance and related links of computational models for drug-drug interaction (DDI) prediction:

Model	Significance	Link to GitHub or Sites
Bayesian probabilistic method-based model	Introducing the system connection score and drug phenotypic similarity score	http://www.picb.ac.cn/hanlab/DDI
INDI	Applying a novel scoring scheme to construct the feature vectors of drug pairs based on multiple types of drug similarity	N/A
Label propagation-based model	Implementing label propagation based on multiple similarity information	N/A
Collective PSL-based model	Applying the hinge-loss MRFs to identify potential DDIs in the multigraph through maximum a posteriori	N/A
Random forest-based model	Introducing the enrichment score of the targets of drugs	N/A
Logistic regression-based model	Implementing prediction based on two interaction networks constructed based on PK and PD interactions	N/A
PUL-based model	Applying the growing self-organizing maps clustering algorithm to identify reliable negative samples	N/A
Meta-learning-based model	Using node2vec to get the feature vectors of drugs from the feature network	N/A
MRMF	Introducing manifold regularization into matrix factorization	N/A
DDINMF	Introducing the feature matrix of drug into matrix factorization to make the model suitable for predicting enhancive and degressive DDIs between known drugs and new drugs	N/A
TMFUF	Being suitable for predicting not only known but also new drugs that interact with new drugs	N/A
LCM-DS	Introducing the Dempster–Shafer theory of evidence to integrate the results of three local classification models	https://github.com/JustinShi2016/ScientificReports2018
DDIGIP	Applying the KNNs to fill in the adjacency matrix	N/A
Gradient boosting-based model	Using the TPE approach to optimize the hyperparameters of the classifier	N/A
Network algorithm and matrix perturbation algorithm-based model	Applying the classifier ensemble rule to take the logistic regression to map the outputs of all models to a score as the final prediction result	https://github.com/zw9977129/drug-drug-interaction/
HNAI	Applying five prediction models to identify potential DDIs	N/A
IAC	Introducing the action crossing method to obtain the feature vectors of drug pairs based on drug–enzyme and drug–transporter actions	N/A
SFLLN	Introducing the sparse feature learning ensemble method to project drugs from different feature spaces to the common interaction space	https://github.com/BioMedicalBigDataMiningLabWhu/SFLLN
DDIMDL	Applying the DNN to calculate the interaction probabilities based on the feature vectors of drugs	https://github.com/YifanDengWHU/DDIMDL
SSI-DDI	Applying the GAT layers to extract the feature vectors of atoms contained in drugs	https://github.com/kanz76/SSI-DDI
STNN-DDI	Introducing tensor to describe the interactions between substructures of drugs	https://github.com/zsy-9/STNN-DDI
META-DDIE	Introducing the chemical sequential pattern mining algorithm to obtain a set of discrete frequent substructures of drugs	https://github.com/YifanDengWHU/META-DDIE
DANN-DDI	Introducing the structural deep network embedding method to learn the embeddings of drugs from interaction networks	https://github.com/naodandandan/DANN-DDI
MRCGNN	Introducing the contrastive learning to obtain the representations of drugs	https://github.com/Zhankun-Xiong/MRCGNN
MCFF-MTDDI	Introducing the extra label-based feature vector to make the model suitable for multi-label prediction	https://github.com/ChendiHan111/MCFF-MTDDI
DSIL-DDI	Introducing the GNN to extract the substructure representations of drugs	N/A
DSN-DDI	Applying the intra-view and inter-view representation learning methods to obtain the representations of drugs	https://github.com/microsoft/Drug-Interaction-Research/tree/DSN-DDI-for-DDI-Prediction
BioDKG-DDI	Applying a novel similarity fusion method to fuse multiple similarity matrices of drugs	N/A
MDF-SA-DDI	Introducing the multi-head self-attention mechanism to integrate the feature vectors of each drug pair	https://github.com/ShenggengLin/MDF-SA-DDI
Deep feed-forward network-based model	Introducing the GO term-based drug similarity	N/A
R2-DDI	Applying the MLP to obtain the refinement vectors of drugs	https://github.com/linjc16/R2-DDI
Graph kernel-based approach	Constructing all-path graph kernels to describe the connections between syntactic and semantic within the sentences	https://sbmi.uth.edu/ccb/resources/ddi.htm
Semantic predication-based model	Introducing four types of semantic predication generated by SemRep	N/A
Att-BLSTM	Combining attention mechanism and RNN with BLSTM to learn the global semantic representation of the sentence	N/A
PM-BLSTM	Applying a rule to filter the drugs to ensure that only one drug pair in each sentence was studied	N/A
A two-stage DDIs extraction model	Applying the SVM classifier to identify DDIs and the LSTM-based classifier to identify the type of DDIs	N/A
IK-DDI	Introducing key external text derived from the DrugBank	https://github.com/DouMingLiang/IK-DDI
3DGT-DDI	Introducing the 3D structure conformations of drugs	https://github.com/hehh77/3DGT-DDI
Russell–Rao-based model	Applying the Russell–Rao method to calculate interaction probability	N/A
Score matrix and PCA-based model	Applying PCA method to integrate the score matrices to obtain the final results	N/A

This table provides a summary of the computational models and links to the relevant GitHub repositories or websites for further details.

Table summarizing the Drug-Drug Interaction FAQ:

Question	Answer
What exactly is a drug, and how do they differ from other substances like food?	A drug is a chemical substance that, when administered to a living organism, produces a biological effect, often used to treat or prevent diseases. Unlike food, which is consumed regularly for nutrition, drugs are taken in specific ways (e.g., inhalation, injection, ingestion) and for limited periods.
What are some of the key databases used for researching drug-drug interactions, and what information do they contain?	Key databases include: – DrugBank: Over 15,000 drug entries with chemical, pharmacological, and pharmaceutical data. – DDInter: Focuses on DDIs, with over 230,000 interactions and risk levels. – SuperDRUG2: Contains drug regulatory details, dosages, and DDI information. – DIDB: Pharmacokinetic interactions. – DDI-CPI: Predicts interactions based on chemical-protein interactions.
What is a drug-drug interaction (DDI), and why is it important to study them?	A DDI occurs when one drug’s effect is altered by another drug, food, or substance. Studying DDIs is important because they can cause adverse effects, reduce drug efficacy, or result in side effects, helping to optimize dosages and improve patient safety.
How are computational models used to predict potential DDIs?	Computational models, such as Bayesian methods, random forests, and deep learning techniques, analyze drug characteristics to predict DDIs. These models use data on chemical structure, target proteins, and side effects, among others, to identify patterns and predict interactions.
What are the different types of drug similarities considered when predicting DDIs?	Types of drug similarities include: – Chemical structure – Receptors – Side effects – Anatomical therapeutic chemical (ATC) codes – Target protein sequences – PPI network distances – Gene Ontology (GO) annotations
How do machine learning techniques contribute to DDI prediction, and what specific methods are utilized?	Machine learning techniques such as logistic regression, random forests, deep neural networks, and graph neural networks help predict DDIs by analyzing relationships and patterns in drug data. Other methods include matrix factorization, SVMs, and ensemble methods, with techniques like attention mechanisms and contrastive learning for improved prediction accuracy.
What are some specific models mentioned in the sources, and what novel approaches do they introduce for DDI prediction?	Specific models include: – INDI: Combines PK and PD interactions. – MRMF: Uses manifold regularization in matrix factorization. – DDINMF: Semi-nonnegative matrix factorization for enhancive and degressive DDIs. – DDIGIP: Gaussian interaction profile kernels. – SSI-DDI: Deep-learning model focusing on drug substructures. – MRCGNN: Multi-relational contrastive learning graph neural network.
What is the role of text mining and natural language processing (NLP) in identifying DDIs?	NLP and text mining help extract information from biomedical literature to detect DDIs. Techniques like word embeddings, dependency graphs, and recurrent neural networks (RNNs) analyze unstructured text to identify drug pairs and their interactions, uncovering previously unknown or underreported interactions.

This table organizes the essential information about drug-drug interactions and the methodologies used to predict them.

Computational Models for Predicting DDIs

Computational models for predicting DDIs have significantly advanced over the past decade. These models can be broadly categorized into three types: traditional machine learning models, deep learning models, and score function-based models.

1. Traditional Machine Learning Models

These models rely on classical machine learning algorithms to predict DDIs. They use data features such as chemical structures, drug side effects, and protein interactions to make predictions. Some examples include:

INDI: Infers various types of DDIs (pharmacokinetic (PK), pharmacodynamic (PD), and potential PK interactions) using logistic regression.
Random Forest Models: Utilize chemical interactions, protein interactions, and enrichment scores to predict DDIs.
Logistic Regression Models: Predict DDIs using two interaction networks based on PK and PD interactions.

Advantages:

Suitable for large-scale, rapid predictions.
Can be used for newly approved drugs.

Limitations:

Parameter settings can affect model performance.
Feature vector construction can be challenging.
Scarcity of reliable negative samples limits accuracy.

2. Deep Learning Models

Deep learning models automatically extract significant features from drug data and effectively integrate them for DDI prediction. Notable deep learning approaches include:

DDIMDL: Uses deep neural networks (DNNs) to predict DDIs based on drug features like substructure, targets, enzymes, and pathways.
Graph Neural Networks (GNNs): SSI-DDI employs GNNs to extract drug substructure representations for DDI prediction.
BioDKG-DDI: Uses a drug knowledge graph and self-attention mechanisms for DDI prediction.

Advantages:

Automatically learns significant features.
High flexibility in feature fusion.
Can utilize substructure information and biomedical literature.

Limitations:

Computationally intensive.
Lack of interpretability.
Scarcity of reliable negative samples.

3. Score Function-Based Models

These models calculate interaction probabilities using statistical or probabilistic methods, such as the system connection score or drug phenotypic similarity score. Some examples include:

System Connection Scores: Calculates a likelihood ratio to determine the interaction probability between drugs.
Russell-Rao and PCA Methods: Uses probability distributions and score matrices for prediction.

Advantages:

Simple to understand.
Does not require negative samples.

Limitations:

Not suitable for predicting interactions with new drugs.
Requires assumptions about the probability distribution of the data, which can reduce accuracy.

Challenges and Future Directions

Despite the promising advancements in DDI prediction, several challenges remain:

Data Imbalance: There are fewer positive DDI samples compared to negative samples, which can affect the model’s accuracy.
Limited Scope: Current models primarily predict potential DDIs but often overlook factors like the severity of interactions or the effects of drug doses.
Multi-Drug Interactions: Most models focus on pairwise interactions, but patients often take multiple drugs simultaneously, making predictions for multi-drug interactions crucial.
Lack of Interpretability: Many deep learning models do not provide insights into the underlying mechanisms of DDIs, hindering their practical application.
Reliable Negative Samples: Randomly selecting drug pairs without known interactions as negative samples can affect model performance.

Future research should focus on:

Expanding datasets with known drug interactions, including multi-drug combinations and the severity of interactions.
Applying advanced deep learning techniques to improve prediction accuracy and model interpretability.
Developing methods to identify reliable negative samples and optimizing model parameters.
Incorporating external data, such as biomedical literature, to enhance the prediction of DDIs.

Conclusion

Accurately predicting drug-drug interactions is essential for enhancing patient safety and optimizing therapeutic treatments. Computational models, along with comprehensive databases and web servers, play a critical role in predicting potential DDIs. Researchers must continue to refine these models by addressing the challenges they face, such as data imbalance and limited scope, and explore new avenues like multi-drug interactions and improved interpretability. By advancing these models, we can ensure safer drug combinations and more effective treatments, ultimately improving patient care.

Glossary of Key Terms:

Drug: A chemical substance with a known structure that produces a biological effect when administered to a living organism; used to prevent or treat diseases.
Drug-Drug Interaction (DDI): A situation where the effect of one drug is altered by the presence of another drug; can lead to increased or decreased therapeutic effect or adverse effects.
Database: A structured set of data held in a computer, especially one that is accessible in various ways. In this context, it refers to databases that store information on drugs and their interactions.
Web Server: Software that responds to requests over the internet by delivering web pages, data, or other content.
Computational Model: A mathematical representation of a real-world system used to simulate and analyze its behavior, often applied in drug discovery to predict DDIs.
Small Molecule Drugs: Low molecular weight, organic compounds, typically synthesized chemically, that are easily absorbed by the body.
Biologics: Complex, high molecular weight substances produced using biotechnology, such as proteins, peptides, vaccines, and allergenics.
Nutraceuticals: Food or food products that provide health or medical benefits.
Experimental Drugs: Drugs undergoing research and not yet approved for clinical use.
Pharmacology: The study of how drugs interact with living organisms to produce therapeutic effects.
Pharmacokinetics (PK): The study of how a drug moves through the body, covering absorption, distribution, metabolism, and excretion.
Pharmacodynamics (PD): The study of the biochemical and physiological effects of drugs on the body.
Cytochrome P450 Enzyme: A family of enzymes that metabolize drugs and other substances, with some enzymes specific to particular drug substrates.
Target Protein: A protein in the body that a drug interacts with to produce its therapeutic effect.
Manifold Regularization: A method that assumes data structure lies on a low-dimensional manifold and penalizes dissimilarity in their low-dimensional representation.
Matrix Factorization: A machine learning technique that decomposes a matrix into the product of two or more matrices, used to extract latent features from data.
Feature Vector: A numerical representation of an object or concept; for drugs, this can be based on factors such as chemical structure, side effects, or target proteins.
Ensemble Learning: A method that combines predictions from multiple models to improve accuracy and robustness.
Graph Neural Networks (GNNs): Neural networks that operate on graph-structured data, such as molecular graphs or drug-drug interaction networks.
Attention Mechanism: A neural network technique that allows the model to focus on the most relevant parts of the input data.
SMILES String: A linear notation for representing the chemical structure of a molecule.
Tensor: A multidimensional array used to represent data with multiple dimensions; essential in models based on structural data.
Multigraph: A graph where multiple edges may exist between the same pair of nodes, useful for representing multiple dimensions of similarity between drugs.
Unified Medical Language System (UMLS): A comprehensive vocabulary of biomedical terms and codes from various sources.
MedDRA: The Medical Dictionary for Regulatory Activities, a standardized medical terminology used for reporting adverse events.
Jaccard Index: A similarity measure between two sets, defined as the size of the intersection divided by the size of the union.
Tanimoto Coefficient: A similarity measure between two binary vectors, often used to assess structural similarity between molecules.
Gaussian Interaction Profile (GIP): A kernel method used to calculate similarity between entities based on their interactions

Reference

Zhao, Y., Yin, J., Zhang, L., Zhang, Y., & Chen, X. (2024). Drug–drug interaction prediction: databases, web servers and computational models. Briefings in Bioinformatics, 25(1), bbad445.