Predicting drug–disease associations by network embedding and biomedical data integration

Published on Apr 1, 2019in Drug Testing and Analysis2.799
· DOI :10.1108/DTA-01-2019-0004
Xiaomei Wei1
Estimated H-index: 1
(HAU: Huazhong Agricultural University),
Yaliang Zhang (HAU: Huazhong Agricultural University)+ 1 AuthorsYaping Fang1
Estimated H-index: 1
(HAU: Huazhong Agricultural University)
The traditional drug development process is costly, time consuming and risky. Using computational methods to discover drug repositioning opportunities is a promising and efficient strategy in the era of big data. The explosive growth of large-scale genomic, phenotypic data and all kinds of “omics” data brings opportunities for developing new computational drug repositioning methods based on big data. The paper aims to discuss this issue.,Here, a new computational strategy is proposed for inferring drug–disease associations from rich biomedical resources toward drug repositioning. First, the network embedding (NE) algorithm is adopted to learn the latent feature representation of drugs from multiple biomedical resources. Furthermore, on the basis of the latent vectors of drugs from the NE module, a binary support vector machine classifier is trained to divide unknown drug–disease pairs into positive and negative instances. Finally, this model is validated on a well-established drug–disease association data set with tenfold cross-validation.,This model obtains the performance of an area under the receiver operating characteristic curve of 90.3 percent, which is comparable to those of similar systems. The authors also analyze the performance of the model and validate its effect on predicting the new indications of old drugs.,This study shows that the authors’ method is predictive, identifying novel drug–disease interactions for drug discovery. The new feature learning methods also positively contribute to the heterogeneous data integration.
  • References (38)
  • Citations (1)
📖 Papers frequently viewed together
2017BIOINFORMATICS: International Conference on Bioinformatics
2 Authors (Huiyuan Chen, Jing Li)
7 Citations
4 Authors (Xinxing Yang, ..., Jieyue He)
37 Citations
78% of Scinapse members use related papers. After signing in, all features are FREE.
#1Peng Cui (THU: Tsinghua University)H-Index: 27
#2Xiao Wang (THU: Tsinghua University)H-Index: 32
Last. Wenwu Zhu (THU: Tsinghua University)H-Index: 46
view all 4 authors...
Network embedding assigns nodes in a network to low-dimensional representations and effectively preserves the network structure. Recently, a significant amount of progresses have been made toward this emerging network analysis paradigm. In this survey, we focus on categorizing and then reviewing the current development on network embedding methods, and point out its future research directions. We first summarize the motivation of network embedding. We discuss the classical graph embedding algori...
219 CitationsSource
#1Sanja Krakan (University of Zagreb)H-Index: 1
#2Luka Humski (University of Zagreb)H-Index: 4
Last. Zoran Skočir (University of Zagreb)H-Index: 8
view all 3 authors...
Online social networks (OSN) are one of the most popular forms of modern communication and among the best known is Facebook. Information about the connection between users on the OSN is often very scarce. It is only known if users are connected, while the intensity of the connection is unknown. The aim of the research described was to determine and quantify friendship intensity between OSN users based on analysis of their interaction. We built a mathematical model, which uses: supervised machine...
3 CitationsSource
#1Wen Zhang (WHU: Wuhan University)H-Index: 17
#2Xiang Yue (WHU: Wuhan University)H-Index: 6
Last. Feng Liu (WHU: Wuhan University)H-Index: 7
view all 7 authors...
Drug-disease associations provide important information for the drug discovery. Wet experiments that identify drug-disease associations are time-consuming and expensive. However, many drug-disease associations are still unobserved or unknown. The development of computational methods for predicting unobserved drug-disease associations is an important and urgent task. In this paper, we proposed a similarity constrained matrix factorization method for the drug-disease association prediction (SCMFDD...
21 CitationsSource
#1Anil Belur Nagaraj (Case Western Reserve University)H-Index: 6
#2Quan Qiu WangH-Index: 5
Last. Analisa DiFeo (Case Western Reserve University)H-Index: 25
view all 13 authors...
Using a novel computational drug-repositioning approach (DrugPredict) to rapidly identify potent drug candidates for cancer treatment
16 CitationsSource
espanolEn este articulo de cierre se desarrollan los temas centrales que han sido elaborados en los 10 trabajos que componen el numero especial sobre Investigacion Orientada por la Practica (Practice Oriented Research, POR, siglas en ingles). En este sentido, se exponen estos temas en relacion con el panorama actual del campo de investigacion en psicoterapia, planteando los desafios fundamentales que deben atravesarse para favorecer una mayor articulacion entre clinicos e investigadores. Para el...
4 CitationsSource
Aug 13, 2016 in KDD (Knowledge Discovery and Data Mining)
#1Aditya Grover (Stanford University)H-Index: 8
#2Jure Leskovec (Stanford University)H-Index: 86
Prediction tasks over nodes and edges in networks require careful effort in engineering features used by learning algorithms. Recent research in the broader field of representation learning has led to significant progress in automating prediction by learning the features themselves. However, present feature learning approaches are not expressive enough to capture the diversity of connectivity patterns observed in networks. Here we propose node2vec, an algorithmic framework for learning continuou...
2,412 CitationsSource
#1Michael Kuhn (MPG: Max Planck Society)H-Index: 29
#2Ivica LetunicH-Index: 48
Last. Peer Bork (Molecular Medicine Partnership Unit)H-Index: 178
view all 4 authors...
Unwanted side effects of drugs are a burden on patients and a severe impediment in the development of new drugs. At the same time, adverse drug reactions (ADRs) recorded during clinical trials are an important source of human phenotypic data. It is therefore essential to combine data on drugs, targets and side effects into a more complete picture of the therapeutic mechanism of actions of drugs and the ways in which they cause adverse reactions. To this end, we have created the SIDER (‘Side Effe...
244 CitationsSource
2 CitationsSource
#1Robert Hoehndorf (KAUST: King Abdullah University of Science and Technology)H-Index: 27
#2Paul N. SchofieldH-Index: 41
Last. Georgios V. GkoutosH-Index: 27
view all 3 authors...
Phenotypes are the observable characteristics of an organism arising from its response to the environment. Phenotypes associated with engineered and natural genetic variation are widely recorded using phenotype ontologies in model organisms, as are signs and symptoms of human Mendelian diseases in databases such as OMIM and Orphanet. Exploiting these resources, several computational methods have been developed for integration and analysis of phenotype data to identify the genetic etiology of dis...
39 CitationsSource
May 18, 2015 in WWW (The Web Conference)
#1Jian Tang (Microsoft)H-Index: 15
#2Meng Qu (PKU: Peking University)H-Index: 8
Last. Qiaozhu Mei (UM: University of Michigan)H-Index: 38
view all 6 authors...
This paper studies the problem of embedding very large information networks into low-dimensional vector spaces, which is useful in many tasks such as visualization, node classification, and link prediction. Most existing graph embedding methods do not scale for real world information networks which usually contain millions of nodes. In this paper, we propose a novel network embedding method called the ``LINE,'' which is suitable for arbitrary types of information networks: undirected, directed, ...
1,528 CitationsSource
Cited By1
#1Zhen-Hao Guo (CAS: Chinese Academy of Sciences)H-Index: 1
#2Zhu-Hong You (CAS: Chinese Academy of Sciences)H-Index: 31
Last. Zhan-Heng Chen (CAS: Chinese Academy of Sciences)H-Index: 3
view all 6 authors...
BACKGROUND The explosive growth of genomic, chemical, and pathological data provides new opportunities and challenges for humans to thoroughly understand life activities in cells. However, there exist few computational models that aggregate various bioentities to comprehensively reveal the physical and functional landscape of biological systems. RESULTS We constructed a molecular association network, which contains 18 edges (relationships) between 8 nodes (bioentities). Based on this, we propose...