Faculty Publications

Document Type

Article

Publication Version

Published Version

Keywords

data mining and knowledge discovery, machine learning, biological data analysis, biological network, link prediction, relation inference, deep learning

Journal/Book/Conference Title Title

FRONTIERS IN GENETICS

Volume

10

Abstract

Advances in next-generation sequencing and high-throughput techniques have enabled the generation of vast amounts of diverse omics data. These big data provide an unprecedented opportunity in biology, but impose great challenges in data integration, data mining, and knowledge discovery due to the complexity, heterogeneity, dynamics, uncertainty, and high-dimensionality inherited in the omics data. Network has been widely used to represent relations between entities in biological system, such as protein-protein interaction, gene regulation, and brain connectivity (i.e. network construction) as well as to infer novel relations given a reconstructed network (aka link prediction). Particularly, heterogeneous multi-layered network (HMLN) has proven successful in integrating diverse biological data for the representation of the hierarchy of biological system. The HMLN provides unparalleled opportunities but imposes new computational challenges on establishing causal genotype-phenotype associations and understanding environmental impact on organisms. In this review, we focus on the recent advances in developing novel computational methods for the inference of novel biological relations from the HMLN. We first discuss the properties of biological HMLN. Then we survey four categories of state-of-the-art methods (matrix factorization, random walk, knowledge graph, and deep learning). Thirdly, we demonstrate their applications to omics data integration and analysis. Finally, we outline strategies for future directions in the development of new HMLN models.

Department

Department of Computer Science

Comments

First published in Frontiers in Genetics v.10 (January 2020).

Original Publication Date

1-2020

DOI of published version

10.3389/fgene.2019.01381

Repository

UNI ScholarWorks, Rod Library, University of Northern Iowa

Copyright

@2020 Aleksandar Poleksic

Creative Commons License

Creative Commons Attribution 3.0 License
This work is licensed under a Creative Commons Attribution 3.0 License.

Language

English

File Format

application/pdf

Share

COinS