International Journal of Medical Informatics
Volume 76, Supplement 3 , Pages S462-S475, December 2007

Inferring gene regulatory networks by integrating static and dynamic data

  • Fulvia Ferrazzi

      Affiliations

    • Dipartimento di Informatica e Sistemistica, Università degli Studi di Pavia, via Ferrata 1, 27100 Pavia, Italy
  • ,
  • Paolo Magni

      Affiliations

    • Dipartimento di Informatica e Sistemistica, Università degli Studi di Pavia, via Ferrata 1, 27100 Pavia, Italy
  • ,
  • Lucia Sacchi

      Affiliations

    • Dipartimento di Informatica e Sistemistica, Università degli Studi di Pavia, via Ferrata 1, 27100 Pavia, Italy
  • ,
  • Angelo Nuzzo

      Affiliations

    • Dipartimento di Informatica e Sistemistica, Università degli Studi di Pavia, via Ferrata 1, 27100 Pavia, Italy
  • ,
  • Uroš Petrovič

      Affiliations

    • J. Stefan Institute, Ljubljana, Slovenia
  • ,
  • Riccardo Bellazzi

      Affiliations

    • Dipartimento di Informatica e Sistemistica, Università degli Studi di Pavia, via Ferrata 1, 27100 Pavia, Italy
    • Corresponding Author InformationCorresponding author.

Received 5 January 2007; received in revised form 13 June 2007; accepted 26 July 2007. published online 30 January 2009.

Abstract 

Objectives

The purpose of the paper is to propose a methodology for learning gene regulatory networks from DNA microarray data based on the integration of different data and knowledge sources. We applied our method to Saccharomyces cerevisiae experiments, focusing our attention on cell cycle regulatory mechanisms. We exploited data from deletion mutant experiments (static data), gene expression time series (dynamic data) and the knowledge encoded in the Gene Ontology.

Methods

The proposed method is based on four phases. An initial gene network was derived from static data by means of a simple statistical approach. Then, the genes classified in the Gene Ontology as being involved in the cell cycle were selected. As a third step, the network structure was used to initialize a linear dynamic model of gene expression profiles. Finally, a genetic algorithm was applied to update the gene network exploiting data coming from an experiment on the yeast cell cycle.

Results

We compared the network models provided by our approach with those obtained with a fully data-driven approach, by looking at their AIC scores and at the percentage of preserved connections in the best solutions. The results show that several nearly equivalent solutions, in terms of AIC scores, can be found. This problem is greatly mitigated by following our approach, which is able to find more robust models by fixing a portion of the network structure on the basis of prior knowledge. The best network structure was biologically evaluated on a set of 22 known cell cycle genes against independent knowledge sources.

Conclusions

An approach able to integrate several sources of information is needed to infer gene regulatory networks, as a fully data-driven search is in general prone to overfitting and to unidentifiability problems. The learned networks encode hypotheses on regulatory relationships that need to be verified by means of wet-lab experiments.

Keywords: Gene regulatory networks, Machine learning, Data interpretation, Microarray analysis of gene expression, Genetic algorithm

To access this article, please choose from the options below

Login to an existing account or Register a new account.

  • Purchase this article for 31.50 USD (You must login/register to purchase this article)

    Online access for 24 hours. The PDF version can be downloaded as your permanent record.

  • Subscribe to this title

    Get unlimited online access to this article and all other articles in this title 24/7 for one year.

  • Claim access now

    For current subscribers with Society Membership or Account Number.

  • Visit SciVerse ScienceDirect to see if you have access via your institution.
 

PII: S1386-5056(07)00138-4

doi:10.1016/j.ijmedinf.2007.07.005

International Journal of Medical Informatics
Volume 76, Supplement 3 , Pages S462-S475, December 2007