Generic framework for high-dimensional fixed-effects ANOVA

In functional genomics it is more rule than exception that experimental designs are used to generate the data.The samples of the resulting data sets are thus organized according to this design and for each sample many biochemical compounds are measured, e.g. typically thousands of gene-expressions or hundreds of metabolites. This results in high-dimensional data sets with an underlying experimental design. Several methods have recently become available for analyzing such data while utilizing the underlying design.We review these methods by putting them in a unifying and general framework to facilitate understanding the (dis-)similarities between the methods. The biological question dictates which method to use and the framework allows for building newmethods to accommodate a range of such biological questions. The framework is built on well known fixed-effect ANOVA models and subsequent dimension reduction.We present the framework both in matrix algebra as well as in more insightful geometrical terms.We show the workings of the different special cases of our framework with a real-life metabolomics example from nutritional research and a gene-expression example from the field of virology.

Authors: 
A.K. Smilde, M.E. Timmerman, M.M.W.B. Hendriks, J.J. Jansen, H.C. Hoefsloot
DOI: 
10.1093/bib/bbr071
Pages: 
2012; 13, 524-535
Published in: 
Briefings in Bioinformatics
Date of publication: 
September, 2012
Status of the publication: 
Published/accepted