|
OOMPA
Object-Oriented Microarray and Proteomic Analysis
Installation
Note: As of 1 December 2009, OOMPA has been
upgraded to work with R 2.10.0.
As of 24 July 2009, OOMPA has been upgraded to work with R
2.9.0.
Beginning with Release 2.8, we set up a proper repository of R
code. With Release 2.9, believing in the adage that good
programmers write good code, but great programmers steal great
code, we blithely stole the repository management scripts from
BioConductor and adapted them to
work for OOMPA. So, the simplest way to install the OOMPA packages is
now to fire up your local version of R and use the command:
source("http://bioinformatics.mdanderson.org/OOMPA/oompaLite.R")
oompaLite()
These commands will install the basic OOMPA packages. In order to get
a slightly larger set of packages, you can execute the command
oompainstall()
If you want to get everything (which may include some
experimental packages that are stil being developed, then use the
command
oompainstall(groupName="all")
If you want more control over which packages get installed, execute
the colloing command and select from the resulting list.
install.packages(repos="http://bioinformatics.mdanderson.org/OOMPA/2.9")
Source code and windows binaries are available. If someone wants to volunteer
to put Macintosh binaries together, we will post those as well.
Older Versions
For versions of OOMPA that work on R 2.5.0 through R 2.7.0, go to this
archived web page. For versions of
OOMPA that work with even older versions of R,
see this archive. The OOMPA suite
of R libraries is the successor to the earlier Object-Oriented
Microarray Analysis Library (OOMAL), which was originally written for
S-Plus 2000. The incorporation of routines to analyze proteomics
profiling data in addition to gene expression microarray data prompted
a name change. (It also inspired our icon. It suggests some
possibilities for theme music, but we're pretty sure we don't
want to go there.)
Description
OOMPA is a suite of R libraries for the analysis of gene expression
(RNA) microarray data and of proteomics profiling mass spectrometry
data. OOMPA uses S4 classes to construct object-oriented tools with a
consistent user interface. All higher level analysis tools in OOMPA
work with the expressionSet classes defined in
BioConductor. The lower level processing tools offer an alternative to
parts of BioConductor, but can also be used to enhance the existing
BioConductor packages.
The packages included in the current release (OOMPA 2.9) are
- oompaBase (Manual)
(Vignette)
- Contains definitions that are needed in order for other packages
to load properly. Some definitions (like class unions) must be
visible before loading a package that uses them, and cannot be
defined in the same
package.
- PreProcess (Manual)
(Vignette)
- Basic library for low-level preprocessing of microarray
data. Provides tools for using consistent color schemes in
diagnostic and other plots. Also defines the
Processor
and Pipeline classes used so objects can maintain a
history of how they were produced.
- ClassComparison (Manual)
(Vignette)
- The ClassComparison library provides tools to perform "class comparison"
analyses of microarray or proteomics data. Class comparison problems start with
two or more known groups of samples, and ask the analyst to find genes or
proteins that are different in some way between the two groups. Methods
implemented in this release include
- Two-sample t-test
- Fixed-effects linear models with ANOVA
- Beta-uniform mixture (BUM) model to account for multiple testing by
controlling the false discovery rate (FDR).
- Wilcoxon rank-sum test with empirical Bayes
- Signficance Analysis of Microarrays (SAM)
- Total Number of Misclassification (TNoM)
- Dudoit's adjustment of p-values to control the family-wise error rate (FWER)
- Smooth t-test
- ClassDiscovery (Manual)
(Vignette)
- The ClassDiscovery library provides tools to perform "class discovery"
analyses of microarray or proteomics data. Class discovery methods perform
unsupervised analyses to try to "learn" or "discover" group structure in the
data. Methods implemented in this release include
- Nonparametric bootstrap to test the significance of clusters
- Parametric bootstrap with gaussian noise to test the significance of
clusters
- Principal components analysis of the biological samples
- Mosiac plots (i.e., the red-green two-way hierarchical clustering plots
introduced into the microarray world by Mike Eisen)
- PCANOVA, which provides an "analysis of variance" inspired method thatb uses
principal components to test whether putative group structures are really
present in the data
- New! We now include functions to compute
the bimodality index, a tool for ranking genes by how likely
they are to follow a "useful" bimodal distribution. This method was
introduced by Wang et al. in a manuscript to appear
in
Cancer Informatics.
- TailRank (Manual)
(Vignette)
- The Tail Rank test is a new method we have developed for finding
biomarkers in microarray or proteomics data sets. The method is
essentially non-parametric, focusing on the tails of the
distributions in the two classes being compared. The method allows
analysts to perform realistic sample size and power computations.
- SuperCurve (Manual)
(Vignette)
- SuperCurve is a package we have developed to analyze
reverse-phase protein arrays. The package includes routines to load
raw data files quantified
by MicroVigene,
to fit a four-parameter joint logistic model in order to estimate
protein concentrations, along with methods to assess the quality of the fit.
- SuperCurveGUI (Manual)
- SuperCurveGUI provides a graphical user interface for the
SUperCurvepackage.
- SlideDesignerGUI (Manual)
- SlideDesignerGUI is a graphicl tool to allow researchers to
describe the location and concentration of different positive and
negative controls on a reverse phase protein array..
|