From MD Anderson Bioinformatics
Bioinformatics Software and Services
Here is a list of all software and services provided by the Department of Bioinformatics and Computational Biology. They are copyrighted by the University of Texas MD Anderson Cancer Center and by the individual employees of the cancer center who helped develop them. They are freely available for personal use in research projects; however, anyone wishing to use them or modify them for use in a commercial project should contact the Office of Technology Commercialization.
A to G
- A NGS genomic loci mapping refiner, which improves the mapping of the multireads (reads mapped to more than one genomic location with similar fidelities), as a refinement step after the general read-alignment is completed.
- A tool for performing model-based inference for differential gene expression, using a non-parametric Bayesian mixture probability model for the distribution of gene intensities under different conditions.
- Cromwell is a set of MATLAB scripts for low-level processing of mass spectrometry proteomics data. Cromwell relies on the undecimated discrete wavelet transform for denoising spectra.
- DyCE is a server for enabling remote users to access advanced computational modeling and analysis tools and view the results of the analysis directly in their browser.
- A tool for predicting tumor purity, and the presence of infiltrating stromal/immune cells in tumor tissues using gene expression data.
- FamSeq is a computational tool for calculating probability of variants in family-based sequencing data
- FASMIC is a comprehensive database of experimental-evidence-based functional impact of somatic mutations in cancer.
- A tool for exploratory analysis of gene expression microarray data.
- geneSmash is a mash-up of various sources of information about human genes including the NCBI Entrez gene FTP site, UCSC Genome Browser, miRBase and human gene expression array annotation extracted from Manufacturers' websites. Use geneSmash
H to R
- An interactive tool for viewing the location and annotation of genes on a human chromosome.
- TP53 mutation is a main cause of Li-Fraumeni syndrome. This package is designed to estimate the probability that the counselee is a TP53 mutation carrier on the basis of his/her family cancer history. We also included LFS classic and Chompret criteria in the package.
- MCLP provides a comprehensive resource for accessing, visualizing, and analyzing functional proteomics of cancer cell lines.
- An interactive tool for mapping microarray gene expression data onto the GeneOntology directed acyclic graph.
- Zoomable (clustered) heat maps with links to statistical information, databases, and other related analyses. Version 2 NG-CHMs feature a faster, more dynamic interface and a tool for building NG-CHMs within the Galaxy bioinformatics platform.
- OOMPA is an object-oriented microarray and proteomics analysis library implemented in R using S4 classes and compatible with BioConductor.
- An Open-Use Integrated API of Pathways, Genes, Directional Gene Interactions, and the Gene Ontology with Data Versioning for Provenance.
- A tool to quantify Affymetric microarrays. Quantifications are based on the "Position Dependent Nearest Neighbor" model, which makes explicit use of probe sequence information.
- PRADA is a pipeline to analyze paired end RNA-Seq data to generate gene expression values (RPKM) and gene-fusion candidates.
- The Report and Analysis Template Builder (RATb) uses XML and R to create reusable, structured Sweave files for the statistical and bioinformatic analysis of high-throughput 'omics' datasets. Use RAT Builder
- Rocket is a set of Perl scripts and modules used to confirm that the clones sequenced in our Core Laboratory match the annotations provided by the supplier. (More precisely, Rocket provides a graphical interface using Perl/Tk to a command line Perl script.) Rocket will require some customization before you can use it. It assumes the existence of a local database containing the supplier's clone annotations; you'll have to create such a database separately and make sure the names of the fields in the database match those used by Rocket.
S to Z
- A tool to compute the number of samples needed to detect expression changes for a microarray experiment. Use Sample Sizes
- SpliceSeq provides a quick, easy method of investigating alternative mRNA splicing in next generation mRNA sequence data. The tool may be used on a single mRNA-Seq sample to identify genes with multiple spliceforms or on a pair of samples to identify differential splicing between the samples. Sequence reads are mapped to splice graphs that unambiguously quantify the inclusion level of each exon and splice junction. The graphs are then traversed to predict the protein isoforms that are likely to result from the observed exon and splice junction reads. UniProt annotations are mapped to each protein isoform to identify potential functional impacts of alternative splicing.
- SuperCurve is a stand-alone package, bundled with OOMPA, that provides tools for the analysis of reverse phase protein arrays.
- This service provides a bioinformatic web app for identifying network-based biomarkers that most correlate with patient survival data. Use SurvNet
- This service provides a bioinformatic web app for positive selection in proteins using a sliding window substitution rate analysis. Use SWAKK
- Tissue microarrays are increasingly important tools that bring high-throughput technology to traditional pathology laboratories. Inmany cases, each spot on a tissue microarray is scored by a skilled pathologist and recorded manually. TAD consists of an Active ServerPage web interface to a relational SQL database that automatesrecording scores and linking them with clinical data for future interpretation.
- TANRIC provides an open-access resource for interactive exploration of lncRNAs in cancer.
- TCPA provides a comprehensive resource for accessing, visualizing, and analyzing cancer proteomics.
- The Cancer Genomic Atlas (TCGA) project studies different types of cancer by obtaining datasets from different tissue source sites, using different sequencing centers and technologies. Data sets can sometimes be biased depending on the batch from which it came. The TCGA Batch Effects Tool provides pre-computed graphical annotations of different TCGA data sets that allows users to screen for batch biases in the dataset. For information on the first version of the tool please click here.
- TCGA SpliceSeq presents an analysis of alternative mRNA splicing in thousands of TCGA cancer samples across 33 different tumor types. Percent splice in values for all possible splice events of all genes can be queried, graphically displayed, and downloaded.
- Code to obtain MCMC samples for wavelet-based functional mixed model method in Morris and Carroll (2006). Obtains posterior samples of model parameters in functional mixed model. Original SiteDownload Package