Reproducibility of Chemopredictors from Cell Lines

This web page contains files summarizing the analyses of data described in "Deriving Chemosensitivity from Cell Lines: Forensic Bioinformatics and Reproducible Research in High-Throughput Biology" by Keith A. Baggerly and Kevin R. Coombes. The paper has now appeared in the Annals of Applied Statistics, 2009 3(4):1309-1334. All analyses were performed by Keith A. Baggerly and Kevin R. Coombes.


This page was updated on Sunday, May 9, 2010, to include the paper reference above. Everything below the following horizontal line is frozen, and will not change. This page was updated on Thursday, November 19, 2009, after galleys were marked up. We have made one change since Thursday, September 24, 2009, adding a dashed line to one figure in examiningDoxorubicinInDetail and rebundling the related zip files. Our intention is to avoid changing the files posted here after publication, allowing the web site to serve as permanent documentation of our analysis. Any changes will be posted on a separate page designed for addenda, errata, corrigenda and other related latinisms.


Here is a list of the available material. The supplementary reports are in PDF format. The source code, intermediate results, and public data sources are in zip files, starting from a common directory. The simplest approach is to download "Everything" and unzip the file. For a minimal reconstruction, we download the Source Code (containing Scripts and RNowebSource), Raw Data, and Matlab Files, create folders for Figures, Latex, RDataObjects, and RTangle, and source "Scripts/runAll.R". Since runAll.R is system specific, some modification will be required to point to your local installations of pdflatex, etc. This creates all of the Figures, Reports, RDataObjects, and R scripts (in RTangle) used to produce the results. This takes about 7.5 minutes on my MacBook Pro.

Our analysis source code relies on a number of software programs and auxiliary packages; we provide scripts, not stand-alone executables. Detailed descriptions of the packages (with version numbers) are listed in the methods section of the main manuscript. The pieces of software required to execute the source code can be obtained from the following locations: