Legal notice: Unauthorized access to the network is prohibited. This system is for the use of authorized users only. Individuals using this computer system without authority, or in excess of their authority, are subject to having all of their activities on this system monitored and recorded by system personnel. In the course of monitoring individuals improperly using this system, or in the course of system maintenance, the activities of authorized users may also be monitored. Anyone using this system expressly consents to such monitoring and is advised that if such monitoring reveals possible evidence of criminal activity, system personnel may provide the evidence of such monitoring to law enforcement officials. (Auto-displayed notice will close after 3 seconds or click this banner to hide.)
The Files Category lists the available types of files found within each dataset archive.
The Source Category lists the source of the data, such as the GDC or PanCan Study Group.
The Derivations Category lists the derivation of data within a Source, such as, for the GDC, "current" or "legacy" data.
The Archive Type Category lists the variation of data in the dataset--for the GDC, this is "standardized". Other datasets may provide other Archive Type Categories.
The Algorithms Category divides the data into "continuous", amenable to most standardize statistical processing, and "discrete", generally sparse matrices and not amenable to many statistical methods.
The Versions Category are the timestamps for when the data was acquired by the Query Form. This Category works different from the rest. By default, the Query Form will show the newest version of each dataset. Selecting one or more Versions, limits the results to that particular version. Note that in Standardized Data, each Version may only contain a single dataset.
The Projects Category lists the higher-level project, like TCGA or TARGET, for the dataset.
The Sub-Projects Category lists what is generally the disease (cancer type) being processed. Some Projects do not divide data by disease, hence the more generic name for this Category.
The Data Type Category divides the datasets into general type of data. Currently, some Data Types can be overly specific (such as for different mutation data) and some overly general or redundant (such as "Copy Number Segment" and "Copy number variation").
The Details Category allows filtering on detailed options for datasets, in particular the Methylation data option to include (wXY) or exclude (noXY) sex chromosomes.
The Platforms Category lists the available platforms. Currently, some may be redundant and misleading, such as the Legacy GDC data having "Illumina Human Methylation 27" and "Illumina Human Methylation 450" compared to the Current GDC data using "Liftover".
*DSC P-Values are not corrected for multiple testing.
The DSC P-Value Category lists common p-value cut-offs.
The Min DSC Value indicates the DSC of the results should be less than or equal to this value. Empty means accept any value.
The Max DSC Value indicates the DSC of the results should be less than or equal to this value. Empty means accept any value.