Department of Bioinformatics and Computational Biology

Home > Public Software > NG-CHM > Clustered Heatmaps

Clustered Heat Maps (CHMs)

CHMs: An Overview

Clustered heat maps are powerful visualization tools that combine two primary techniques, heat mapping and hierarchical clustering, to reveal patterns and relationships in complex datasets that may not be immediately apparent through other forms of analysis. Widely used in various scientific fields, especially in biology and medicine, clustered heat maps provide an intuitive way to analyze high-dimensional data, identify meaningful patterns, and generate hypotheses for further research. When used appropriately, they can significantly advance our understanding of biological systems and inform the development of targeted therapies and personalized medicine approaches.


Understanding Clustered Heat Maps

A clustered heat map is a two-dimensional representation of data where individual values contained in a matrix are represented as colors. The key aspect that differentiates clustered heat maps from simple heat maps is the integration of hierarchical clustering. This method groups similar rows and columns of the matrix together based on a chosen similarity measure, such as Euclidean distance or Pearson correlation. The resulting clusters are represented as dendrograms (tree-like structures) adjacent to the rows and columns of the heat map, providing a visual summary of the relationships within the data.


Clustered Heat Map Example


Construction of Clustered Heat Maps

The construction of clustered heat maps involves several steps:


Interpretation and Limitations

Interpretation

Interpreting clustered heat maps requires understanding both the data and the clustering process. Clusters identified in a heat map do not imply causation or biological relevance; they represent patterns of similarity. These patterns must be validated with additional statistical methods or experimental validation.

Limitations


History of Clustered Heat Maps in Biological Research

The use of clustered heat maps in biological research has a rich history, rooted in the need to visualize complex and high-dimensional data generated by modern molecular biology techniques. Key milestones in their development are discussed in the paper The History of the Cluster Heat Map.

Clustered heat maps have been extensively utilized in biological and medical research to visualize complex, high-dimensional data, particularly in genomics, metabolomics, and proteomics. Here are some notable examples where clustered heat maps were crucial for research:

Gene Expression Studies

Clustered heat maps have been instrumental in studies involving genome-wide association and gene expression profiling. For example, in research exploring gene expression patterns across different cancer types, such as breast cancer or colorectal cancer, clustered heat maps helped identify gene clusters that are co-expressed or have similar expression patterns across samples. This clustering enables researchers to detect cancer subtypes and understand the underlying biological processes driving cancer progression. These studies often rely on heat maps to reveal patterns that might suggest potential biomarkers or therapeutic targets.

Metabolomics and Proteomics

In metabolomics, clustered heat maps have been used to visualize the relative abundance of metabolites across different conditions or samples. For example, in a study involving cerebrospinal fluid metabolomics, clustered heat maps allowed researchers to identify metabolic patterns associated with neurological diseases. By clustering metabolite data, the researchers could distinguish between healthy controls and disease states, providing insights into potential diagnostic biomarkers or metabolic pathways involved in disease progression.

Microbial Community Studies

Clustered heat maps have also been used to explore relationships between environmental variables and microbial communities. In microbiome research, they help visualize the abundance of various microbial taxa across different environmental conditions or host states. This visualization can uncover patterns of microbial co-occurrence or exclusion, suggesting ecological interactions that could be critical for understanding health and disease states.

Clinical Research and Patient Stratification

In clinical research, especially in oncology, clustered heat maps are valuable for patient stratification based on molecular profiles. For instance, studies using The Cancer Genome Atlas (TCGA) data have employed clustered heat maps to classify patients into subgroups with distinct molecular signatures. This stratification can inform personalized treatment strategies, tailoring therapies based on the molecular characteristics of each patient’s tumor.

Functional Genomics

In functional genomics, clustered heat maps are used to explore gene function and regulation. For instance, researchers might cluster genes based on their expression profiles across multiple conditions (such as different developmental stages or stress responses) to identify co-regulated genes and infer their roles in specific biological pathways.


These examples illustrate the versatility and importance of clustered heat maps in biological and medical research. They provide a powerful tool for visualizing complex datasets, identifying patterns, and generating hypotheses about underlying biological mechanisms.


Common R and Python Software for Creating Clustered Heat Maps

R and Python are common programming languages used in bioinformatics. Many applications exist for creating static clustered heatmaps in these languages. Some commonly used software is listed below.


R Packages:


Python Libraries:


Advantages of Interactive Heat Maps like NG-CHMs Over Static Heat Maps

Next-Generation Clustered Heat Maps (NG-CHMs), developed by MD Anderson Cancer Center, provide significant advancements over traditional static heat maps. NG-CHMs offer a highly interactive and dynamic environment for data exploration, making them superior for analyzing complex datasets.


Key Features of NG-CHMs:


How NG-CHMs Improve Over Static Heat Maps: