TY - JOUR
T1 - Mapping cancer heterogeneity
T2 - a consensus network approach to subtypes and pathways
AU - Hu, Geng Ming
AU - Chen, Hsin Wei
AU - Chen, Chi Ming
N1 - Publisher Copyright:
© The Author(s) 2025.
PY - 2025/9/1
Y1 - 2025/9/1
N2 - We introduce consensus MSClustering, an unsupervised hierarchical network approach that integrates multi-omics data to identify molecular subtypes and conserved pathways across diverse cancers. Using a novel heterogeneity index, we selected 167 key genes with functionally coherent roles validated through Gene Ontology analysis. Applied to 2439 tumors spanning 10 cancer types—and successfully extended to 2675 tumors (12 types) including cases with incomplete molecular data—MSClustering demonstrated: (i) precise classification of major cancer types and breast cancer molecular subtypes; (ii) discovery of novel pan-cancer squamous metaplastic signatures; (iii) exceptional prognostic stratification (log-rank P= 2.3 × 10−46); and (iv) superior performance over existing methods (COCA/SNF) in classification accuracy, cluster robustness, and computational efficiency. The method’s multi-scale architecture uniquely resolves breast cancer heterogeneity across biological resolution levels. Pathway analysis further revealed four key oncogenic programs—proteoglycan signaling, chromosomal stability, VEGF-mediated angiogenesis, and drug metabolism—along with disruptions in immune and digestive system functions. This integrative framework marks a significant advancement in cancer genomics by enabling more refined molecular classification, enhanced prognostic insights, and deeper understanding of disease mechanisms. These results highlight the potential of MSClustering to inform the development of clinically relevant biomarkers and support more personalized strategies in precision oncology.
AB - We introduce consensus MSClustering, an unsupervised hierarchical network approach that integrates multi-omics data to identify molecular subtypes and conserved pathways across diverse cancers. Using a novel heterogeneity index, we selected 167 key genes with functionally coherent roles validated through Gene Ontology analysis. Applied to 2439 tumors spanning 10 cancer types—and successfully extended to 2675 tumors (12 types) including cases with incomplete molecular data—MSClustering demonstrated: (i) precise classification of major cancer types and breast cancer molecular subtypes; (ii) discovery of novel pan-cancer squamous metaplastic signatures; (iii) exceptional prognostic stratification (log-rank P= 2.3 × 10−46); and (iv) superior performance over existing methods (COCA/SNF) in classification accuracy, cluster robustness, and computational efficiency. The method’s multi-scale architecture uniquely resolves breast cancer heterogeneity across biological resolution levels. Pathway analysis further revealed four key oncogenic programs—proteoglycan signaling, chromosomal stability, VEGF-mediated angiogenesis, and drug metabolism—along with disruptions in immune and digestive system functions. This integrative framework marks a significant advancement in cancer genomics by enabling more refined molecular classification, enhanced prognostic insights, and deeper understanding of disease mechanisms. These results highlight the potential of MSClustering to inform the development of clinically relevant biomarkers and support more personalized strategies in precision oncology.
KW - consensus clustering
KW - cross-platform analysis
KW - enriched pathway network
KW - heterogeneity
KW - key genes
KW - unsupervised learning
UR - https://www.scopus.com/pages/publications/105015221133
UR - https://www.scopus.com/pages/publications/105015221133#tab=citedBy
U2 - 10.1093/bib/bbaf452
DO - 10.1093/bib/bbaf452
M3 - Article
C2 - 40905375
AN - SCOPUS:105015221133
SN - 1467-5463
VL - 26
JO - Briefings in Bioinformatics
JF - Briefings in Bioinformatics
IS - 5
M1 - bbaf452
ER -