Clustering and visualizing similarity networks of membrane proteins

Geng Ming Hu, Te Lun Mai, Chi Ming Chen*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

7 Citations (Scopus)

Abstract

We proposed a fast and unsupervised clustering method, minimum span clustering (MSC), for analyzing the sequence-structure-function relationship of biological networks, and demonstrated its validity in clustering the sequence/structure similarity networks (SSN) of 682 membrane protein (MP) chains. The MSC clustering of MPs based on their sequence information was found to be consistent with their tertiary structures and functions. For the largest seven clusters predicted by MSC, the consistency in chain function within the same cluster is found to be 100%. From analyzing the edge distribution of SSN for MPs, we found a characteristic threshold distance for the boundary between clusters, over which SSN of MPs could be properly clustered by an unsupervised sparsification of the network distance matrix. The clustering results of MPs from both MSC and the unsupervised sparsification methods are consistent with each other, and have high intracluster similarity and low intercluster similarity in sequence, structure, and function. Our study showed a strong sequence-structure-function relationship of MPs. We discussed evidence of convergent evolution of MPs and suggested applications in finding structural similarities and predicting biological functions of MP chains based on their sequence information.

Original languageEnglish
Pages (from-to)1450-1461
Number of pages12
JournalProteins: Structure, Function and Bioinformatics
Volume83
Issue number8
DOIs
Publication statusPublished - 2015 Aug 1

Keywords

  • Membrane proteins
  • Network clustering
  • Protein function
  • Protein similarity networks
  • Protein structure
  • Sequence homology

ASJC Scopus subject areas

  • Structural Biology
  • Biochemistry
  • Molecular Biology

Fingerprint

Dive into the research topics of 'Clustering and visualizing similarity networks of membrane proteins'. Together they form a unique fingerprint.

Cite this