TY - JOUR
T1 - Unraveling the evolutionary patterns and phylogenomics of coronaviruses
T2 - A consensus network approach
AU - Hu, Geng Ming
AU - Tai, Yu Chen
AU - Chen, Chi Ming
N1 - Publisher Copyright:
© 2023 Wiley Periodicals LLC.
PY - 2023/11
Y1 - 2023/11
N2 - The COVID-19 pandemic emphasizes the significance of studying coronaviruses (CoVs). This study investigates the evolutionary patterns of 350 CoVs using four structural proteins (S, E, M, and N) and introduces a consensus methodology to construct a comprehensive phylogenomic network. Our clustering of CoVs into 4 genera is consistent with the current CoV classification. Additionally, we calculate network centrality measures to identify CoV strains with significant average weighted degree and betweenness centrality values, with a specific focus on RaTG13 in the beta genus and NGA/A116E7/2006 in the gamma genus. We compare the phylogenetics of CoVs using our distance-based approach and the character-based model with IQ-TREE. Both methods yield largely consistent outcomes, indicating the reliability of our consensus approach. However, it is worth mentioning that our consensus method achieves an approximate 5000-fold increase in speed compared to IQ-TREE when analyzing the data set of 350 CoVs. This improved efficiency enhances the feasibility of conducting large-scale phylogenomic studies on CoVs.
AB - The COVID-19 pandemic emphasizes the significance of studying coronaviruses (CoVs). This study investigates the evolutionary patterns of 350 CoVs using four structural proteins (S, E, M, and N) and introduces a consensus methodology to construct a comprehensive phylogenomic network. Our clustering of CoVs into 4 genera is consistent with the current CoV classification. Additionally, we calculate network centrality measures to identify CoV strains with significant average weighted degree and betweenness centrality values, with a specific focus on RaTG13 in the beta genus and NGA/A116E7/2006 in the gamma genus. We compare the phylogenetics of CoVs using our distance-based approach and the character-based model with IQ-TREE. Both methods yield largely consistent outcomes, indicating the reliability of our consensus approach. However, it is worth mentioning that our consensus method achieves an approximate 5000-fold increase in speed compared to IQ-TREE when analyzing the data set of 350 CoVs. This improved efficiency enhances the feasibility of conducting large-scale phylogenomic studies on CoVs.
KW - consensus clustering
KW - coronavirus evolution
KW - phylogenomic networks
UR - http://www.scopus.com/inward/record.url?scp=85177854380&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85177854380&partnerID=8YFLogxK
U2 - 10.1002/jmv.29233
DO - 10.1002/jmv.29233
M3 - Article
C2 - 38009694
AN - SCOPUS:85177854380
SN - 0146-6615
VL - 95
JO - Journal of Medical Virology
JF - Journal of Medical Virology
IS - 11
M1 - e29233
ER -