TY - GEN
T1 - Conditional Relationship Extraction for Diseases and Symptoms by a Web Search-Based Approach
AU - Lee, Yi Hui
AU - Koh, Jia Ling
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2019/1/10
Y1 - 2019/1/10
N2 - This paper studies the strategies of automatically extracting the conditional relationships between diseases and symptoms from a Chinese encyclopedia site and the disease-related web pages searched from the Internet. At first, the seed symptoms of a disease are extracted from an online medical encyclopedia automatically. These seed symptoms are utilized as query keywords to automatically find more symptoms of a disease from the unstructured documents of the disease-related search results. Next, a jointly learning method is used to construct the embedded representations of the conditional terms and pattern terms. Besides, the semantic similarity matrix of conditional terms is computed through the co-clustering algorithm to discover the representative conditional terms of the clusters. The result of experiments shows that the proposed method, which discovers the semantically related symptoms of a disease associated with conditionals, achieves high accuracy. Besides, many unusually known symptoms considered by the medical experts are discovered, which may be noticeable symptoms needing further verification in the future.
AB - This paper studies the strategies of automatically extracting the conditional relationships between diseases and symptoms from a Chinese encyclopedia site and the disease-related web pages searched from the Internet. At first, the seed symptoms of a disease are extracted from an online medical encyclopedia automatically. These seed symptoms are utilized as query keywords to automatically find more symptoms of a disease from the unstructured documents of the disease-related search results. Next, a jointly learning method is used to construct the embedded representations of the conditional terms and pattern terms. Besides, the semantic similarity matrix of conditional terms is computed through the co-clustering algorithm to discover the representative conditional terms of the clusters. The result of experiments shows that the proposed method, which discovers the semantically related symptoms of a disease associated with conditionals, achieves high accuracy. Besides, many unusually known symptoms considered by the medical experts are discovered, which may be noticeable symptoms needing further verification in the future.
KW - Information Extraction
KW - Semantic Networks
KW - Text Mining
UR - http://www.scopus.com/inward/record.url?scp=85061919698&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85061919698&partnerID=8YFLogxK
U2 - 10.1109/WI.2018.00-38
DO - 10.1109/WI.2018.00-38
M3 - Conference contribution
AN - SCOPUS:85061919698
T3 - Proceedings - 2018 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2018
SP - 554
EP - 561
BT - Proceedings - 2018 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2018
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 18th IEEE/WIC/ACM International Conference on Web Intelligence, WI 2018
Y2 - 3 December 2018 through 6 December 2018
ER -