Motivated by a long-term goal in education for measuring Taiwanese civic scientific literacy in media (SLiM), this work reports the detailed techniques to efficiently mine a concept map from 2 years of Chinese news articles (901,446 in total) for SLiM instrument development. From the Chinese news stories, key terms (important words or phrases), known or new to existing lexicons, were first extracted by a simple, yet effective, rule-based algorithm. They were subjected to an association analysis based on their co-occurrence in sentences to reveal their term-to-term relationship. A given list of 3657 index terms from science textbooks were then matched against the term association network. The resulting term network (including 95 scientific terms) was visualized in a concept map to scaffold the instrument developers. When developing an item, the linked term pair not only suggests the topic for the item due to the clear context being mutually reinforced by each other, but also the content itself because of the rich background provided by the recurrent snippets in which they co-occur. In this way, the resulting instrument (comprised of 50 items) reflect the scientific knowledge revealed in the daily news stories, meeting the goal for measuring civic scientific literacy in media. In addition, the concept map mined from the texts served as a convenient tool for item classification, developer collaboration, and expert review and discussion.
ASJC Scopus subject areas
- Computer Science(all)