TY - JOUR
T1 - Development and application of databases for institutional research and analysis
AU - Tseng, Yuen Hsien
PY - 2016/3/30
Y1 - 2016/3/30
N2 - This article elaborates on possible best practices for developing databases for institutional research and analysis, based on knowledge from Educational Science, Library Science, and Information Engineering, years of experience in developing educational databases, and a recent survey of related technology and products. Several development options are compared to show their benefits and disadvantages under different conditions. Three representative analysis tasks are reported to verify and show the synergy of these ideas and experience. In particular, this article proposes a sustainable workflow: (1) data collection and aggregation, (2) cataloguing, (3) regulation, (4) archiving, and (5) usage, and describes the essential caveats of each step. The situations in which data normalization and de-normalization apply are described. The capabilities of domestic vendors of related products are briefly discussed based on proof-of-concept testing. Finally, real-world institutional analyses are conducted to share our experience. Overall, the first four processes in the above workflow are the most time-consuming and costly. Once data have been well prepared, recent visualization analysis tools allow users to easily discover meaningful patterns that inspire hypotheses, and to explore the database for evidence supporting their hypotheses and decisions. In the future, we expect that event evolution simulation techniques, which allow users to foresee the results of various input scenarios, could play an important role in educational data analysis, in addition to the maturing data visualization tools.
AB - This article elaborates on possible best practices for developing databases for institutional research and analysis, based on knowledge from Educational Science, Library Science, and Information Engineering, years of experience in developing educational databases, and a recent survey of related technology and products. Several development options are compared to show their benefits and disadvantages under different conditions. Three representative analysis tasks are reported to verify and show the synergy of these ideas and experience. In particular, this article proposes a sustainable workflow: (1) data collection and aggregation, (2) cataloguing, (3) regulation, (4) archiving, and (5) usage, and describes the essential caveats of each step. The situations in which data normalization and de-normalization apply are described. The capabilities of domestic vendors of related products are briefly discussed based on proof-of-concept testing. Finally, real-world institutional analyses are conducted to share our experience. Overall, the first four processes in the above workflow are the most time-consuming and costly. Once data have been well prepared, recent visualization analysis tools allow users to easily discover meaningful patterns that inspire hypotheses, and to explore the database for evidence supporting their hypotheses and decisions. In the future, we expect that event evolution simulation techniques, which allow users to foresee the results of various input scenarios, could play an important role in educational data analysis, in addition to the maturing data visualization tools.
KW - Data consistency
KW - Data literacy
KW - Data normalization
KW - Data warehouse
KW - Visualization analysis
UR - http://www.scopus.com/inward/record.url?scp=85002582835&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85002582835&partnerID=8YFLogxK
U2 - 10.6151/CERQ.2016.2401.04
DO - 10.6151/CERQ.2016.2401.04
M3 - Article
AN - SCOPUS:85002582835
SN - 1814-4810
VL - 24
SP - 107
EP - 134
JO - Contemporary Educational Research Quarterly
JF - Contemporary Educational Research Quarterly
IS - 1
ER -