TY - GEN
T1 - Mining website log to improve its findability
AU - Shieh, Jiann Cherng
PY - 2010
Y1 - 2010
N2 - Under the network environments with large amounts of digitalized data, websites are the information strongholds that institutions, organizations or enterprises must set up for their specific purposes. No matter how they have been built, websites should offer the capability that users can find their required information quickly and intuitively. Surfing around the library websites, the website logs always keep tracks of users' factual behaviors of finding their required information. Thus we can apply data mining techniques possibly to explore users' information seeking behavior. Based on these evidences, we attempt to reconstruct the websites to promote their internal findability. In this paper, we proposed a heuristic algorithm to clean the website log data, to extract user sub-sessions according to their respective the critical time of session navigation, and to calculate each sub-session's the threshold time of target page with different weights to determine its navigating parent page. We utilized the alternate parent pages of weights to reconstruct various websites. We conduct task-oriented experiments of 4 tasks and 25 participants to measure the effects of their findability respectively. By the analysis of variance on time to complete the tasks, the result has shown that the reconstructed website has better findability performance.
AB - Under the network environments with large amounts of digitalized data, websites are the information strongholds that institutions, organizations or enterprises must set up for their specific purposes. No matter how they have been built, websites should offer the capability that users can find their required information quickly and intuitively. Surfing around the library websites, the website logs always keep tracks of users' factual behaviors of finding their required information. Thus we can apply data mining techniques possibly to explore users' information seeking behavior. Based on these evidences, we attempt to reconstruct the websites to promote their internal findability. In this paper, we proposed a heuristic algorithm to clean the website log data, to extract user sub-sessions according to their respective the critical time of session navigation, and to calculate each sub-session's the threshold time of target page with different weights to determine its navigating parent page. We utilized the alternate parent pages of weights to reconstruct various websites. We conduct task-oriented experiments of 4 tasks and 25 participants to measure the effects of their findability respectively. By the analysis of variance on time to complete the tasks, the result has shown that the reconstructed website has better findability performance.
KW - Usability
KW - findability
KW - web log mining
UR - http://www.scopus.com/inward/record.url?scp=77956123590&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=77956123590&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-14306-9_24
DO - 10.1007/978-3-642-14306-9_24
M3 - Conference contribution
AN - SCOPUS:77956123590
SN - 3642143059
SN - 9783642143052
T3 - Communications in Computer and Information Science
SP - 239
EP - 247
BT - Networked Digital Technologies - Second International Conference, NDT 2010, Proceedings
T2 - 2nd International Conference on 'Networked Digital Technologies', NDT 2010
Y2 - 7 July 2010 through 9 July 2010
ER -