Mining website log to improve its findability

Jiann Cherng Shieh*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution


In network environments with large amounts of digitized data, websites are the information strongholds that institutions, organizations, and enterprises must set up for their specific purposes. However they are built, websites should let users find the information they need quickly and intuitively. As users surf library websites, the website logs keep track of their actual behavior in finding that information. We can therefore apply data mining techniques to explore users' information-seeking behavior. Based on this evidence, we attempt to reconstruct the websites to improve their internal findability. In this paper, we propose a heuristic algorithm that cleans the website log data, extracts user sub-sessions according to the critical time of session navigation, and calculates each sub-session's threshold time of target page under different weights to determine its navigating parent page. We used the alternative parent pages obtained under the different weights to reconstruct various websites. We conducted task-oriented experiments with 4 tasks and 25 participants to measure the findability of each. An analysis of variance on task completion time shows that the reconstructed website has better findability.
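The abstract does not give the algorithm's details, so the following is only a minimal sketch of the general idea under stated assumptions, not the author's method. It assumes the log has already been cleaned into `(user, timestamp_in_seconds, page)` records, that a sub-session ends when the gap between two requests exceeds a critical time, and that a page counts as a target when the user stays on it for at least a threshold time. The constants, function names, and record layout are all hypothetical, and the weighting scheme mentioned in the abstract is not modeled.

```python
from collections import defaultdict

# Hypothetical values; the abstract does not state the actual thresholds.
CRITICAL_TIME = 30 * 60   # seconds: a longer gap starts a new sub-session (assumed)
THRESHOLD_TIME = 60       # seconds: minimum dwell time for a target page (assumed)


def split_sub_sessions(entries, critical_time=CRITICAL_TIME):
    """Group (user, timestamp, page) records into per-user sub-sessions.

    A new sub-session starts whenever the gap between two consecutive
    requests from the same user exceeds critical_time.
    """
    by_user = defaultdict(list)
    for user, ts, page in entries:
        by_user[user].append((ts, page))

    sessions = []
    for hits in by_user.values():
        hits.sort()  # order each user's requests by timestamp
        current = [hits[0]]
        for prev, nxt in zip(hits, hits[1:]):
            if nxt[0] - prev[0] > critical_time:
                sessions.append(current)
                current = []
            current.append(nxt)
        sessions.append(current)
    return sessions


def count_parent_candidates(sessions, threshold_time=THRESHOLD_TIME):
    """Count, for each target page, which page was visited just before it.

    A page is treated as a target when the time until the next request in
    the same sub-session is at least threshold_time. The last page of a
    sub-session is skipped because its dwell time is unknown.
    """
    counts = defaultdict(lambda: defaultdict(int))
    for session in sessions:
        for i in range(1, len(session) - 1):
            ts, page = session[i]
            dwell = session[i + 1][0] - ts
            if dwell >= threshold_time:
                parent = session[i - 1][1]
                counts[page][parent] += 1
    return counts


# Made-up example records, for illustration only.
entries = [
    ("u1", 0, "/home"), ("u1", 10, "/services"), ("u1", 20, "/hours"),
    ("u1", 200, "/home"),
    ("u1", 5000, "/home"), ("u1", 5010, "/hours"), ("u1", 5100, "/contact"),
]
sessions = split_sub_sessions(entries)
counts = count_parent_candidates(sessions)
```

In a sketch like this, the page that most often precedes a target page would be a natural candidate for its parent in a reconstructed site structure; how the paper actually weights and selects parents is not specified in the abstract.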

Original language: English
Title of host publication: Networked Digital Technologies - Second International Conference, NDT 2010, Proceedings
Number of pages: 9
Edition: PART 2
Publication status: Published - 2010
Event: 2nd International Conference on 'Networked Digital Technologies', NDT 2010 - Prague, Czech Republic
Duration: 2010 Jul 7 to 2010 Jul 9

Publication series

Name: Communications in Computer and Information Science
Number: PART 2
Volume: 88 CCIS
ISSN (Print): 1865-0929


Other: 2nd International Conference on 'Networked Digital Technologies', NDT 2010
Country/Territory: Czech Republic


  • Usability
  • findability
  • web log mining

ASJC Scopus subject areas

  • General Computer Science
  • General Mathematics

