A pattern restore method for restoring missing patterns in server side clickstream data

I-H. Ting, D. Kudenko, C. Kimble

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

When analyzing patterns in server side data, it quickly becomes apparent that some of the data originating from the client is lost, mainly due to the caching of web pages. Missing data is a very important issue when using server side data to analyze a user's browsing behavior, since the quality of the browsing patterns that can be identified depends on the quality of the data. In this paper, we present a series of experiments to demonstrate the extent of the data loss in different browsing environments and illustrate the difference this makes in the resulting browsing patterns when visualized as footstep graphs. We propose an algorithm, called the Pattern Restore Method (PRM), for restoring some of the data that has been lost and evaluate the efficiency and accuracy of this algorithm.
Original language: English
Title of host publication: Web Technologies Research and Development - APWeb 2005
Editors: Y Zhang, K Tanaka, JX Yu, S Wang, M Li
Place of Publication: Berlin / Heidelberg
Publisher: Springer
Pages: 501-512
Number of pages: 11
ISBN (Print): 978-3-540-25207-8
DOIs
Publication status: Published - 2005
Event: 7th Asia-Pacific Web Conference - Shanghai, China
Duration: 29 Mar 2005 to 1 Apr 2005

Publication series

Name: Lecture Notes in Computer Science
Publisher: Springer
Volume: 3399

Conference

Conference: 7th Asia-Pacific Web Conference
Country/Territory: China
City: Shanghai
Period: 29/03/05 to 1/04/05
