Publication Data
STAGGER: Periodicity Mining of Data Streams Using Expanding Sliding Windows
Abstract: Sensor devices are becoming ubiquitous, especially in
measurement and monitoring applications. Because of the real-time, append-only and
semi-infinite natures of the generated sensor data streams, an online incremental
approach is a necessity for mining stream data types. In this paper, we propose
STAGGER: a one-pass, online and incremental algorithm for mining periodic patterns in
data streams. STAGGER does not require that the user pre-specify the periodicity rate
of the data. Instead, STAGGER discovers the potential periodicity rates. STAGGER
maintains multiple expanding sliding windows staggered over the stream, where
computations are shared among the multiple overlapping windows. Small-length sliding
windows are imperative for early and real-time output, yet are limited to discover
short periodicity rates. As streamed data arrives continuously, the sliding windows
expand in length in order to cover the whole stream. Larger-length sliding windows are
able to discover longer periodicity rates. STAGGER incrementally maintains a tree-like
data structure for the frequent periodic patterns of each discovered potential
periodicity rate. In contrast to the Fourier/Wavelet-based approaches used for
discovering periodicity rates, STAGGER not only discovers a wider, more accurate set of
periodicities, but also discovers the periodic patterns themselves. In fact,
experimental results with real and synthetic data sets show that STAGGER outperforms
Fourier/Wavelet-based approaches by an order of magnitude in terms of the accuracy of
the discovered periodicity rates. Moreover, realdata experiments demonstrate the
practicality of the discovered periodic patterns.
