- (Chen & Candan, 2014) ⇒ Xilun Chen, and K. Selcuk Candan. (2014). “LWI-SVD: Low-rank, Windowed, Incremental Singular Value Decompositions on Time-evolving Data Sets.” In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-2014). ISBN:978-1-4503-2956-9. doi:10.1145/2623330.2623671
Singular Value Decomposition (SVD) is computationally costly and therefore a naive implementation does not scale to the needs of scenarios where data evolves continuously. While there are various on-line analysis and incremental decomposition techniques, these may not accurately represent the data or may be slow for the needs of many applications. To address these challenges, in this paper, we propose a Low-rank, Windowed, Incremental SVD (LWI-SVD) algorithm, which (a) leverages efficient and accurate low-rank approximations to speed up incremental SVD updates and (b) uses a window-based approach to aggregate multiple incoming updates (insertions or deletions of rows and columns) and, thus, reduces on-line processing costs. We also present an LWI-SVD with restarts (LWI2-SVD) algorithm which leverages a novel highly efficient partial reconstruction based change detection scheme to support timely refreshing of the decomposition with significant changes in the data and prevent accumulation of errors over time. Experiment results, including comparisons to other state of the art techniques on different data sets and under different parameter settings, confirm that LWI-SVD and LWI2-SVD are both efficient and accurate in maintaining decompositions.
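The general idea of updating a low-rank SVD with a window of buffered rows, rather than recomputing it from scratch, can be illustrated with a minimal NumPy sketch. This is not the LWI-SVD algorithm from the paper: it is a generic row-append update on a truncated SVD, and it omits deletions, column updates, and the change-detection restarts of LWI2-SVD. The function names `truncated_svd` and `append_rows` and the parameter `k` (target rank) are hypothetical names chosen for this illustration.

```python
import numpy as np

def truncated_svd(A, k):
    """Rank-k SVD of A computed from scratch (the costly baseline)."""
    U, s, Vt = np.linalg.svd(A, full_matrices=False)
    return U[:, :k], s[:k], Vt[:k, :]

def append_rows(U, s, Vt, B, k):
    """Update a rank-r SVD (U, s, Vt) of A after appending the rows in B.

    B plays the role of one window of buffered insertions. Illustrative
    assumption: only row insertions are handled.
    """
    m, r = U.shape
    w = B.shape[0]

    # Split the new rows into the part inside the current row space
    # and the residual orthogonal to it.
    P = B @ Vt.T                  # w x r coefficients in the row space
    R = B - P @ Vt                # w x n residual
    Q, Rr = np.linalg.qr(R.T)     # R.T = Q @ Rr, Q has orthonormal columns
    q = Q.shape[1]

    # Small core matrix whose SVD carries the whole update.
    K = np.block([[np.diag(s), np.zeros((r, q))],
                  [P,          Rr.T]])
    Uk, sk, Vkt = np.linalg.svd(K, full_matrices=False)

    # Rotate the enlarged bases by the factors of the core matrix.
    U_ext = np.block([[U,                np.zeros((m, w))],
                      [np.zeros((w, r)), np.eye(w)]])
    U_new = U_ext @ Uk
    Vt_new = Vkt @ np.vstack([Vt, Q.T])

    # Truncate back to rank k to keep later updates cheap.
    return U_new[:, :k], sk[:k], Vt_new[:k, :]
```

In this sketch the only full SVD per window is taken on the small core matrix `K`, whose size depends on the rank and the window length rather than on the number of rows already seen. Each truncation discards some information, so errors can build up over many windows when the data is not exactly low-rank, which is the kind of drift the restart scheme described in the abstract is meant to address.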
|2014 LWISVDLowRankWindowedIncrementa||Xilun Chen, K. Selcuk Candan||LWI-SVD: Low-rank, Windowed, Incremental Singular Value Decompositions on Time-evolving Data Sets||10.1145/2623330.2623671||2014|
|Author||Xilun Chen and K. Selcuk Candan|
|proceedings||Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining|
|title||LWI-SVD: Low-rank, Windowed, Incremental Singular Value Decompositions on Time-evolving Data Sets|