Abstract
SimRank is an arresting measure of node-pair similarity based on hyperlinks. It iteratively follows the concept that 2 nodes are similar if they are referenced by similar nodes. Real graphs are often large, and links constantly evolve with small changes over time. This paper considers fast incremental computations of SimRank on link-evolving graphs. The prior approach [12] to this issue factorizes the graph via a singular value decomposition (SVD) first, and then incrementally maintains this factorization for link updates at the expense of exactness. Consequently, all node-pair similarities are estimated in O(r4n2) time on a graph of n nodes, where r is the target rank of the low-rank approximation, which is not negligibly small in practice. In this paper, we propose a novel fast incremental paradigm. (1) We characterize the SimRank update matrix ΔS, in response to every link update, via a rank-one Sylvester matrix equation. By virtue of this, we devise a fast incremental algorithm computing similarities of n2 node-pairs in O(Kn2) time for K iterations. (2) We also propose an effective pruning technique capturing the “affected areas” of ΔS to skip unnecessary computations, without loss of exactness. This can further accelerate the incremental SimRank computation to O(K(nd+|AFF|)) time, where d is the average in-degree of the old graph, and |AFF| (≤ n2) is the size of “affected areas” in ΔS, and in practice, |AFF| ≪ n2. Our empirical evaluations verify that our algorithm (a) outperforms the best known link-update algorithm [12], and (b) runs much faster than its batch counterpart when link updates are small.
Original language | English |
---|---|
Title of host publication | 30th IEEE International Conference on Data Engineering (IEEE ICDE '14) |
Publisher | IEEE |
Pages | 304-315 |
Number of pages | 12 |
ISBN (Electronic) | 978-1-4799-2555-1 |
DOIs | |
Publication status | Published - 19 May 2014 |
Event | 2014 IEEE 30th International Conference on Data Engineering - Chicago, United States Duration: 31 Mar 2014 → 4 Apr 2014 |
Publication series
Name | 2014 IEEE 30th International Conference on Data Engineering |
---|---|
Publisher | IEEE |
ISSN (Print) | 1063-6382 |
ISSN (Electronic) | 2375-026X |
Conference
Conference | 2014 IEEE 30th International Conference on Data Engineering |
---|---|
Abbreviated title | ICDE 2014 |
Country/Territory | United States |
City | Chicago |
Period | 31/03/14 → 4/04/14 |
Bibliographical note
This paper has received CiSRA Best Research Paper Award in 2014.© 2014 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.