The problem of finding connected components in a graph is common to several applications dealing with graph analytics, such as social network analysis, web graph mining and image processing. The exponentially growing size of graphs requires the definition of appropriated computational models and algorithms for their processing on high throughput distributed architectures. In this paper we present CRACKER, an efficient iterative algorithm to detect connected components in large graphs. The strategy of CRACKER is to iteratively grow a spanning tree for each connected component of the graph. Nodes added to such trees are discarded from the computation in the subsequent iterations. We provide an extensive experimental evaluation considering a wide variety of synthetic and real-world graphs. The experimental evaluation shows that CRACKER consistently outperforms state-of-the-art approaches both in terms of total computation time and volume of messages exchanged.
Cracker: Crumbling Large Graphs Into Connected Components
LUCCHESE, Claudio
2015-01-01
Abstract
The problem of finding connected components in a graph is common to several applications dealing with graph analytics, such as social network analysis, web graph mining and image processing. The exponentially growing size of graphs requires the definition of appropriated computational models and algorithms for their processing on high throughput distributed architectures. In this paper we present CRACKER, an efficient iterative algorithm to detect connected components in large graphs. The strategy of CRACKER is to iteratively grow a spanning tree for each connected component of the graph. Nodes added to such trees are discarded from the computation in the subsequent iterations. We provide an extensive experimental evaluation considering a wide variety of synthetic and real-world graphs. The experimental evaluation shows that CRACKER consistently outperforms state-of-the-art approaches both in terms of total computation time and volume of messages exchanged.I documenti in ARCA sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.