Journal Article

Community detection: effective evaluation on large social networks

Conrad Lee and Pádraig Cunningham

in Journal of Complex Networks

Volume 2, issue 1, pages 19-37
Published in print March 2014 | ISSN: 2051-1310
Published online October 2013 | e-ISSN: 2051-1329 | DOI:
Community detection: effective evaluation on large social networks

Show Summary Details


While many recently proposed methods aim to detect network communities in large datasets, such as those generated by social media and telecommunications services, most evaluation (i.e. benchmarking) of this research is based on small, hand-curated datasets. We argue that these two types of networks differ so significantly that, by evaluating algorithms solely on the smaller networks, we know little about how well they perform on the larger datasets. Recent work addresses this problem by introducing social network datasets annotated with meta-data that is believed to approximately indicate a ‘ground truth’ set of network communities. While such efforts are a step in the right direction, we find this meta-data problematic for two reasons. First, in practice, the groups contained in such meta-data may only be a subset of a network's communities. Second, while it is often reasonable to assume that meta-data is related to network communities in some way, we must be cautious about assuming that these groups correspond closely to network communities. Here, we consider these difficulties and propose an evaluation scheme based on a classification task that is tailored to deal with them.

Keywords: social networks; community detection; evaluation; benchmarking

Journal Article.  9302 words.  Illustrated.

Subjects: Mathematics ; Computer Science

Full text: subscription required

How to subscribe Recommend to my Librarian

Users without a subscription are not able to see the full content. Please, subscribe or login to access all content. subscribe or login to access all content.