Journal Article

MEI: Mutual Enhanced Infinite Community–Topic Model for Analyzing Text-Augmented Social Networks

Dongsheng Duan, Yuhua Li, Ruixuan Li, Zhengding Lu and Aiming Wen

in The Computer Journal

Published on behalf of British Computer Society

Volume 56, issue 3, pages 336-354
Published in print March 2013 | ISSN: 0010-4620
Published online May 2012 | e-ISSN: 1460-2067 | DOI: https://dx.doi.org/10.1093/comjnl/bxs045

Show Summary Details

Preview

The community and the topic can help summarize the text-augmented social networks. Existing works mixed up the community and the topic by regarding them as the same. However, there is an inherent difference between the community and the topic such that considering them as the same is not so flexible. We propose a mutual enhanced infinite (MEI) community–topic model to detect communities and topics simultaneously in text-augmented social networks. The community and the topic are correlated via a community–topic distribution. The mutual enhancement effect between the community and the topic is validated by introducing two novel measures perplexity with community (perplexityc) and mean of the rank (MRK) with topic (MRKt). To determine the numbers of communities and topics automatically, the Dirichlet Process Mixture (DPM) model and the Hierarchical Dirichlet Process mixture (HDP) model are used to model the community and the topic, respectively. We further introduce parameters to model the weight of the community and the topic responsible for community inference. Experiments on the co-author network built from a subset of DBLP data show that MEI outperforms the baseline models in terms of the generalization performance. Parameter study shows that MEI is averagely improved by 3.7 and 15.5% in perplexityc and MRKt, respectively, by setting a low weight for the topic. We also experimentally validate that MEI can determine the appropriate numbers of communities and topics.

Keywords: mutual enhanced infinite community–topic model; community; topic; Dirichlet process; Gibbs sampling

Journal Article.  0 words. 

Subjects: Computer Science