Statistics – Machine Learning
Scientific paper
2008-03-11
Statistics
Machine Learning
Scientific paper
Being among the easiest ways to find meaningful structure from discrete data, Latent Dirichlet Allocation (LDA) and related component models have been applied widely. They are simple, computationally fast and scalable, interpretable, and admit nonparametric priors. In the currently popular field of network modeling, relatively little work has taken uncertainty of data seriously in the Bayesian sense, and component models have been introduced to the field only recently, by treating each node as a bag of out-going links. We introduce an alternative, interaction component model for communities (ICMc), where the whole network is a bag of links, stemming from different components. The former finds both disassortative and assortative structure, while the alternative assumes assortativity and finds community-like structures like the earlier methods motivated by physics. With Dirichlet Process priors and an efficient implementation the models are highly scalable, as demonstrated with a social network from the Last.fm web site, with 670,000 nodes and 1.89 million links.
Aukia Janne
Kaski Samuel
Sinkkonen Janne
No associations
LandOfFree
Component models for large networks does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.
If you have personal experience with Component models for large networks, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Component models for large networks will most certainly appreciate the feedback.
Profile ID: LFWR-SCP-O-286523