How the result of graph clustering methods depends on the construction of the graph

Statistics – Machine Learning

Scientific paper


We study graph-based clustering algorithms such as spectral clustering. Given a set of data points, one first has to construct a graph on the data points and then apply a graph clustering algorithm to find a suitable partition of the graph. Our main question is whether and how the construction of the graph (choice of the graph type, choice of parameters, choice of weights) influences the final clustering result. To this end we study the convergence of cluster quality measures such as the normalized cut or the Cheeger cut on various kinds of random geometric graphs as the sample size tends to infinity. It turns out that the limit values of the same objective function are systematically different on different types of graphs. This implies that clustering results systematically depend on the graph and can be very different for different types of graphs. We provide examples to illustrate the implications for spectral clustering.
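The two-step pipeline described in the abstract (build a graph on the sample, then score a partition with a cut criterion) can be sketched as follows. This is a minimal illustration and not the paper's experimental setup: the two-Gaussian sample, the values `k = 10` and `eps = 0.5`, the fixed split at `x = 0`, and all function names are assumptions chosen for the example. It uses the standard definitions Ncut(A, B) = cut(A, B) (1/vol(A) + 1/vol(B)) and Cheeger cut = cut(A, B) / min(vol(A), vol(B)) and evaluates the same partition on an unweighted kNN graph and an unweighted epsilon-neighborhood graph.

```python
import numpy as np

def pairwise_sq_dists(X):
    # Squared Euclidean distances between all rows of X.
    diff = X[:, None, :] - X[None, :, :]
    return np.sum(diff ** 2, axis=-1)

def knn_graph(X, k):
    # Symmetric unweighted kNN graph: i and j are connected if either
    # point is among the k nearest neighbours of the other.
    D = pairwise_sq_dists(X)
    np.fill_diagonal(D, np.inf)          # exclude self-neighbours
    idx = np.argsort(D, axis=1)[:, :k]   # k nearest neighbours per point
    n = X.shape[0]
    W = np.zeros((n, n))
    W[np.repeat(np.arange(n), k), idx.ravel()] = 1.0
    return np.maximum(W, W.T)

def eps_graph(X, eps):
    # Unweighted epsilon-neighbourhood graph: connect points at distance <= eps.
    W = (pairwise_sq_dists(X) <= eps ** 2).astype(float)
    np.fill_diagonal(W, 0.0)
    return W

def cut_and_volumes(W, mask):
    # cut(A, B) = total edge weight between A and its complement B;
    # vol(.) = sum of vertex degrees inside the set.
    cut = W[mask][:, ~mask].sum()
    deg = W.sum(axis=1)
    return cut, deg[mask].sum(), deg[~mask].sum()

def ncut(W, mask):
    cut, vol_a, vol_b = cut_and_volumes(W, mask)
    return cut * (1.0 / vol_a + 1.0 / vol_b)

def cheeger_cut(W, mask):
    cut, vol_a, vol_b = cut_and_volumes(W, mask)
    return cut / min(vol_a, vol_b)

# Assumed toy data: two Gaussian blobs in the plane (not from the paper).
rng = np.random.default_rng(0)
X = np.vstack([
    rng.normal(loc=(-1.0, 0.0), scale=0.5, size=(200, 2)),
    rng.normal(loc=(1.0, 0.0), scale=0.5, size=(200, 2)),
])
mask = X[:, 0] < 0.0                     # fixed partition: split at x = 0

W_knn = knn_graph(X, k=10)               # k is an illustrative choice
W_eps = eps_graph(X, eps=0.5)            # eps is an illustrative choice

ncut_knn, ncut_eps = ncut(W_knn, mask), ncut(W_eps, mask)
cheeger_knn, cheeger_eps = cheeger_cut(W_knn, mask), cheeger_cut(W_eps, mask)

print("kNN graph: Ncut =", ncut_knn, " Cheeger =", cheeger_knn)
print("eps graph: Ncut =", ncut_eps, " Cheeger =", cheeger_eps)
```

The point of the sketch is only that the same partition of the same sample receives different scores depending on the graph. The abstract's claim concerns the limits of these quantities as the sample size grows, which a single finite-sample run does not establish.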
