Computer Science – Information Retrieval
Scientific paper
2009-09-12
International Journal of Information Technology, Vol. 15 No. 1, 2009
Computer Science
Information Retrieval
16 pages, 7 figures
Scientific paper
This paper describes a clustering method to group the most similar and important weblogs with their descriptive shared words by using a technique from multilinear algebra known as PARAFAC tensor decomposition. The proposed method first creates labeled-link network representation of the weblog datasets, where the nodes are the blogs and the labels are the shared words. Then, 3-way adjacency tensor is extracted from the network and the PARAFAC decomposition is applied to the tensor to get pairs of node lists and label lists with scores attached to each list as the indication of the degree of importance. The clustering is done by sorting the lists in decreasing order and taking the pairs of top ranked blogs and words. Thus, unlike standard co-clustering methods, this method not only groups the similar blogs with their descriptive words but also tends to produce clusters of important blogs and descriptive words.
No associations
LandOfFree
Weblog Clustering in Multilinear Algebra Perspective does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.
If you have personal experience with Weblog Clustering in Multilinear Algebra Perspective, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Weblog Clustering in Multilinear Algebra Perspective will most certainly appreciate the feedback.
Profile ID: LFWR-SCP-O-477464