Semantic Content Filtering with Wikipedia and Ontologies

Computer Science – Information Retrieval

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

9 pages, Third International Workshop on Semantic Aspects in Data Mining (SADM'10) in conjunction with the 2010 IEEE Internati

Scientific paper

The use of domain knowledge is generally found to improve query efficiency in content filtering applications. In particular, tangible benefits have been achieved when using knowledge-based approaches within more specialized fields, such as medical free texts or legal documents. However, the problem is that sources of domain knowledge are time-consuming to build and equally costly to maintain. As a potential remedy, recent studies on Wikipedia suggest that this large body of socially constructed knowledge can be effectively harnessed to provide not only facts but also accurate information about semantic concept-similarities. This paper describes a framework for document filtering, where Wikipedia's concept-relatedness information is combined with a domain ontology to produce semantic content classifiers. The approach is evaluated using Reuters RCV1 corpus and TREC-11 filtering task definitions. In a comparative study, the approach shows robust performance and appears to outperform content classifiers based on Support Vector Machines (SVM) and C4.5 algorithm.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Semantic Content Filtering with Wikipedia and Ontologies does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Semantic Content Filtering with Wikipedia and Ontologies, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Semantic Content Filtering with Wikipedia and Ontologies will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-98891

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.