Links tell us about lexical and semantic Web content

Computer Science – Information Retrieval

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

10 pages, 4 figures

Scientific paper

The latest generation of Web search tools is beginning to exploit hypertext link information to improve ranking\cite{Brin98,Kleinberg98} and crawling\cite{Menczer00,Ben-Shaul99etal,Chakrabarti99} algorithms. The hidden assumption behind such approaches, a correlation between the graph structure of the Web and its content, has not been tested explicitly despite increasing research on Web topology\cite{Lawrence98,Albert99,Adamic99,Butler00}. Here I formalize and quantitatively validate two conjectures drawing connections from link information to lexical and semantic Web content. The clink-content conjecture states that a page is similar to the pages that link to it, i.e., one can infer the lexical content of a page by looking at the pages that link to it. I also show that lexical inferences based on link cues are quite heterogeneous across Web communities. The link-cluster conjecture states that pages about the same topic are clustered together, i.e., one can infer the meaning of a page by looking at its neighbours. These results explain the success of the newest search technologies and open the way for more dynamic and scalable methods to locate information in a topic or user driven way.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Links tell us about lexical and semantic Web content does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Links tell us about lexical and semantic Web content, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Links tell us about lexical and semantic Web content will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-681977

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.