A Proposed Architecture for Continuous Web Monitoring Through Online Crawling of Blogs

Computer Science – Information Retrieval

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

10 pages, 2 figures

Scientific paper

10.5121/iju.2012.3102

Getting informed of what is registered in the Web space on time, can greatly help the psychologists, marketers and political analysts to familiarize, analyse, make decision and act correctly based on the society`s different needs. The great volume of information in the Web space hinders us to continuously online investigate the whole space of the Web. Focusing on the considered blogs limits our working domain and makes the online crawling in the Web space possible. In this article, an architecture is offered which continuously online crawls the related blogs, using focused crawler, and investigates and analyses the obtained data. The online fetching is done based on the latest announcements of the ping server machines. A weighted graph is formed based on targeting the important key phrases, so that a focused crawler can do the fetching of the complete texts of the related Web pages, based on the weighted graph.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

A Proposed Architecture for Continuous Web Monitoring Through Online Crawling of Blogs does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with A Proposed Architecture for Continuous Web Monitoring Through Online Crawling of Blogs, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and A Proposed Architecture for Continuous Web Monitoring Through Online Crawling of Blogs will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-156170

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.