Innovation Award
 June 2008
Number 3 |
When you need to provide a search engine for a site, usually it is better to have a crawler program retrieving the site contents and index the contents in a database, so it can searched more easily.
However, when you need to search a site that you do not control or was not indexed by a search engine, you need to crawl and search on demand.
This class provides an on demand search solution to crawl and search text in the pages of a given site.
Manuel Lemos |
This class can be used to crawl Web pages to search for given text in it.
It retrieves a given Web page and searches for links contained in it.
The new links that are found are added to a queue to be crawled later and so implement recursive searching up to a given depth limit.
The class looks for pages with text that match a given regular expression.
 |
| Project record: |
link_searcher |
| Popularity score: |
18.0 |
| Vitality score: |
1.0 |
|
| Not yet rated by the users |
No application links were specified for this class.

If you know an application of this package, send a message to the
author to add a link here.