PHP Classes

Link Searcher: Crawl Web pages to search for given text

Recommend this page to a friend!
Stumble It! Stumble It! Bookmark in Bookmark in
  Info   Screenshots Screenshots   View files View files (5)   DownloadInstall with Composer Download .zip   Reputation   Support forum   Blog    
Last Updated Ratings Unique User Downloads Download Rankings  
2015-01-12 (4 months ago) RSS 2.0 feedNot yet rated by the usersTotal: 1,857 This week: 1All time: 1,993 This week: 1,201Up
Version License PHP version Categories  
link_searcher 1.0GNU General Publi...4.0HTTP, Searching
Description Author  

This class can be used to crawl Web pages to search for given text in it.

It retrieves a given Web page and searches for links contained in it.

The new links that are found are added to a queue to be crawled later and so implement recursive searching up to a given depth limit.

The class looks for pages with text that match a given regular expression.

Innovation Award  
PHP Programming Innovation award nominee
June 2008
Number 3

Prize: One book of choice by Apress
When you need to provide a search engine for a site, usually it is better to have a crawler program retrieving the site contents and index the contents in a database, so it can searched more easily.

However, when you need to search a site that you do not control or was not indexed by a search engine, you need to crawl and search on demand.

This class provides an on demand search solution to crawl and search text in the pages of a given site.

Manuel Lemos
Picture of Nadir Latif
Name: Nadir Latif is available for providing paid consulting. Contact Nadir Latif .
Classes: 14 packages by
Country: Pakistan Pakistan
Age: 32
All time rank: 881 in Pakistan Pakistan
Week rank: 55 Up2 in Pakistan Pakistan Down
Innovation award
Innovation award
Nominee: 9x

Winner: 1x

  • screen_shot.jpg
  Files folder image Files  
File Role Description
Plain text file index.php Example initial file
Plain text file link_searcher.php Class main program file
Plain text file queue.php Class used to store the links in a page
Plain text file readme.txt Doc. help file
Plain text file LICENSE.txt Doc. Documentation

 Version Control Unique User Downloads Download Rankings  
 100%Total:1,857All time:1,993
 This week:1This week:1,201Up