Login   Register  
PHP Classes
elePHPant
Icontem

Page Crawler: Crawl Web pages to extract the link URLs

Recommend this page to a friend!
Stumble It! Stumble It! Bookmark in del.icio.us Bookmark in del.icio.us
  Info   Screenshots Screenshots   View files View files (5)   DownloadInstall with Composer Download .zip   Reputation   Support forum (1)   Blog    
Last Updated Ratings Unique User Downloads Download Rankings  
2011-08-25 (3 years ago) RSS 2.0 feedNot enough user ratingsTotal: 655 This week: 2All time: 4,455 This week: 935Up
Version License PHP version Categories  
page-crawler 2BSD License5.0HTML, PHP 5, Searching
Description Author  

This class can crawl Web pages to extract the link URLs.

It can retrieve HTML pages from given URLs and parse them to extract the link URLs.

The class can recursively retrieve the linked page URLs to also extract its links.

It can filter the retrieved links to exclude certain URLs, anchor text, link depth, external links.

The links can be returned as an array or as an HTML list.

Picture of Jacek Lukasiewicz
Name: Jacek Lukasiewicz is available for providing paid consulting. Contact Jacek Lukasiewicz .
Classes: 6 packages by
Country: Poland Poland
Age: 39
All time rank: 3184 in Poland Poland
Week rank: 89 Up2 in Poland Poland Up
Innovation award
Innovation award
Nominee: 2x

Winner: 2x

Screenshots  
  • screenshot.jpg
  Files folder image Files  
File Role Description
Plain text file Crawler.php Class Crawler main Class
Plain text file example.php Example Examples
Plain text file IReader.php Class Interface for custom reader classes
Plain text file Reader.php Class Default reader class
Image file screen.JPG Output screenshot

 Version Control Unique User Downloads Download Rankings  
 0%Total:655All time:4,455
 This week:2This week:935Up
 User Comments (1)