Login   Register  
PHP Classes

Page Crawler: Crawl Web pages to extract the link URLs

Recommend this page to a friend!
Stumble It! Stumble It! Bookmark in del.icio.us Bookmark in del.icio.us

  Author Author  
Picture of Jacek Lukasiewicz
Name: Jacek Lukasiewicz is available for providing paid consulting. Contact Jacek Lukasiewicz .
Packages: 6 Browse all classes by Jacek Lukasiewicz Browse all classes by
Country: Poland Poland - PHP jobs in Poland
Age: 39
All time rank: 3334 in Poland Poland
Week rank: 67 Up4 in Poland Poland Equal
Innovation award
Innovation award
Nominee: 2x

Winner: 2x

  Detailed description   Download Download .zip .tar.gz  
This class can crawl Web pages to extract the link URLs.

It can retrieve HTML pages from given URLs and parse them to extract the link URLs.

The class can recursively retrieve the linked page URLs to also extract its links.

It can filter the retrieved links to exclude certain URLs, anchor text, link depth, external links.

The links can be returned as an array or as an HTML list.

  Classes of Jacek Lukasiewicz  >  Page Crawler  >  Download Download .zip .tar.gz  >  Support forum Support forum (1)  >  Blog Blog  >  RSS 1.0 feed RSS 2.0 feed Latest changes  
Name: Page Crawler
Base name: page-crawler
Description: Crawl Web pages to extract the link URLs
Version: 2
PHP version: 5.0
License: BSD License
All time users: 634 users
All time rank: 4481
Week users: 1 user
Week rank: 1884 Up
  Groups   Screenshots Screenshots   Rate classes User ratings   Applications   Files Files  

Group folder image HTML HTML generation and processing View top rated classes
Group folder image PHP 5 Classes using PHP 5 specific features View top rated classes
Group folder image Searching Search engines, crawling and indexing View top rated classes

  Files folder image Screenshots  
File Role Description
Accessible without login Image file screenshot.jpg Screen scr

  User ratings  
There are not enough user ratings to display for this class.

  Applications that use this class  
No application links were specified for this class.
Add link image If you know an application of this package, send a message to the author to add a link here.
  Files folder image Files  
File Role Description
Plain text file Crawler.php Class Crawler main Class
Plain text file example.php Example Examples
Plain text file IReader.php Class Interface for custom reader classes
Plain text file Reader.php Class Default reader class
Image file screen.JPG Output screenshot

Download Download all files: page-crawler.tar.gz page-crawler.zip
NOTICE: if you are using a download manager program like 'GetRight', please Login before trying to download this archive.