Login   Register  
PHP Classes
elePHPant
Icontem

Class: Spider website

Recommend this page to a friend!
Stumble It! Stumble It! Bookmark in del.icio.us Bookmark in del.icio.us
  Classes of Karol Janyst  >  Spider website  >  Download .tar.gz .zip  >  Support forum Support forum (2)  >  Blog Blog  >  RSS 1.0 feed RSS 2.0 feed Latest changes  
Name: Spider website
Base name: spider
Description: Crawl a site and retrieve the the URL of all links
Related classes: , , , , , , ,
Version: 0.1
PHP version: 5.0
License: GNU General Public License (GPL)
All time users: 1757 users
All time rank: 1860
Week users: 3 users
Week rank: 1216
Picture of Karol Janyst
Author: Karol Janyst <e-mail contact>
Packages: 2 Browse this author's classes Browse this author's classes
Country: Poland Poland - PHP jobs in Poland
Age: 24
All time rank: 58213 in Poland Poland
Week rank: 115 Up4 in Poland Poland Up


  Detailed description  
This class can be used to crawl a site and retrieve the the URL of all links.

It can retrieve a page of a site and follow all links recursively to retrieve all the site URLs.

The class can restrict the crawling to URLs with a given extension and avoids accessing pages listed in the site robots.txt file, or pages set with the no index or no follow meta tags.

 

  Groups  
Group folder image HTML HTML generation and processing View top rated classes
Group folder image PHP 5 Classes using PHP 5 specific features View top rated classes
Group folder image Searching Search engines, crawling and indexing View top rated classes

  Rate classes User ratings   Applications   Files Files  

  User ratings  
Ratings
Utility
Consistency
Documentation
Examples
Tests
Videos
Overall
Rank
All time:
Sufficient (66.7%)
Good (87.5%)
-
Sufficient (70.8%)
-
-
Not sure (49.2%)
1554
Month:
Not yet rated by the users

  Applications that use this class  
No application links were specified for this class.
Add link image If you know an application of this package, send a message to the author to add a link here.
  Files folder image Files  
File Role Description
Plain text file spider.class.php Class Main class file
Plain text file example.php Example Example file

Download all files: spider.tar.gz spider.zip
NOTICE: if you are using a download manager program like 'GetRight', please Login before trying to download this archive.