This class can retrieve Web pages and parse them to extract the list of their links to continue crawling all linked pages.
The pages may be retrieved iteratively until it is reached a given limit of pages or link depth.
It is possible to set regular expressions for both link definitions and content matches, changeable at every depth.
| Ratings | Utility |
Consistency |
Documentation |
Examples |
Tests |
Videos |
Overall |
Rank |
| All time: |
Sufficient (65.0%) |
Sufficient (70.0%) |
Not sure (55.0%) |
Not sure (50.0%) |
- |
- |
Not sure (50.0%) |
1314 |
| Month: |
Not yet rated by the users |
| Link |
Description |
| mytube |
retrieves performance data from london underground ("the tube") |
| mytube2 |
Data on performance of London Underground |

If you know an application of this package, send a message to the
author to add a link here.
| File |
Role |
Description |
overview.txt |
Doc. |
Overview for spiderClass.php |
spiderClass.php |
Class |
spiderClass source file |
spiderExamp.php |
Example |
Example usage - NOTE: BE KIND TO THE BBC AND DO NOT RUN THIS WITHOUT CHANGING THE PARAMETERS |