PHP Classes

Web scraper: Extract information from Web site pages

Recommend this page to a friend!
Stumble It! Stumble It! Bookmark in Bookmark in
  Info   Screenshots Screenshots   View files View files (5)   DownloadInstall with Composer Download .zip   Reputation   Support forum (1)   Blog    
Last Updated Ratings Unique User Downloads Download Rankings  
2011-10-04 (3 years ago) RSS 2.0 feedNot enough user ratingsTotal: 1,601 All time: 2,309 This week: 669Up
Version License PHP version Categories  
web-scraper 1.0BSD License5.0HTML, PHP 5, Web services
Description Author  

This class can extract information from Web site pages.

It can either retrieve a single page, a group of pages from a Web site given a base URL and a range of values that will replace template parameters in that URL, or a group of pages with given URLs.

The class can use selector path values to define elements in the page from which it will extract the relevant page content values.

Innovation Award  
PHP Programming Innovation award winner
October 2011

Prize: One subscription to the PDF edition of the PHP Architect magazine
Some applications need to retrieve information that is only available to the public in Web site pages.

This class makes it easier to retrieve and parse many Web pages at once to extract information that is displayed in the same relative position of the pages.

Manuel Lemos
Picture of Jacek Lukasiewicz
Name: Jacek Lukasiewicz is available for providing paid consulting. Contact Jacek Lukasiewicz .
Classes: 6 packages by
Country: Poland Poland
Age: 40
All time rank: 3054 in Poland Poland
Week rank: 148 Up4 in Poland Poland Equal
Innovation award
Innovation award
Nominee: 2x

Winner: 2x

  • screen.jpg
  Files folder image Files  
File Role Description
Files folder imagelib (1 file)
Accessible without login Plain text file documentation.txt Doc. documentation
Accessible without login Plain text file index.php Example example using
Accessible without login Plain text file scraper.php Class scraper class
Accessible without login Plain text file test.html Data test file

  Files folder image Files  /  lib  
File Role Description
  Plain text file phpQuery-onefile.php Class phpQuery library

 Version Control Unique User Downloads Download Rankings  
 0%Total:1,601All time:2,309
 This week:0This week:669Up
 User Comments (1)  
Easy to use, working well.
3 years ago (miron1)