PHP Classes

Web scraper: Extract information from Web site pages

Last Updated: 2011-10-04 (4 years ago)
Ratings: Not enough user ratings
Unique User Downloads: Total: 1,618; This week: 1
Download Rankings: All time: 2,303; This week: 1,103 (up)
Version: web-scraper 1.0
License: BSD License
PHP version: 5.0
Categories: HTML, PHP 5, Web services
Description

This class can extract information from Web site pages.

It can retrieve a single page, a group of pages from a Web site given a base URL and a range of values that replace template parameters in that URL, or a group of pages at given URLs.

The class can use selector paths to identify the page elements from which it extracts the relevant content values.
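
The two ideas described above, expanding a URL template over a range of values and pulling values out of a page by a path expression, can be sketched roughly as follows. This is not the package's own API: the function names `expandUrlTemplate` and `extractByPath`, the `{page}` placeholder syntax, and the example URL are all hypothetical, and the sketch uses PHP's built-in DOM extension with XPath rather than the phpQuery selectors the package ships with. It assumes PHP 7 or later and works on an inline HTML string, so it makes no network requests.

```php
<?php
// Hypothetical helper: build one URL per value in [$from, $to] by
// substituting the value for the "{page}" placeholder (assumed syntax).
function expandUrlTemplate(string $template, int $from, int $to): array
{
    $urls = [];
    for ($i = $from; $i <= $to; $i++) {
        $urls[] = str_replace('{page}', (string) $i, $template);
    }
    return $urls;
}

// Hypothetical helper: return the trimmed text of every node that
// matches an XPath query in the given HTML string.
function extractByPath(string $html, string $xpath): array
{
    // Collect libxml parse warnings internally instead of printing them,
    // since real-world HTML is rarely perfectly formed.
    libxml_use_internal_errors(true);
    $doc = new DOMDocument();
    $doc->loadHTML($html);
    libxml_clear_errors();

    $values = [];
    foreach ((new DOMXPath($doc))->query($xpath) as $node) {
        $values[] = trim($node->textContent);
    }
    return $values;
}

// Three page URLs generated from one template.
$urls = expandUrlTemplate('http://example.com/list?page={page}', 1, 3);

// Values that sit in the same relative position of the page.
$html = '<html><body><ul>'
      . '<li class="title">First</li>'
      . '<li class="title">Second</li>'
      . '</ul></body></html>';
$titles = extractByPath($html, '//li[@class="title"]');

print_r($urls);
print_r($titles);
```

In a real scraper, each generated URL would be fetched and its body passed to the extraction step, so the same path expression is applied to every page in the group.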

Innovation Award  
PHP Programming Innovation award winner
October 2011

Prize: One subscription to the PDF edition of the PHP Architect magazine

Some applications need to retrieve information that is only available to the public in Web site pages.

This class makes it easier to retrieve and parse many Web pages at once to extract information that is displayed in the same relative position of the pages.

Manuel Lemos
Author
Name: Jacek Lukasiewicz (available for paid consulting)
Classes: 6 packages
Country: Poland
Age: 40
All time rank: 2984 in Poland
Week rank: 86 (up), 1 in Poland (up)
Innovation award: Nominee 2x, Winner 2x

Files
File                  Role     Description
lib/ (1 file)
documentation.txt     Doc.     documentation
index.php             Example  example using
scraper.php           Class    scraper class
test.html             Data     test file

Files / lib
File                  Role     Description
phpQuery-onefile.php  Class    phpQuery library

User Comments (1)
Easy to use, working well.
4 years ago (miron1)