PHP Classes
elePHPant
Icontem

Puller: Scrape product data pages from e-commerce sites

Recommend this page to a friend!
  Info   View files View files (15)   DownloadInstall with Composer Download .zip   Reputation   Support forum   Blog    
Last Updated Ratings Unique User Downloads Download Rankings
2014-05-02 (3 years ago) RSS 2.0 feedNot enough user ratingsTotal: 793 This week: 1All time: 4,174 This week: 708Up
Version License PHP version Categories
puller 0.1.2GNU Lesser Genera...5.3PHP 5, E-Commerce, Web services
Description Author

This package can scrape product data pages from e-commerce sites.

It provides a base class that can retrieve pages from Web sites and extract information from them.

Several implementation classes are provided to retrieve product pages given the product identifier.

Those classes extract relevant product information like the product price, description and picture from certain e-commerce sites.

Currently it provides classes for extracting product information from several Brazilian e-commerce sites like Americanas, NetShoes, Saraiva and Submarino.

Innovation Award
PHP Programming Innovation award winner
June 2013
Winner


Prize: One subscription to the PDF edition of the PHP Architect magazine
Many sites publish listings of products available in different stores.

The information of the products and the prices can be obtained from the stores if they provide it, or scraped directly from the pages of the sites.

This class provides a general solution for scraping product information scraped from the store Web sites.

Different driver classes are provides to scrape information from specific store sites.

Manuel Lemos
  Performance   Level  
Name: Evaldo Barbosa <contact>
Classes: 3 packages by
Country: Brazil Brazil
Age: 36
All time rank: 135690 in Brazil Brazil
Week rank: 548 Up38 in Brazil Brazil Up
Innovation award
Innovation award
Nominee: 3x

Winner: 1x

Details
pp
==

You will get all needed information about a product in brazilian e-commerce sites.

Simple to use, PP is the way to collect data about a product in brazilian e-commerce including price, description and picture.

See the code of tests.

To run the test:

$ bin/phpunit -c tests/phpunit.xml --group=Info

Example:

```php
<?
/**
* Getting information from products
*/

use Puller\Target\SubmarinoProductInfo;

//http://www.submarino.com.br/produto/111970051
$productid = 111970051;

$p = new SubmarinoProductInfo( $productid );
echo $p->productId, "\n", $p->productName, "\n";
print_r( $p->productTable );

$p = new NetShoesProductInfo( '094-0460-014-03' );
echo $p->productId, "\n", $p->productName, "\n";
print_r( $p->productTable );

$p = new PontoFrioProductInfo( 'TelefoneseCelulares/Smartphones/Celular-Desbloqueado-Motorola-RAZR-i-Preto-com-Processador-Intel-de-2-GHz-Tela-de-4-3’’-Android-4-0-Camera-8MP-Wi-Fi-3G-NFC-GPS-e-Bluetooth-1748861.html' );
echo $p->productId, "\n", $p->productName, "\n";
print_r( $p->productTable );
</pre>
```

## Important

> *This software now is under LPGLv3. Se this:*<br/>
> *http://www.gnu.org/copyleft/lesser.txt*
> Winner - June 2013
> <img src="http://www.phpclasses.org/award/innovation/winner.png"/><br/>
> http://www.phpclasses.org/package/8077-PHP-Scrape-product-data-pages-from-e-commerce-sites.html
  Files folder image Files  
File Role Description
Files folder imagesrc (2 directories)
Files folder imagetests (2 files, 1 directory)
Accessible without login Plain text file composer.json Data Auxiliary data
Accessible without login Plain text file LICENCE Data Auxiliary data
Accessible without login Plain text file README.md Data Auxiliary data

  Files folder image Files  /  src  
File Role Description
Files folder imageInfra (1 directory)
Files folder imagePuller (1 file, 1 directory)

  Files folder image Files  /  src  /  Infra  
File Role Description
Files folder imageUtils (1 file)

  Files folder image Files  /  src  /  Infra  /  Utils  
File Role Description
  Plain text file RestRequest.php Class Class source

  Files folder image Files  /  src  /  Puller  
File Role Description
Files folder imageTarget (7 files)
  Plain text file AbstractProductInfo.php Class Class source

  Files folder image Files  /  src  /  Puller  /  Target  
File Role Description
  Plain text file AmericanasProductInfo.php Class Class source
  Plain text file MagazineLuizaProductInfo.php Class Class source
  Plain text file NetShoesProductInfo.php Class Class source
  Plain text file PontoFrioProductInfo.php Class Class source
  Plain text file RicardoEletroProductInfo.php Class Class source
  Plain text file SaraivaProductInfo.php Class Class source
  Plain text file SubmarinoProductInfo.php Class Class source

  Files folder image Files  /  tests  
File Role Description
Files folder imagesrc (1 directory)
  Accessible without login Plain text file bootstrap.php Conf. Configuration script
  Accessible without login Plain text file phpunit.xml Data Auxiliary data

  Files folder image Files  /  tests  /  src  
File Role Description
Files folder imagePuller (1 file)

  Files folder image Files  /  tests  /  src  /  Puller  
File Role Description
  Accessible without login Plain text file ProductInfoTest.php Test Class source

 Version Control Unique User Downloads Download Rankings  
 100%
Total:793
This week:1
All time:4,174
This week:708Up