This class can be used to crawl Web pages and scrape e-mail addresses into a MySQL database.
It retrieves a given start Web page, follows its links to retrieve other pages and any other pages linked from those. It does not follow links with URLs that have certain file name extensions.
The class scrapes each retrieved page to extract any e-mail addresses found in those pages. |