This class retrieves and processes content from a site at a given URL or from a given file.
It can split the retrieved data into multiple blocks using a given block separator, strip JavaScript code, strip HTML tags, and extract data that matches a given regular expression.
The comments in the class code are in Portuguese.
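
The processing steps described above (load from a URL or file, strip JavaScript, split into blocks, strip HTML tags, extract with a regular expression) can be illustrated with a minimal Python sketch. This is not the class's own code or API: the function names `load_content`, `strip_javascript`, `strip_tags`, `split_blocks`, and `extract`, as well as the sample markup, are hypothetical and chosen only for illustration. The regex-based tag stripping is a deliberate simplification and would likely miss edge cases that a real HTML parser handles.

```python
import re
from urllib.request import urlopen


def load_content(source: str) -> str:
    """Read text from a URL (http/https prefix) or, otherwise, from a local file path."""
    if source.startswith(("http://", "https://")):
        with urlopen(source) as response:
            # Fall back to UTF-8 when the server does not declare a charset.
            charset = response.headers.get_content_charset() or "utf-8"
            return response.read().decode(charset, errors="replace")
    with open(source, encoding="utf-8", errors="replace") as handle:
        return handle.read()


def strip_javascript(html: str) -> str:
    """Remove <script>...</script> elements, including their contents."""
    return re.sub(r"<script\b[^>]*>.*?</script\s*>", "", html,
                  flags=re.IGNORECASE | re.DOTALL)


def strip_tags(html: str) -> str:
    """Remove anything that looks like an HTML tag (a crude approximation)."""
    return re.sub(r"<[^>]+>", "", html)


def split_blocks(text: str, separator: str) -> list:
    """Split text on a literal separator, dropping empty blocks."""
    return [block.strip() for block in text.split(separator) if block.strip()]


def extract(text: str, pattern: str) -> list:
    """Return all regex matches (captured groups, if the pattern defines any)."""
    return re.findall(pattern, text)


# Hypothetical sample input standing in for downloaded page content.
sample = "<p>Price: 10</p><script>var x = 1;</script><hr><p>Price: 25</p>"

# Strip scripts first, split on the separator, then clean and extract per block.
blocks = split_blocks(strip_javascript(sample), "<hr>")
prices = [extract(strip_tags(block), r"Price: (\d+)") for block in blocks]
```

In this sketch the separator is matched before tags are stripped, so an HTML element such as `<hr>` can serve as the block separator. Whether the original class applies its steps in this order is an assumption.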