|All requests||>||What is the best PHP pdf to html class?||>||Request new recommendation||>||Featured requests||>||No recommendations|
by VEDPRAKASH PODDAR - 3 years ago (2016-07-01)
I need to convert PDF document to HTML using PHP.
Converting PDF to HTML in pure PHP is hard. There are some packages for that but they rely on external programs, so they are not in pure PHP.
On the other hand this PDF to text class can be the basis for generating HTML from a PDF document. Maybe with some work it can extract more than just the text.
1. by Christian Vigh - 3 years ago (2016-07-03) Reply
As Manuel said, extracting text from PDF is hard since you have to face so many different situations.
There are commercial products that exist.
On the other hand, my PdfToText class was aimed at extracting only text from a PDF file.
Thanks to phpclasses users, this class is constantly evolving, since I received many many samples that presented issues, which helped making it better for interpreting pdf contents.
So, please feel free to have a try with it. It's a complete standalone PHP class, that do not use any external tool at all. It can even extract individual page contents and images.
And, of course, if you encounter issues when extracting text from your samples, please feel free to send them to me at this address :
I will be happy to handle the issues that will help me enhance my class.