|All requests||>||PHP docx to html with images converter||>||Request new recommendation||>||Featured requests||>||No recommendations|
by samy tech - 2 months ago (2020-02-05)
I have to import data from word file in database, so i searched for that , but didn't get any valid tutorial, that's why I have to try to convert a MicroSoft Word to HTML then get HTML data with the help of Curl into a database .
If you have any idea regarding this please share with me.
The DOCX format in reality is a ZIP archive with an HTML document inside. That HTML document uses some custom styles defined by MicroSoft Word but you should be able to use that HTML document for rendering purposes.
So you can use the code of the recommended class above to extract the HTML that is inside.
If you want to extract specific sections you can use PHP DOM HTML classes or other custom HTML parser classes like this one below that I developed: