Text extraction from docx, doc, Excel, ppt, pdf, etc. formats
#1

[eluser]ethereal1m[/eluser]
Dear all,
Is there a library that supports text extraction from docx, doc, Excel, PDF, and similar formats, the way Apache POI does in Java?

Or should I port the Apache POI classes to CodeIgniter code?

Best regards,
ethereal1m
#2

[eluser]weboap[/eluser]
I've never tried this, but take a look:
http://davidwalsh.name/read-pdf-doc-file-php
#3

[eluser]ethereal1m[/eluser]
@weboap,
Unfortunately, the app doesn't read the docx format.
#4

[eluser]CroNiX[/eluser]
There's a docx library for Zend Framework, which you can use in CI.
#5

[eluser]Samus[/eluser]
[quote author="ethereal1m" date="1336707421"]@weboap,
Unfortunately, the app doesn't read the docx format.[/quote]
It's not CodeIgniter itself: PHP can't read formats such as .docx by default, so you're always going to need some third-party library.
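For the .docx case specifically, here is a minimal sketch, assuming PHP's zip extension (ZipArchive) is enabled. A .docx file is a ZIP archive whose main body text is stored in word/document.xml, so plain text can be pulled out by reading that entry and stripping the XML tags. The function name docx_to_text is hypothetical, not part of CodeIgniter or any library mentioned in this thread, and the sketch ignores headers, footers, footnotes, and empty paragraphs. It does nothing for doc, Excel, ppt, or pdf files.

```php
<?php
// Hypothetical helper (an assumption for illustration, not a library API).
// Requires PHP's zip extension, which provides ZipArchive.
function docx_to_text($path)
{
    $zip = new ZipArchive();

    // open() returns true on success and an error code otherwise.
    if ($zip->open($path) !== true) {
        return false;
    }

    // The main document body lives in this entry of the archive.
    $xml = $zip->getFromName('word/document.xml');
    $zip->close();

    if ($xml === false) {
        return false;
    }

    // Each paragraph ends with </w:p>; turn those into line breaks
    // so paragraphs do not run together once the tags are removed.
    $xml = str_replace('</w:p>', "\n", $xml);

    // Drop all remaining markup, then decode XML entities such as &amp;.
    $text = strip_tags($xml);

    return html_entity_decode($text, ENT_QUOTES | ENT_XML1, 'UTF-8');
}
```

Usage would look something like `echo docx_to_text('/path/to/report.docx');` where the path is a placeholder. For anything beyond plain paragraph text, a dedicated third-party library is still the safer route.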
#6

[eluser]ethereal1m[/eluser]
@Samus,
yes, I'm looking for a third-party library that somebody could recommend.



