• 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
Text extraction from docx, doc, excel, ppt, pdf, etc formats

#1
[eluser]ethereal1m[/eluser]
Dear all,
is there any library that supports text extraction from docx,doc, excel, pdf, etc formats like Apache POI does on Java?

Or should I port Apache POI classes to Codeigniter code?

best regards,
ethereal1m

#2
[eluser]weboap[/eluser]
never tried this. but look it up
http://davidwalsh.name/read-pdf-doc-file-php

#3
[eluser]ethereal1m[/eluser]
@weboap,
unfortunately the app doesn't read docx format....

#4
[eluser]CroNiX[/eluser]
There's a docx library for zend framework, which you can use in CI.

#5
[eluser]Samus[/eluser]
[quote author="ethereal1m" date="1336707421"]@weboap,
unfortunately the app doesn't read docx format....[/quote]
it's not codeigniter itself, PHP can't read extensions such as .docx by default, you're always going to need some third party library or something.

#6
[eluser]ethereal1m[/eluser]
@samus,
yes, I'm looking that 3rd party library that anybody could recommend one....


Digg   Delicious   Reddit   Facebook   Twitter   StumbleUpon  


  Theme © 2014 iAndrew  
Powered By MyBB, © 2002-2020 MyBB Group.