Anyway, I was just looking at doing something like this, maybe, if I could do it quickly enough. I have around 3000 .pdf documents that need the text extracted and archived. I think what the boss wants to end up doing is have Adobe do it in some way. If you come up with something, please let the rest of us know.
You know why they call it Adobe Acrobat? Because you have to be an acrobat to use it. Ughhhh...I hate .pdf to begin with.
From what I know about the PDF format, though, I really don't think it would be difficult to code such a routine from scratch. It's something I'd be interested in looking at when I get time.
I may look like a mule, but I'm not a complete ass.
There you are, I tested it briefly and didn't find any bugs. Let me know if you find one.
As I already have an EsGrid license I want you to donate the money to Srod,
he truly deserves it!
Caught this thread by a happy accident. @milan1612 - I tested your code on two different PDF files and it only wrote a 0 byte text file. Do you have a small PDF file that worked on your system for me to test on mine?
Marco, please - if I can, whilst it's a very kind offer and much appreciated, would you mind donating to Purebasic instead; I think that Fred and co are more deserving than I.