I got this email from my friend a few days back:
"Hi, do you have any idea of a software that I could use to convert pdf to Word. I'm working on a movie but I've got the script in PDF... I want to convert it to Word. I tried but i am not able to copy it... any idea???"
Most probably the PDF stores the text as images (likely because it's a scanned page saved as a PDF). It is usually difficult to copy text out of such PDF documents unless the tool can use character recognition (OCR) to figure out the text.
But the question remains: can we convert a PDF document to MS Word?
I searched and found plenty of software that claims to do so, but all of it turned out to be closed source, costly and at times not good enough. I was looking for an open source option to do the same.
So finally I advised him to use pdftohtml to convert the PDF document to HTML, then open the resulting HTML file in MS Word and copy the text out from there.
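For anyone who wants to script the first half of that route, here is a rough sketch in Python. It assumes the `pdftohtml` command-line tool that ships with Poppler is installed and on your PATH, and that it accepts the `-noframes` and `-i` options. The function names and the `script.pdf` file name are my own, just for illustration.

```python
import shutil
import subprocess
from pathlib import Path


def build_pdftohtml_command(pdf_path, out_base=None):
    """Return the argument list for a single-file HTML conversion."""
    pdf = Path(pdf_path)
    # Default output base: same folder and name as the PDF, minus the extension.
    base = Path(out_base) if out_base else pdf.with_suffix("")
    # -noframes: write one HTML file instead of a frameset
    # -i: skip images, since only the text is wanted
    return ["pdftohtml", "-noframes", "-i", str(pdf), str(base)]


def convert_pdf_to_html(pdf_path, out_base=None):
    """Run pdftohtml and return the HTML path it is expected to write."""
    if shutil.which("pdftohtml") is None:
        raise RuntimeError("pdftohtml was not found on PATH")
    cmd = build_pdftohtml_command(pdf_path, out_base)
    subprocess.run(cmd, check=True)  # raises CalledProcessError on failure
    # Assumption: the tool appends ".html" to the output base name.
    return Path(cmd[-1] + ".html")
```

Calling `convert_pdf_to_html("script.pdf")` should leave a `script.html` next to the PDF, which can then be opened in MS Word. If the PDF is a scanned image, there will be little or no text in the HTML either, for the same reason copy and paste fails.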
I know this is a cross-country trip of a solution, but if someone has a better way to do the same using only open source software, please let us know.