Technology in terms you understand. Sign up for the Confident Computing newsletter for weekly solutions to make your life easier. Click here and get The Ask Leo! Guide to Staying Safe on the Internet — FREE Edition as my thank you for subscribing!

How do I convert a PDF document into a Word document?

Question:

I am trying to convert a PDF into a word document and there does not seem to
be any free programs that will do this. I have tried several. Can you help?

Probably not.

Your question is a common one, but it represents a fundamental
misunderstanding of exactly what PDF documents are and how they were intended
to be used.

I’ll put it another way: let me explain why I can’t help you.

]]>

PDF Format

PDF, or Portable Document Format, is a document format that is intended to address the fact that every computer is different from almost every other computer in some ways. Frequently, those ways affect how documents are displayed and/or printed.

For example, a Word document you receive might use fonts that aren’t installed on your machine, so when displayed Word has to pick a “close” font – but it’s still different. Perhaps your printer has a minimum margin of 1/2 an inch, but your friend’s printer can handle 1/8th of an inch – when you each print the same Word document it wraps, paginates and fundamentally looks different depending on the specifics of your printer.

“The upshot is that PDF is fundamentally a display format.”

The things that contribute to visible differences in document presentation aren’t limited to fonts and printers, nor are they limited to Word documents – almost any program that produces a document is susceptible to all sorts of system-to-system differences.

That’s what PDF format sets out to solve: a PDF document is intended to look the same everywhere, and in practice it pretty much does. And across not only a wide variety of computers, but also these days on devices ranging from portable readers to cell phones. It’s a pretty powerful concept.

PDF’s Intent

I want to stress that last point again: “a PDF document is intended to look the same everywhere.”

The upshot is that PDF is fundamentally a display format. It’s often been termed “electronic paper” and that’s a great way to think of it. One of the most common ways to create a PDF file is to install a virtual printer driver and “print” one.

Display. Output.

It was never intended that a PDF document be used as “input” to some process to extract, edit or copy its contents.

Editing or Converting PDF’s

What do you do after you print a document to paper and find an error that needs to be corrected?

You reprint it.

That’s the “correct” way to make a change in a PDF – alter the original document that it was created from and re-create the PDF.

That’s also the “correct” way to “convert” it to a Word document – keep the original Word (or other) document around for editing purposes.

One of the problems is that there’s really no always-effective way to get the data back out of a PDF. PDF’s can contain so many different kinds of data – text, pictures, pictures of text – that getting data out could be as simple as copy/paste, and as complex has having to run every page through OCR (optical character recognition) software to recover text in some kind of editable format.

In addition, the information in a PDF is organized for layout, page by page. All your careful organization by topic and chapters, with paragraphs that flow neatly from one to another as you edit the document is completely replaced with an organization that reflects the physical layout of the information of the page. The display organization of a document is often very different than your conceptual organization of the document’s contents.

PDF Editing and Extraction Tools

There are tools and as you’ve seen they often don’t work, don’t work well, don’t work completely, or don’t work the way we really want them to.

For example I can’t point you to a tool that’ll take a PDF in and produce a Word document out that matches the Word document you probably expect, if for no other reason that it’s difficult if not impossible to reconstruct the document’s logical layout from its physical. Tools can make lots of assumptions, but that’s all they are – assumptions, and by their very nature they’re often wrong.

Editing tools do exist – the makers of Foxit Reader, for example, have several. But once again not everything works the way you might expect: yes, you may be able to change a typo, but adding a paragraph or picture that would cause the entire document to re-flow and re-paginate is probably not something that’ll work as expected, if at all.

And of course not every PDF is editable – either by design (encryption or password protection when the PDF is created), or by practicality. A PDF that’s built from images – say .jpg’s – of pages may look exactly like a PDF that’s made from a Word document, but the ability to go back to the original text is severely impaired.

Do this

Subscribe to Confident Computing! Less frustration and more confidence, solutions, answers, and tips in your inbox every week.

I'll see you there!

19 comments on “How do I convert a PDF document into a Word document?”

  1. Well, you can try saving it as a word doc & if you are lucky, you can play around with it or yank your hair out trying, lol or save it as text, then copy to Word & format the whole doc to your liking. PDF’s are meant to be as is with no changes.

    Reply
  2. @Maria. The only way you can save a pdf document as a Word file is if you have a program such as Adobe Acrobat or Foxit Pro. Adobe & Foxit Readers can’t do that. An OCR program can, but the output would need a bit of work to get it formatted like the original. You can select, copy and paste the data into Word that you want to copy, but often the layout will be so weird it would take a long time to fix up. One of the best OCR programs I’ve found for preserving layout is FineReader but the Word document it produces can sometimes be very hard to edit properly.

    Reply
  3. I thought one of the purposes of PDF was security. Rather than send an editable Word or Excel document via email, some people deliberately convert to PDF so that the recipient cannot tamper with it.

    In the work environment we sometimes pull up reports from our accounting program, save them as PDFs, then use a program called Able2Extract to convert them to Excel spreadsheets if we want to work with the information. There’s an option for Word conversions as well.

    Reply
  4. The best PDF to Word converter, which I use every day, is ABBYY PDF Transformer. It will produce a Word document that exactly matches the original PDF, once you get to know how to use it. Unfortunately, its a professional tool and is not cheap (about $100).

    Reply
  5. My major problem with PDF conversions is that I am subject to notification mandates from the local federal court, and the list of people to be notifed are furnished to me in PDF format. The PDFs are supposed to be compatible with Avery labels, but they seldom are. I need to convert documents that I did not originate in order to generate mailing labels. So far the only way I have found to do this is the old fashioned manual keyboard.

    Reply
  6. Gmail just added the ability to convert pdf to doc when uploading to docs. Have not tried it myself, but since it’s google I have the feeling it’ll not be too bad.

    Reply
  7. Simple, free and very effective conversion can be done using free OpenOffice and free Sun PDF Import Extension.

    Reply
  8. Well! respected friend this time you have to use a technique which will pass from 3 Stages by using 2 Softwares, you will find a good result.
    2 Softwares named below:
    1) Adobe Acrobat 3D Ver.8/ use any software that convert PDF file into Image.
    2) OmniPage 14 that will scan image as it is then you can save it as Word document file.

    Remember in your prayers.

    I can help you if you need any more about it regarding software.

    Reply
  9. I have found that if you click edit>select all>copy then open a word blank page and paste you can save the pasting as a word .doc document. Granted you lose the formatting but you can alter the pasted copy any way you want, even filling in the blanks if so desired.(again, this changes the formatting but it’s really quite simple to fix…)

    Reply

Leave a reply:

Before commenting please:

  • Read the article.
  • Comment on the article.
  • No personal information.
  • No spam.

Comments violating those rules will be removed. Comments that don't add value will be removed, including off-topic or content-free comments, or comments that look even a little bit like spam. All comments containing links and certain keywords will be moderated before publication.

I want comments to be valuable for everyone, including those who come later and take the time to read.