PDF to Word

Extract text from PDF documents and download it as a Word (.docx) or text file.

Free to use. Runs in your browser.

Extract text from a PDF and download it as a Word (.docx) file.

Works best with selectable text. Selected files are handled in your browser after the page loads.

Drop PDF file here

or click to browse

Note: This tool extracts text from text-based PDFs and generates a proper .docx file. For scanned documents or complex layouts with tables and images, results may vary. The output preserves paragraph structure but not advanced formatting.

What Actually Happens When You Convert PDF to Word

PDFs and Word documents are built on completely different philosophies. A PDF says "draw this text at coordinates (72, 340) using 12pt Helvetica." A Word document says "this is a paragraph in the Body style, flow it however you need to." Converting between them means reverse-engineering a visual layout back into a structured, editable document.

This tool uses PDF.js (Mozilla's open-source PDF engine) to extract text content from each page, then the docx library to build a proper .docx file that Microsoft Word, Google Docs, and LibreOffice can all open. It preserves paragraph breaks, page structure, and text flow, but complex visual layouts may simplify during conversion.

Think of it like translating between languages. Simple sentences come across cleanly, but design-heavy pages lose nuance. A straightforward letter usually needs little cleanup. A brochure with text wrapping around images at angles will need manual work.

After the page has loaded, selected PDFs are handled in your browser and are not uploaded to an iForge Apps server by this tool. Still treat contracts, financial reports, HR documents, and other confidential files carefully, especially on shared devices.

Choosing a Conversion Approach

Approach	Good For	Limits	Privacy Check
Browser text extraction	Simple text-based PDFs, reports, letters, notes	No OCR, images, live form fields, or full layout reconstruction	Selected file is handled in the browser after page load
Full PDF editor	Complex tables, columns, forms, and design-heavy files	May require paid software or a larger local install	Check whether the editor works locally or sends files to cloud processing
OCR workflow	Scanned PDFs and photos of text	Can misread characters, tables, handwriting, and faint scans	Check where image recognition runs before using confidential files
Manual copy and rebuild	Short documents where formatting must be deliberate	Slower for long files, but gives the most control	Depends on the apps you use to view and edit the file

What this means for you: Use this tool when text extraction is enough. Use a fuller editor or OCR workflow when you need layout reconstruction, scanned text recognition, or form-field preservation.

Will My Document Convert Well?

Conversion quality depends entirely on the type of PDF. Search for your document type below to see what to expect before you start.

Document Type	Text Extraction	Layout Kept	Difficulty	Tip
Simple letter or email	Excellent	Excellent	Easy	Usually keeps the text cleanly, just check paragraph spacing
Report with headings	Excellent	Good	Easy	Heading hierarchy may flatten to same size; reapply heading styles in Word
Resume / CV	Good	Variable	Medium	Simple 1-column CVs convert well; 2-column designs often break
Invoice or receipt	Good	Fair	Medium	Table structure may simplify; verify numbers are intact
Legal contract	Excellent	Good	Easy	Text is linear; numbered clauses convert well. Check footnotes.
Academic paper	Good	Fair	Medium	Equations may convert as images or garbled text. 2-column layout may merge.
Presentation slides	Good	Poor	Hard	Each slide becomes scattered text boxes. Use PDF to JPG instead.
Newsletter / magazine	Fair	Poor	Hard	Complex multi-column layouts with wrapping rarely survive conversion
Form (fillable)	Good	Poor	Hard	Form field labels extract; interactive fields are lost. Re-create in Word.
Scanned document	None	None	Impossible	No text to extract, use OCR (Image to Text) first, then convert
Brochure / flyer	Fair	Poor	Hard	Design-heavy PDFs don't map to Word's flow layout. Use for text extraction only.
Bank statement	Good	Fair	Medium	Tabular data extracts but may lose column alignment. Verify amounts.
eBook / long document	Good	Good	Easy	Long flowing text converts well. Table of contents links will be lost.
Government form (pre-filled)	Variable	Poor	Hard	Some use custom fonts that garble during extraction. Try copy-paste test first.
Blueprint / technical drawing	Minimal	None	Impossible	These are essentially images with annotations. Use PDF to JPG instead.

Quick test: Open your PDF and try to select text with your cursor. If you can highlight individual words, the PDF has extractable text and this tool will work. If you can only select the whole page as a block (or nothing at all), it's a scanned/image PDF, use Image to Text (OCR) first.

Worked Example: Editing a Client Contract

Priya is a freelance consultant. A client sent her a contract as a PDF. She needs to change the project scope, update the payment terms, and add her company details before signing.

1.Quick test: Priya opens the PDF and highlights text with her cursor, it selects cleanly. Good sign, this is a text-based PDF, not a scan.
2.Convert: She selects the 8-page contract in this tool. Processing takes a few seconds. She downloads a .docx file.
3.Review: She opens the file in Word. The numbered clauses and paragraph text came through cleanly. Two things need fixing: the table of charges lost its column alignment, and the footer text moved into the body.
4.Edit: Takes 5 minutes to fix the table, update the scope section, change payment terms from 30 to 14 days, and add her company details.
5.Track changes: Uses Word's Track Changes feature so the client can see exactly what was modified. Saves as PDF and sends back.

Time saved: Retyping an 8-page contract from scratch would take an hour. Converting and editing took 7 minutes including cleanup.

What Gets Preserved (and What Doesn't)

Element	Preserved?	Details
Body text	Yes	All extractable text comes through accurately
Paragraph breaks	Yes	Line breaks and paragraph separations preserved
Page breaks	Yes	Each PDF page starts on a new page in Word
Font styling (bold, italic)	No	Bold, italic, and font changes are not carried over. The output is clean, uniform text.
Heading hierarchy	Partial	Headings extract as text; Word heading styles need manual reapplication
Bullet and numbered lists	Partial	Text preserved; may convert to plain text with manual bullet characters
Simple tables	Partial	Text from tables extracts; cell structure may simplify
Images	No	Embedded images are not carried to the Word file
Hyperlinks	No	URL text appears but links are not clickable in the output
Headers and footers	Partial	May extract as body text rather than Word header/footer fields
Multi-column layout	No	Columns merge into single-column flow
Form fields	No	Interactive form elements are lost; label text extracts

What this means for you: Text content is faithfully extracted. Structure and layout are best-effort. For simple documents, the output is ready to use. For complex layouts, expect 5-10 minutes of cleanup in Word, still much faster than retyping from scratch.

Post-Conversion Cleanup Checklist

After converting, open the Word file and run through these checks. Most take under a minute each:

Reapply heading styles

Select each heading and apply Word's Heading 1/2/3 styles. This fixes the document outline, enables auto-generated table of contents, and makes the structure navigable.

Fix bullet and numbered lists

Select list items and apply Word's built-in List Bullet or List Number styles. This gives you proper indentation and auto-numbering instead of manual characters.

Rebuild tables

If table data extracted as plain text with inconsistent spacing, select the text, use Insert → Table → Convert Text to Table. Set the delimiter to tabs or spaces.

Re-add hyperlinks

URL text survives but may not be clickable. Select each URL and use Ctrl+K (Cmd+K on Mac) to make it a working hyperlink.

Move headers/footers

If header/footer text ended up in the body, cut it and paste into Word's actual header/footer area (Insert → Header → Edit Header).

Spot-check numbers and dates

Verify that financial figures, dates, and reference numbers converted correctly. These are the highest-stakes elements, one wrong digit in a contract amount causes real problems.

PDF to Word or PDF to Image: Choose Wisely

Choose PDF to Word when...

You need to edit the text content
You want to reuse paragraphs in another document
You need to reformat, restyle, or translate the content
You want to enable Track Changes for collaborative editing
The document is text-based (not scanned)

Choose PDF to JPG when...

You need a visual snapshot of the page
You're embedding in a presentation or slide deck
The visual layout matters more than the text
You're sharing on social media or messaging
The PDF is heavily designed (newsletters, brochures)

Troubleshooting Common Issues

"The Word file is mostly blank"

Your PDF is a scanned document, it contains images of text, not actual text data. The tool can only extract text that exists as text in the PDF structure. Use Image to Text (OCR) to convert the scanned images to editable text first.

"Text is in the wrong order"

This happens with complex layouts where text boxes overlap visually but are stored in a different sequence internally. PDFs don't guarantee reading order for non-linear designs. You'll need to rearrange paragraphs manually in Word. This is most common with 2-column layouts and magazine-style designs.

"Characters are garbled or replaced with symbols"

Some PDFs use custom font encoding that maps characters non-standardly, especially older PDFs created by design software (InDesign, Illustrator). Try a full PDF editor, or use PDF to JPG and OCR as a workaround.

"The PDF is password-protected"

Use Unlock PDF to remove restrictions first, then convert. You'll need to enter the password if it's an "open" password. If you don't have the password, you can't convert the file.

"Tables came through as plain text"

PDF tables aren't real tables, they're text positioned to look like a grid. The converter extracts the text but may lose the grid structure. In Word, select the extracted table text and use Insert → Table → Convert Text to Table to rebuild it.

"The document looks nothing like the original"

If the PDF was created in a design tool (InDesign, Canva, Illustrator), it uses layout techniques that Word can't replicate, text on paths, complex text wrapping, overlapping elements. For these, extract just the text content and rebuild the layout in Word from scratch, or use PDF to JPG for a visual reference.

Common Mistakes

Not testing text selectability first

Open the PDF and try highlighting text with your cursor. If you can't select individual words, it's a scanned document, this tool won't help. Use OCR first.

Expecting pixel-perfect layout

PDF to Word conversion is fundamentally about text extraction, not layout recreation. Simple layouts survive well; complex designs need cleanup. If layout matters most, use PDF to JPG instead.

Not verifying numbers after conversion

Financial figures, dates, and reference numbers are the highest-stakes content. Always spot-check these in the output, one wrong digit in a contract can cause serious problems.

Converting when sharing is enough

If you just need someone to read the document (not edit it), send the PDF. Every device can open PDFs. Converting to Word adds an unnecessary step and risks formatting issues.

Forgetting to re-add links

Hyperlinks convert as plain text, the URL is there but not clickable. If links are important, go through the document and manually recreate them using Ctrl+K.

Not saving a backup of the original PDF

Always keep the original PDF. If the conversion isn't perfect, you can try again with different settings or use it as a visual reference while cleaning up the Word file.

Related Tools

PDF to JPG

Convert pages to images when layout matters most

Image to Text (OCR)

Extract text from scanned documents

Unlock PDF

Remove restrictions before converting

Split PDF

Extract specific pages to convert

Merge PDF

Combine PDFs before converting

Word Counter

Count words in the extracted text

How to use this tool

Select a text-based PDF file

Preview the extracted content

Download as Word or plain text

Common uses

Extracting editable text from PDF reports for revision or quoting
Extracting text from contract drafts for review notes
Pulling text from academic papers or articles for research notes
Pulling form labels and notes into an editable document
Converting text-based PDFs into formats easier to work with

Share this tool

Frequently Asked Questions

How accurate is the PDF to Word conversion?

This tool works best with text-based PDFs. It extracts text content and keeps basic paragraph structure. Complex layouts, tables, and images may need manual cleanup.

Can it convert scanned PDFs?

Scanned PDFs contain images, not text. This tool does not perform OCR. For scanned documents, use an OCR tool first.

What format is the output?

The output is a .docx file that opens in common word processors such as Microsoft Word, Google Docs, and LibreOffice.

How do I know if my PDF has selectable text?

Open the PDF in any viewer and try to highlight text with your cursor. If you can select it, this tool will extract it. If not, it's likely a scanned image.

Why is some text missing from the output?

Text in images, unusual fonts, or heavily designed layouts may not extract. The tool reads the PDF text layer, anything rendered as graphics won't transfer.

Can I convert a password-protected PDF?

Remove the restrictions first with our Unlock PDF tool, then select the unprotected file for conversion.

Will tables convert properly?

Simple tables may flatten into plain text. For complex tables, copy them manually from the PDF viewer or use a full PDF editor.

Where is my PDF processed?

After this page has loaded, the selected PDF is handled in your browser and is not uploaded to an iForge Apps server by this tool.

Can I also download as plain text?

Yes. After extraction, you can download as .docx (Word) or .txt (plain text), or copy the text directly to your clipboard.

Does the conversion keep formatting like bold and italic?

Basic paragraph structure is preserved, but bold, italic, and font changes are not carried over. The output is clean, uniform text.

What's the maximum file size?

There is no fixed site limit, but large PDFs may be slow or fail if your browser runs out of memory.

Can I convert multiple PDFs at once?

This tool handles one PDF at a time. To batch-convert, use our Merge PDF tool to combine files first, then convert the single merged document.

Results are for general informational purposes only and should be checked before use. They are not professional advice. See our Disclaimer and Terms of Service.