Image to Text (OCR)
Extract text from images using AI-powered OCR. 100% private, runs entirely in your browser. Supports 18+ languages.
Upload any image and our browser-based OCR engine will extract all readable text instantly.
Powered by Tesseract.js WASM. Your images never leave your device.
Drop an image here or click to upload
PNG, JPG, WebP, BMP, GIF, Max 20 MB
How OCR Works
OCR (Optical Character Recognition) analyses pixel patterns in an image to identify letters, numbers, and symbols. The software breaks the image into small regions, compares each against a trained model of known characters, and outputs the best match as editable digital text.
This tool uses Tesseract.js, a WebAssembly port of Google's Tesseract OCR engine. The OCR worker runs in your browser after worker, core, and language files load; the selected image is not uploaded for OCR.
Modern OCR handles printed text very well (95-99% accuracy with clean images). Handwriting recognition is improving but less reliable. For best results, use clear, high-contrast images with straight, well-lit text.
What to Expect: Accuracy by Input Type
| Input Type | Typical Accuracy | Tips |
|---|---|---|
| Printed document (scan) | 97-99% | High-res scan, clean background |
| Screenshot with text | 95-99% | Use original resolution, no scaling |
| Phone photo of document | 85-95% | Shoot straight-on, good lighting |
| Receipt or label | 80-90% | Small text is harder, get close |
| Text on complex background | 60-80% | Crop to just the text area first |
| Handwriting (neat print) | 50-70% | Block capitals work best |
| Cursive handwriting | 20-40% | Not reliable, type it manually |
Getting Better Results
Use 300+ DPI images. Higher resolution gives the engine more pixels to work with per character. Low-resolution screenshots of small text are the #1 cause of bad results.
Crop to just the text. Remove borders, images, and decorative elements. The less noise the engine has to filter, the better it focuses on the actual characters.
Select the right language. Each language has its own character model. English is loaded by default, but switching to the correct language before extraction dramatically improves accuracy for non-English text.
Watermarked or overlaid images. Text over photos, watermarks across documents, and coloured backgrounds behind text all confuse the recognition engine. Clean, high-contrast input gets clean output.
Heavily compressed JPGs. JPEG compression artefacts blur the edges of characters, especially at small sizes. If possible, use PNG or a high-quality JPG.
What to Do After Extraction
- Count words to check the extracted text length and reading time.
- Fix the case if the OCR misidentified capitalisation throughout the text.
- Clean whitespace to remove extra spaces and blank lines that OCR often introduces.
- Correct OCR errors manually or in your text editor, like "l" misread as "1" or "0" misread as "O".
Related Tools
How to use this tool
Upload an image or paste from clipboard, drag-and-drop works too.
Select the language of the text in the image for best accuracy.
Click 'Extract Text' and copy the result to your clipboard.
Common uses
- Extract text from scanned documents and PDFs
- Digitise printed notes, receipts, and invoices
- Copy text from screenshots and images
- Convert handwritten notes to digital text
- Translate text in foreign-language images
- Extract data from charts and infographics
Share this tool
Frequently Asked Questions
How does Image to Text (OCR) work?
What image formats are supported?
Which languages are supported?
Is my data private?
Why is the first run slow?
How can I improve OCR accuracy?
Can I extract text from handwriting?
Does it work with scanned PDFs?
Can I paste images from my clipboard?
What about text in tables or columns?
How accurate is the text extraction?
Can I extract text from multiple images?
Results are for general informational purposes only and should be checked before use. They are not professional advice. See our Disclaimer and Terms of Service.