Free image to text converter

Extract text from any image — free OCR in your browser

Drop a screenshot, photo, or scan and get editable text in seconds. Recognition runs on your device — your image is never uploaded.

Drop an image to extract its text
PNG · JPG · WebP · BMP — recognised privately in your browser
PNG · JPG · WEBP · BMP · .PNG · … · max 25.0 MB

Free — unlimited basic OCR in your browser. Upgrade to Pro for HD OCR with layout.

The complete guide to extracting text from images

How OCR turns pixels back into text

Optical character recognition is the art of undoing rasterisation: an image of a page is just a grid of coloured pixels, and OCR's job is to find the shapes in that grid that are letters, decide which letters they are, and stitch them back into words and lines. Modern engines do this in stages — first detecting regions that contain text, then segmenting lines and words, then running each word image through a recognition model trained on millions of samples of printed type.

This tool runs Tesseract, the most battle-tested open-source OCR engine, compiled to WebAssembly so it executes inside your browser at near-native speed. The engine and the language model are downloaded once from a CDN and cached; your image is decoded, analysed, and recognised entirely on your own machine. That has a privacy consequence worth spelling out: the photo of your contract, ID, or receipt is never transmitted anywhere when you use basic OCR.

Recognition output is richer than a wall of text. For every word the engine reports a confidence score and a bounding box — the exact pixel rectangle it read the word from. The side panel uses both: words under 75% confidence get a red underline so you know what to proofread, and hovering any word highlights its rectangle on the original image so verification is instant.

What people use image-to-text for

The most common case is the humble screenshot: an error message, a code snippet from a video, a quote from a slide, an address in a chat picture — content that is text to a human but pixels to your clipboard. OCR turns it back into something you can paste, search, and edit. It beats retyping every time, and the confidence highlighting tells you whether a quick skim is enough.

The second big family is paperwork. Receipts for expense reports, invoices for bookkeeping, business cards into contacts, printed forms into spreadsheets, book pages into study notes. For one-off documents the free in-browser path is ideal; for batches of scans where structure matters — line items in a table, headers versus body text — the Pro HD pass reconstructs the layout instead of flattening everything into one stream of words.

Accessibility and archiving round it out. Text extracted from images can be read aloud by screen readers, translated, and indexed by search. A searchable PDF — the original image with an invisible, perfectly aligned text layer — is the de-facto standard for digitised records precisely because it preserves the visual document while making every word findable.

Getting the most accurate results

OCR accuracy is mostly decided before the engine ever runs, by the quality of the input. Resolution matters first: characters need to be roughly 20 pixels tall or more to recognise reliably, so prefer the original screenshot over a re-compressed copy, and photograph documents close enough that the text fills the frame. Sharpness matters second — a slightly blurry photo that looks fine to your eye can halve recognition accuracy.

Geometry and lighting come next. Shoot from directly above so lines stay straight, avoid shadows falling across the page, and favour even, diffuse light over a harsh flash that blows out part of the text. Crop away busy backgrounds when you can: the less non-text content the engine has to consider, the fewer false detections you get.

Finally, tell the engine what it is reading. Recognition models are per-language — the English model has no idea what an umlaut or a Cyrillic letter is, so French text read with English settings comes out mangled. Pick the right language from the picker (the data downloads automatically), and watch the red underlines: a handful of flagged words is normal, but if half the result is underlined, the input likely needs a better photo or a different language.

Basic OCR vs HD OCR — an honest comparison

Free basic OCR runs Tesseract's LSTM recogniser in your browser. On clean printed text — screenshots, PDFs rendered to images, flatbed scans — it is genuinely excellent, and its privacy story is unbeatable because nothing is uploaded. Its weaknesses are structure and noise: output is plain text in reading order, tables collapse into word soup, and accuracy degrades on skewed photos, low contrast, and unusual fonts.

HD OCR, part of the Pro plan, runs PaddleOCR on our servers — a modern deep-learning detection-plus-recognition pipeline that consistently tops open benchmarks. The practical differences: it reconstructs layout (paragraphs, tables, and headers come back as structure, not just a stream of words), it detects languages automatically and handles mixed-language documents, it is markedly more robust to skew, noise, and odd typography, and it can emit a searchable PDF with the text layer aligned to the original image.

A good rule of thumb: start with the free pass — it is instant and private, and for most images it is all you need. Reach for HD when the document is structured (tables, forms, multi-column layouts), when the photo quality is rough, or when you need the searchable-PDF output for archiving. Each HD run costs one AI credit on the Pro plan.

Privacy, language support, and the wider toolkit

Because basic recognition is fully client-side, this tool is safe for sensitive material: contracts, IDs, medical paperwork, financial statements. The engine and language files are fetched from a CDN and cached by your browser; the image itself stays in memory on your device and is gone when you close the tab. Only the optional Pro HD pass transmits the image — over TLS, processed, and never used for anything but your job.

Language coverage spans the world's major scripts: Latin-alphabet languages from English to Vietnamese, Cyrillic for Russian and Ukrainian, Greek, right-to-left Arabic and Hebrew, the Indic scripts of Hindi and Bengali, Thai, and the CJK family — Japanese, Korean, and both Simplified and Traditional Chinese. Each model downloads only when you pick it, so the first run in a new language takes a few extra seconds and is instant afterwards.

OCR also slots into a longer pipeline. Got a PDF? Render its pages with our PDF-to-image tool, then OCR the result. Going the other way, our image-to-PDF tool binds images into a document, and the compressor shrinks scans before you share them. Together they cover the full journey from paper to pixels to editable, searchable text — with the private, in-browser path as the default at every step.

Frequently asked questions

Is extracting text from an image free?

Yes. Basic OCR is completely free, unlimited, and needs no sign-up. Recognition runs in your browser with Tesseract, the most widely used open-source OCR engine, so there is nothing for us to meter. Pro adds HD OCR — a server-side engine with structured layout and noticeably better accuracy on difficult scans.

Is my image uploaded to a server?

Not for basic OCR. The recognition engine runs inside your browser as WebAssembly: the engine and language data are downloaded once and cached, and your image never leaves your device. Only the optional Pro HD OCR sends the image to our servers — and the tool tells you exactly when that happens.

Which languages can it read?

The language picker offers more than twenty languages, including English, Spanish, French, German, Portuguese, Russian, Ukrainian, Arabic, Hebrew, Hindi, Bengali, Thai, Vietnamese, Japanese, Korean, and both Simplified and Traditional Chinese. English loads by default; picking another language fetches its recognition data on the spot. Pro HD OCR also detects the language automatically — useful for mixed-language documents.

What do the red underlines in the result mean?

Every recognised word carries a confidence score from the engine. Words it was less sure about — below 75% — are underlined in red so you can double-check them before using the text. Hover any word and the matching region lights up on the original image, which makes verifying a flagged word take a second.

How accurate is the OCR?

On a clean screenshot or a well-lit, flat document photo, accuracy is typically excellent — high-90s percent on printed text. Accuracy drops with blur, low resolution, skew, handwriting, or decorative fonts. The confidence highlighting tells you exactly where to look, and Pro HD OCR (a modern deep-learning engine) does significantly better on hard cases like receipts, low-contrast scans, and dense tables.

Can I download the result as a Word document?

Yes. Beyond copying the text to your clipboard, you can download it as a plain .txt file or as a .docx Word document, generated right in your browser with one paragraph per line. The .docx opens in Microsoft Word, Google Docs, LibreOffice, and Pages.

What is a searchable PDF?

A searchable PDF looks exactly like your original image but carries an invisible text layer underneath, so you can select, copy, and search the text — and so can document systems and search indexes. It is the standard way to digitise scanned paperwork. The searchable-PDF output is part of Pro HD OCR, which aligns the text layer using its layout analysis.

What image formats and sizes work?

PNG, JPG, WebP, and BMP up to 25MB. For best results feed the tool the highest-resolution version you have: text should be at least about 20 pixels tall to recognise reliably. If your text lives in a PDF, run our PDF-to-image tool first and OCR the rendered page.

Why did my photo of a document come out garbled?

The usual culprits are blur, perspective distortion, and shadows. Retake the photo from directly above in even light, fill the frame with the page, and make sure the text is in focus. Choosing the right language matters too — recognising German text with the English model will mangle umlauts. For stubborn documents, Pro HD OCR handles skew and noise much more gracefully.

What's the difference between basic and HD OCR?

Basic OCR runs Tesseract in your browser: free, private, and great for screenshots and clean documents, but it returns plain text only. HD OCR runs PaddleOCR — a modern deep-learning engine — on our servers: it reconstructs the document's structure (paragraphs, tables, headers), detects languages automatically, reads difficult scans far more accurately, and can emit a searchable PDF. HD OCR is part of the Pro plan and uses one AI credit per run.