QuickPdfOcr

OCR scanned PDFs in seconds — locally, with a clean desktop UI.

Drag and drop a PDF, click Start OCR, then copy the extracted text to your clipboard. Built with PySide6 (Qt6) and powered by Tesseract OCR.


What It Does

QuickPdfOcr is a lightweight PDF OCR application that helps you extract text from image-based or scanned PDFs using Tesseract. It's designed to be straightforward: choose a PDF (or drag it in), run OCR, and copy the text.

Key Features

📄

Drag & Drop

Drag PDF files directly into the application window to load them for OCR.

📁

File Picker

Browse and select PDFs using the traditional file picker if you prefer.

🔍

Tesseract-Powered OCR

Uses the widely adopted open-source Tesseract engine to extract text from scanned pages.

📊

Progress & Status Updates

Real-time feedback shows processing status while OCR is running.

📋

One-Click Copy to Clipboard

Instantly copy extracted text to your clipboard on macOS, Linux, and Windows.

🔄

Error Recovery

If something fails during processing, you can retry or start over.

🎨

Modern UI

Clean, user-friendly interface with clear visual feedback throughout the process.

📦

Fully Standalone

Pre-built executables include everything—Python, Tesseract, and Poppler. No installation needed!

🔒

Offline & Privacy-Friendly

Your documents stay on your computer. QuickPdfOcr runs locally and performs OCR on-device: no uploads, no cloud processing, no data collection.

Installation / Downloads

Recommended: download the pre-built executable for your OS from GitHub Releases. The app is intended to be "download and run," with no installation required.

🪟

Windows

Standalone .exe executable


🍎

macOS

.app bundle (Intel & ARM64 versions available)


🐧

Linux

Standalone executable


All executables include Python, PySide6, Tesseract OCR, and Poppler—everything you need bundled in one package.

For Developers

Run from source (Python) or build your own standalone binary. The project uses PySide6, pytesseract, pdf2image, PyPDF2, and Pillow.

PySide6 (Qt6)

Tesseract OCR

pdf2image

PyPDF2

Pillow

PyInstaller

# Clone the repository
git clone https://github.com/KSEGIT/QuickPdfOcr.git
cd QuickPdfOcr

# Install dependencies
pip install -r requirements.txt

# Run the application
python main.py

# Build your own executable
python build.py
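
This page does not show how the application wires its libraries together. As a rough sketch only, assuming the common pdf2image + pytesseract pattern (each page is rasterized via Poppler, then each image is passed to Tesseract), the core OCR loop with progress reporting might look like the following. The names `ocr_pdf`, `render_pages_with_pdf2image`, and `recognize_with_tesseract` are hypothetical and are not taken from the QuickPdfOcr code base. The renderer and recognizer are passed in as parameters so the flow can be tried without Tesseract or Poppler installed.

```python
from typing import Callable, List, Optional


def render_pages_with_pdf2image(pdf_path: str, dpi: int = 300) -> list:
    """Rasterize each PDF page to a PIL image (requires pdf2image and Poppler)."""
    # Imported lazily so the rest of the sketch works without the dependency.
    from pdf2image import convert_from_path
    return convert_from_path(pdf_path, dpi=dpi)


def recognize_with_tesseract(image) -> str:
    """Run Tesseract on one page image (requires pytesseract and Tesseract)."""
    import pytesseract
    return pytesseract.image_to_string(image)


def ocr_pdf(
    pdf_path: str,
    render_pages: Callable[[str], list] = render_pages_with_pdf2image,
    recognize: Callable[[object], str] = recognize_with_tesseract,
    on_progress: Optional[Callable[[int, int], None]] = None,
) -> str:
    """Hypothetical helper: OCR every page and join the results.

    on_progress, if given, is called as on_progress(pages_done, total_pages)
    after each page, which is one way a UI could drive a progress bar.
    """
    pages = list(render_pages(pdf_path))
    texts: List[str] = []
    for index, page in enumerate(pages, start=1):
        texts.append(recognize(page).strip())
        if on_progress is not None:
            on_progress(index, len(pages))
    # Separate pages with a blank line so page boundaries stay visible.
    return "\n\n".join(texts)
```

With the default renderer and recognizer, a call such as `ocr_pdf("scan.pdf", on_progress=print)` would need Tesseract and Poppler available on the system in addition to the Python packages. In a desktop UI this loop would normally run off the main thread so the window stays responsive.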
