OCR scanned PDFs in seconds — locally, with a clean desktop UI.
Drag and drop a PDF, click Start OCR, then copy the extracted text to your clipboard. Built with PySide6 (Qt6) and powered by Tesseract OCR.
QuickPdfOcr is a lightweight PDF OCR application that helps you extract text from image-based or scanned PDFs using Tesseract. It's designed to be straightforward: choose a PDF (or drag it in), run OCR, and copy the text.
Simply drag and drop PDF files directly into the application window for instant processing.
Browse and select PDFs using the traditional file picker if you prefer.
Industry-standard Tesseract OCR engine extracts text accurately from scanned pages.
Real-time feedback shows processing status while OCR is running.
Instantly copy extracted text to your clipboard on macOS, Linux, and Windows.
Retry or start over options available if something fails during processing.
Clean, user-friendly interface with clear visual feedback throughout the process.
Pre-built executables include everything—Python, Tesseract, and Poppler. No installation needed!
Your documents stay on your computer. QuickPdfOcr runs locally and performs OCR on-device. No uploads, no cloud processing, no data collection. Your privacy is guaranteed.
Recommended: Download the pre-built executable for your OS from GitHub Releases. The app is intended to be "download and run." No installation required!
All executables include Python, PySide6, Tesseract OCR, and Poppler—everything you need bundled in one package.
Run from source (Python) or build your own standalone binary. The project uses PySide6, pytesseract, pdf2image, PyPDF2, and Pillow.
# Clone the repository
git clone https://github.com/KSEGIT/QuickPdfOcr.git
cd QuickPdfOcr
# Install dependencies
pip install -r requirements.txt
# Run the application
python main.py
# Build your own executable
python build.py