[App] Simple Text Extractor: Secure, 100% Offline OCR (Stable/Core24)

Hi everyone,

I’m excited to announce that Simple Text Extractor V1.1 is now officially available on the Stable channel for Linux!

Simple Text Extractor is a desktop OCR application designed for users who need to extract text from PDFs and images (PNG, JPG, TIFF, BMP) without compromising their privacy.

Most OCR tools today rely on cloud processing. This app is 100% offline. Your documents never leave your machine.

  1. Snap Strict Confinement (Core 24): Fully sandboxed.
  2. Data Anonymization: Integrated log-scrubbing prevents sensitive file paths or usernames from leaking into system logs (CWE-209 protection).
  3. Security Hardening: Includes protection against “Decompression Bombs” (malicious high-res images) and strict input validation to prevent command injection.
  4. Atomic File Handling: Secure file creation (O_EXCL) to prevent race conditions and unauthorized access.

Built for Ubuntu and Snap-compatible distros, the app has been optimized for the Snap environment:

  • Intelligent Multi-Core Engine: Parallel processing for large batches.

  • Stateless Architecture: Each OCR task is isolated, ensuring the app remains stable and “zero-freeze” even when processing documents over 2000 pages.

  • Turnkey Experience: Tesseract OCR and dictionaries for 10 languages (English, French, German, Dutch, Italian, Spanish, Portuguese, Chinese, Arabic, and Japanese) are pre-packaged. No external dependencies required.

You can install it directly from the Snap Store or via terminal:

snap install simple-text-extractor

:warning: (Note: If you store files on external drives, don’t forget to connect the removable-media interface: sudo snap connect simple-text-extractor:removable-media)

:light_bulb: Tip: Using Wayland? If Drag & Drop doesn’t work, please use the “Add Files” button or switch to an X11 session.

I’m looking for feedback on performance across different distributions. If you find it useful, please consider liking the Snap or leaving a comment here.

Snap Store Link: https://snapcraft.io/simple-text-extractor

3 Likes

Huge thanks to @ogra, @leocjj, and @Yasirel for your support and for the likes! It’s great to see interest from the community for a secure, native OCR solution on Snapcraft.

I am already working on the next update and would love to expand the language support. Which OCR dictionaries or specific languages would you like to see added in the upcoming version?

Your feedback will help me prioritize the next language packs to better suit your text extraction needs.

Thanks again for the warm welcome!

2 Likes