
pytesseract · PyPI
Aug 15, 2024 · Project description Python-tesseract is an optical character recognition (OCR) tool for Python. That is, it will recognize and “read” the text embedded in images. Python-tesseract is a …
easyocr · PyPI
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including: Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
aspose-ocr-python-net · PyPI
Nov 19, 2025 · Aspose.OCR for Python is a powerful yet easy-to-use and cost-effective API for extracting text from scanned images, photos, screenshots, PDF documents, and other files.
python-doctr · PyPI
Jul 9, 2025 · End-to-End OCR is achieved in docTR using a two-stage approach: text detection (localizing words), then text recognition (identify all characters in the word). As such, you can select …
pyocr · PyPI
If the OCR fails, an exception pyocr.PyocrException will be raised. An exception MAY be raised if the input image contains no text at all (depends on the OCR tool behavior).
keras-ocr · PyPI
Nov 6, 2023 · It provides a high level API for training a text detection and OCR pipeline. Please see the documentation for more examples, including for training a custom model.
surya-ocr · PyPI
You can now take Surya OCR further by training it on your own data with our finetuning script. It’s built on Hugging Face Trainer, and supports all the arguments that the huggingface trainer provides, and …
img2table · PyPI
Aug 10, 2025 · img2table is a simple, easy-to-use table identification and extraction Python library based on OpenCV image processing that supports most common image file formats as well as PDF …
ocrmypdf · PyPI
Nov 11, 2025 · In addition to the required Python version, OCRmyPDF requires external program installations of Ghostscript and Tesseract OCR. OCRmyPDF is pure Python, and runs on pretty …
Umi-OCR · PyPI
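Jun 1, 2024 · The text recognized by OCR is divided into "blocks". Usually one line of text forms one block, but sometimes a single line is mistakenly split into several blocks, which makes the result harder to read. In the Umi-OCR settings page, under Merge Paragraphs (合并段落), you can choose a suitable scheme to merge blocks on the same line or …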
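
The Umi-OCR snippet above describes merging OCR blocks that belong to the same line. The sketch below illustrates that general idea in plain Python. It is not Umi-OCR's actual algorithm: the `Block` type, the `merge_same_line` function, and the `tolerance` parameter are hypothetical names introduced for illustration, and the rule used (compare the vertical centers of neighbouring blocks) is an assumption.

```python
from dataclasses import dataclass


@dataclass
class Block:
    """Hypothetical OCR text block: text plus the top-left corner and height of its box."""
    text: str
    x: float       # left edge of the bounding box
    y: float       # top edge of the bounding box
    height: float  # height of the bounding box


def _center(block):
    """Vertical center of a block's bounding box."""
    return block.y + block.height / 2


def merge_same_line(blocks, tolerance=0.5):
    """Group blocks whose vertical centers are close, then join each group left to right.

    `tolerance` is an assumed fraction of the taller block's height; it is not
    a setting taken from any OCR tool.
    """
    lines = []  # each entry is the list of blocks on one visual line
    for block in sorted(blocks, key=_center):
        if lines:
            previous = lines[-1][-1]
            limit = tolerance * max(previous.height, block.height)
            if abs(_center(block) - _center(previous)) <= limit:
                lines[-1].append(block)
                continue
        lines.append([block])
    # Within a line, order blocks by their left edge before joining the text.
    return [" ".join(b.text for b in sorted(line, key=lambda b: b.x)) for line in lines]


if __name__ == "__main__":
    sample = [
        Block("world", x=60, y=10, height=12),
        Block("Hello", x=0, y=11, height=12),
        Block("Next line", x=0, y=40, height=12),
    ]
    for line in merge_same_line(sample):
        print(line)
```

Joining with a single space suits Latin scripts. For Chinese or Japanese text an empty separator would likely be more appropriate.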