Mistral AI has released Mistral OCR 3, its latest optical character recognition service that powers the company’s Document AI stack. […]
Category: OCR
Tencent Hunyuan Releases HunyuanOCR: a 1B Parameter End to End OCR Expert VLM
Tencent Hunyuan has released HunyuanOCR, a 1B parameter vision language model that is specialized for OCR and document understanding. The […]
Comparing the Top 6 OCR (Optical Character Recognition) Models/Systems in 2025
Optical character recognition has moved from plain text extraction to document intelligence. Modern systems must read scanned and digital PDFs […]
DeepSeek Just Released a 3B OCR Model: A 3B VLM Designed for High-Performance OCR and Structured Document Conversion
DeepSeek-AI released 3B DeepSeek-OCR, an end to end OCR and document parsing Vision-Language Model (VLM) system that compresses long text […]
How to Build a Multilingual OCR AI Agent in Python with EasyOCR and OpenCV
In this tutorial, we build an Advanced OCR AI Agent in Google Colab using EasyOCR, OpenCV, and Pillow, running fully […]
What are Optical Character Recognition (OCR) Models? Top Open-Source OCR Models
Optical Character Recognition (OCR) is the process of turning images that contain text—such as scanned pages, receipts, or photographs—into machine-readable […]
Meet dots.ocr: A New 1.7B Vision-Language Model that Achieves SOTA Performance on Multilingual Document Parsing
dots.ocr is an open-source vision-language transformer model developed for multilingual document layout parsing and optical character recognition (OCR). It performs […]
NuMind AI Releases NuMarkdown-8B-Thinking: A Reasoning Breakthrough in OCR and Document-to-Markdown Conversion
NuMind AI has officially released NuMarkdown-8B-Thinking, an open-source (MIT License) reasoning OCR Vision-Language Model (VLM) that redefines how complex documents […]
Microsoft brings a new text extraction tool to Windows 11
We’re huge fans of PowerToys, and one of the greatest modules included in the utility collection is Text Extractor. As […]
Huge Microsoft Photos update adds amazing new web search with OCR-extracted text feature and makes AI options easier to access
Microsoft Photos is something of an unsung hero of the Windows app family. It is an astonishing useful and powerful […]
