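Batch OCR
Upload a ZIP file containing PDFs or images (up to 25 MB total) and get back a single combined markdown file, with each source file under its own heading. It is well suited to digitizing paper archives, scanning an entire folder of invoices, or processing a batch of scanned pages in one pass. Powered by Mistral OCR.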
You'll get clean markdown with tables, equations, and headings preserved — ready to paste or edit.
How to use Batch OCR
- Put your PDFs and images into a single ZIP archive (up to 25 MB total).
- Upload the ZIP to Batch OCR.
- Run the extraction and wait while each file is processed.
- Download the combined markdown file with one heading per source document.
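The first step above can be scripted. The following is a minimal sketch using only the Python standard library; the function name `build_batch_zip`, the list of accepted extensions, and the reading of "25 MB" as 25,000,000 bytes are assumptions for illustration, not documented behavior of the tool.

```python
import zipfile
from pathlib import Path

# Assumption: "25 MB" is read conservatively as 25,000,000 bytes.
MAX_ZIP_BYTES = 25 * 1000 * 1000
# Assumption: an illustrative set of extensions; the tool's accepted formats may differ.
ALLOWED_SUFFIXES = {".pdf", ".png", ".jpg", ".jpeg", ".tif", ".tiff"}


def build_batch_zip(source_dir, zip_path):
    """Pack the PDFs and images in source_dir into one ZIP and check its size."""
    source_dir = Path(source_dir)
    zip_path = Path(zip_path)
    files = sorted(
        p for p in source_dir.iterdir()
        if p.is_file() and p.suffix.lower() in ALLOWED_SUFFIXES
    )
    with zipfile.ZipFile(zip_path, "w", compression=zipfile.ZIP_DEFLATED) as archive:
        for path in files:
            # Store files flat, under their own names.
            archive.write(path, arcname=path.name)
    size = zip_path.stat().st_size
    if size > MAX_ZIP_BYTES:
        raise ValueError(f"{zip_path.name} is {size} bytes, over the 25 MB limit")
    return [p.name for p in files]
```

Calling `build_batch_zip("scans", "batch.zip")` would return the names of the files it packed, or raise if the archive comes out too large to upload.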
Use cases
- Digitize a folder of scanned paper documents in one pass
- Convert a batch of invoices or forms into searchable text
- Extract text from a multi-page scanned archive bundle
- Turn a set of image-only PDFs into editable markdown
- Build a searchable knowledge base from a stack of legacy files
Tips for best results
- Scan documents at 300 DPI or higher for the cleanest text recognition.
- Name the files inside the ZIP clearly, since those names guide the headings in the output.
- Keep the total ZIP under 25 MB by compressing images or splitting into multiple batches.
- Review tables and complex layouts in the markdown, as intricate formatting may need minor cleanup.
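When a set of files is too large for one upload, the splitting tip above can be planned ahead of time. This is a rough sketch, not part of the tool: `plan_batches` is a hypothetical helper that groups files greedily, in order, by their uncompressed sizes. Scanned PDFs and JPEGs rarely shrink much inside a ZIP, so uncompressed size is used as an estimate; leaving some headroom below the limit for ZIP overhead is advisable.

```python
def plan_batches(file_sizes, limit_bytes):
    """Group (name, size) pairs, in order, into batches that fit within limit_bytes."""
    batches = []
    current = []
    current_total = 0
    for name, size in file_sizes:
        if size > limit_bytes:
            # A single oversized file cannot fit in any batch; compress it first.
            raise ValueError(f"{name} alone exceeds the limit")
        if current and current_total + size > limit_bytes:
            # Adding this file would overflow the batch, so start a new one.
            batches.append(current)
            current, current_total = [], 0
        current.append(name)
        current_total += size
    if current:
        batches.append(current)
    return batches


# Three 10 MB files with a 25 MB budget (sizes in MB for readability).
print(plan_batches([("a.pdf", 10), ("b.pdf", 10), ("c.pdf", 10)], 25))
# prints [['a.pdf', 'b.pdf'], ['c.pdf']]
```

Each resulting batch would then be zipped and uploaded as its own run.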
Frequently asked questions
What does Batch OCR do?
It reads text from multiple documents at once and merges the results into a single markdown file, with a heading for each source file so you can tell the content apart.
What do I upload and what are the limits?
Upload one ZIP archive containing your PDFs and image files, with a combined total of up to 25 MB. Compress or split larger sets to stay under the limit.
What format is the output?
You get one combined markdown (.md) file. Each original file appears under its own heading, with the extracted text in reading order beneath it.
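If you need the per-file text back out of the combined file, a sketch like the following may help. It assumes each source file is introduced by a level-1 heading of the form `# filename`; the page does not state the exact heading level, so adjust `heading_prefix` to match your output. Note that a same-level heading inside the extracted text itself would be mistaken for a file boundary, and duplicate file names would overwrite each other.

```python
def split_by_source(markdown_text, heading_prefix="# "):
    """Split combined markdown into {heading: body} using per-file headings.

    Assumption: heading_prefix marks the start of each source file's section.
    """
    sections = {}
    current = None
    for line in markdown_text.splitlines():
        if line.startswith(heading_prefix):
            # A new source file begins; its heading text becomes the key.
            current = line[len(heading_prefix):].strip()
            sections[current] = []
        elif current is not None:
            sections[current].append(line)
    return {name: "\n".join(lines).strip() for name, lines in sections.items()}


combined = "# invoice-01.pdf\nTotal: 42\n\n# scan.png\nHello\nWorld\n"
print(split_by_source(combined))
# prints {'invoice-01.pdf': 'Total: 42', 'scan.png': 'Hello\nWorld'}
```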
How accurate is the text extraction?
Mistral OCR is accurate on clear printed text and preserves structure like headings and lists well. Low-resolution scans or heavy handwriting may reduce accuracy.
Does it keep tables and formatting?
It preserves structure such as headings, lists, and tables in markdown where possible, though very complex layouts may need light cleanup afterward.
Can I use the extracted text commercially?
Yes, you can use the output in your own archives, documents, and products. The free tier covers 5 batch runs per day with no signup; Pro is $19/month for higher volume.
What happens to the files I upload?
Your ZIP and its contents are processed only to extract the text and are then discarded. They are not stored or used to train models.
We do not store your text. Processing happens in real time, and your input is discarded as soon as the result is generated.
Free users: 5 uses per day | Pro users: unlimited