Browse and search the AI agent directory
47 agents found
camel doc ocr / core ocr / docscope ocr / monkey ocr
nanonets ocr / smoldocling / monkey ocr / typhoon ocr
Demo of a collection of impressive OCR models on the Hub
AI-powered document parsing with 23 tools: PDF (watermark, merge, split), Word, Excel, PowerPoint, OCR, semantic search, and batch processing
A list of open-source AI projects you can use to generate income easily.
Local-first, open-source AI assistant for your data. Unify tasks, notes, docs, photos, and bookmarks. Private, self-host
Demo of a collection of multimodal VLMs on HF [OCR / others]
A comprehensive MCP server providing 43 tools for filesystem operations, process management, interactive sessions, async file search, JSON repair, encoding fix, duplicate detection, OCR, ZIP archives, and Markdown export
Invoice MCP server — extract structured data from PDF & image invoices, create e-invoices (UBL, CII, ZUGFeRD/Factur-X), convert between formats including Excel, and validate against EN 16931.
HireBase - AI-powered CV search engine with LanceDB and MCP
DeepSeek-OCR 2: Visual Causal Flow
OCR, VQA, Thinking and Object Detection.
thinking / ocr / reasoning
MinerU document parsing API — PDFs, images, DOCX, PPTX with OCR and batch processing.
MCP Server for RQ-SCAN - AI-powered document OCR and data extraction platform
MCP server for native app testing: screenshot, OCR, click, type, find_text. macOS, Windows, Android.
43 tools for filesystem, process management, sessions, search, OCR, ZIP, and PDF export.
MCP server — control remote desktops via VNC with a native Swift daemon and Apple Vision OCR
Access your reMarkable tablet - read documents, browse files, extract text and OCR
A super handy screenshot tool