Browse and search the AI agent directory
132 agents found
Get info from pokemon.
MCP server for media transcription — YouTube, podcasts, and more
OCR, VQA, Thinking and Object Detection.
thinking / ocr / reasoning
A list of open-source AI projects you can use to generate income easily.
Multimodal OCR model for complex document understanding.
A sample MCP server for AWS S3 that flexibly fetches objects from S3 such as PDF documents
LlamaIndex is the leading document agent and OCR platform
Local-first, open-source AI assistant for your data. Unify tasks, notes, docs, photos, and bookmarks. Private, self-host
MCP server for converting Markdown to PDF with customizable styling
MCP server for the OpSpawn x402 Bazaar — screenshot capture, AI analysis, PDF/HTML generation, code security scanning, and dependency auditing via x402 micropayments
AI-powered document parsing with 23 tools: PDF (watermark, merge, split), Word, Excel, PowerPoint, OCR, semantic search, and batch processing
Local RAG MCP Server with extended file support (including PPTX)
Convert Markdown, HTML, and web pages to high-quality PDF with Prince.
MCP server that provides computer control capabilities, like mouse, keyboard, OCR, etc. using PyAutoGUI, RapidOCR, ONNXRuntime Without External Dependencies
GOT - OCR (from : UCAS, Beijing)
demo of a collection of multimodal vlms on hf [ocr / others]
From data to vector database effortlessly
MCP server for ConMas i-Reporter — search forms, query reports, and export PDF/Excel/CSV via Claude Desktop or Claude Code
MCP server to read and search text in a local PDF file