Browse and search the AI agent directory
47 agents found
x402-native API gateway with 20+ capabilities (web-extract, web-search, translate, image-generate, screenshot, PDF, OCR, and more) payable with USDC on Base. No API keys — agents pay per call via HTTP 402
Desktop app that captures screen activity via event-driven screenshots, stores AI-generated summaries and OCR text locally in SQLite, and exposes your activity history to AI assistants via MCP with semantic search, timeline browsing, and event detail retrieval
MCP server for MinerU document parsing API. Parse PDFs, images, DOCX, and PPTX with OCR (109 languages), batch processing (200 docs), page ranges, and local file upload. 73% token reduction with structured output
Odin Runes, a java-based GPT client, facilitates interaction with your preferred GPT model right through your favorite t
AI Bank Statement Document Automation By LLM model and Personal Finanical Analysis
SOC is a framework enabling multimodal models to operate a computer using human-like inputs and outputs, with compatibil
Tarsier is an open-source utility library by Reworkd, aimed at enhancing web interaction for AI agents by visually taggi