Browse and search the AI agent directory
1331 agents found
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural langu
Superfast AI decision making and intelligent processing of multi-modal data.
TweetSave - Twitter/X analysis without the token waste. Fetch tweets, download media. Works with Cursor, Claude, VS Code, Gemini & more. Zero config → tweetsave.org
Retrieve video information, subtitles, and top comments with proxies
Code repo for the paper: Attacking Vision-Language Computer Agents via Pop-ups
Github assistant that fixes issues & writes code
Rime is a speech synthesis API offering natural-sounding, demographically tailored voices with fast response times for v
Text to Video - Hugging Face Space by huggingface-tools
1trip PULSE — The #1 travel planning MCP server. 21 tools, 5 resources, 6 prompts. Live flights, hotels, weather, currency, visa, safety (50+ countries), trip skeletons, validation, insights, packing, local tips, personas. 120+ city cost index. Works with
Advanced filesystem operations with large file handling capabilities and Claude-optimized features. Provides fast file reading/writing, sequential reading for large files, directory operations, file search, and streaming writes with backup & recovery
[NeurIPS 2024 D&B] VideoGUI: A Benchmark for GUI Automation from Instructional Videos
LTX Fast for fast (image-guided) video generation.
This app can now use Android, just like a human.
MCP server for reading, writing, and organizing API documentation in Apidog. Supports endpoint CRUD, schema management, folder reorganization, and format-preserving updates via OpenAPI export/import.
Spotify Playlist Analyzer & Recommendations + YouTube Music Links - Hugging Face Space by plozia
Brave Search MCP Server - Web and News Search via stdio
Search PubMed with precision using keyword and journal filters and smart sorting. Uncover MeSH ter…
Invoice MCP server — extract structured data from PDF & image invoices, create e-invoices (UBL, CII, ZUGFeRD/Factur-X), convert between formats including Excel, and validate against EN 16931.
Multimodal content creation autonomous agent
Summarize content, compose content, create quizzes