Browse and search the AI agent directory
159 agents found
Generate high-quality text-to-speech and text-to-voice output using the [DAISYS](https://www.daisys.ai/) platform, and play and store the generated audio
Expose all Home Assistant voice intents through a Model Context Protocol server, allowing home control
MCP Server that connects AI Agents to [Carbon Voice](https://getcarbon.app). Create, manage, and interact with voice messages, conversations, direct messages, folders, voice memos, AI actions, and more
Complete voice interaction server supporting speech-to-text, text-to-speech, and real-time voice conversations through local microphone, OpenAI-compatible APIs, and LiveKit integration
MCP Server that uses the open-weight Kokoro TTS models to convert text to speech. Can convert text to MP3 on a local drive or auto-upload to an S3 bucket
MCP server plugin for Claude Code that converts text to speech using OpenAI's TTS API. Features 6 voices, worker pool architecture, mutex-protected playback, and cross-platform support
Voice synthesis and audio generation via MCP
Text-to-speech, image generation, and video generation via MCP
The Self-Coding System for Your App — Alan AI SDK for Cordova
Like ChatGPT's voice conversations with an AI, but entirely offline/private/trade-secret-friendly, using local AI models
Command Your World with Voice
Allows you to have an engaging and safely emotive spoken / CLI conversation with the AI ChatGPT / GPT-4 while giving you
Self-hosted AI voice agent
⚡ A local, privacy-focused AI desktop assistant for Windows. Control your PC remotely via Telegram or locally with Voice
Experimenting with conversational AI in iOS, macOS and visionOS apps
Safeclaw is the alternative to openclaw. You can naturally chat with it via text and voice, yet there is no language mo
Real-time voice agent powered by Agora and OpenAI
AIUI is a platform enabling seamless two-way verbal communication with AI.
One-stop handbook for building, deploying, and understanding LLM agents with 60+ skeletons, tutorials, ecosystem guides,
vimGPT is a project that integrates GPT-4V's vision capabilities with the Vimium extension to enable web browsing and in