AI Voice Agents

It would be great if FlowMattic considers enhancing their “AI Chatbot” feature with basic Text-to-Speech (TTS) and Speech-to-Text (STT) capabilities and makes them optional to enable within the admin configuration interface. There is a growing demand for AI Voice Agents powered by relatively inexpensive OpenAI TTS API and their Whisper model, that offers transcription services, converting audio input to text (STT). Many AI Chatbot services already offer basic TTS and STT capabilities on top of the standard text conversations. This allows users to tap the microphone and speak to the bot, which then is converted to text for submission. The text response from AI has a play button to perform text-to-speech. There is even more advanced OpenAI real-time API, which provides a speech-to-speech conversational experience and real-time transcription but is still very expensive to use for AI Chatbot use cases. It would also be awesome if FlowMattic’s team considers developing this complex “Conversational AI” feature based on OpenAI real-time API making it similar to the ElevenLabs Conversational AI module that can be integrated into any website and provide real-time voice assistance to the visitors or be used as AI virtual consultant or educator.

Attached images

Garry Z posted 3 months ago

Discussion