Voice-Search

This service accepts an audio file URL, transcribes the speech to text, processes the query similar to the text-based search, and returns both a text response and an audio response URL.

See OpenAPI specification or SwaggerUI for more details on how to call the service.

Process flow for End to End Speech Generation

Voice Service Integration

For detailed information on how to integrate voice capabilities into your application using AAQ, including setup instructions for both in-house and cloud-based speech services, please refer to our Voice Service documentation. This documentation covers:

Setting up the dockerized container for in-house ASR and TTS models
Configuring Google Cloud Speech-to-Text and Text-to-Speech integration
Best practices for voice input and output in your application

Explore Voice Service Documentation