S2S: Speech-to-Speech with File Search in a single API Call #642

**Is your feature request related to a problem? Please describe.**
Users need to send voice notes (in Indic languages or English) as questions, have answers generated from their knowledge base plus a prompt, and receive those answers back as voice notes in the same language as the original voice note. Currently, the user has to chain three API calls (STT --> RAG --> TTS) for every S2S request.

**Describe the solution you'd like**
A single API endpoint, with minimal configuration, that handles the whole S2S call.
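
As a rough illustration of the request, the three stages could sit behind one entry point along these lines. This is only a sketch: `S2SConfig`, `Transcript`, `speech_to_speech`, and the `fake_*` stand-ins are hypothetical names invented for this example, not part of any existing API, and real STT, RAG, and TTS providers would replace the stubs.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class S2SConfig:
    # Hypothetical "minimal config": which knowledge base to search and which prompt to use.
    knowledge_base_id: str
    prompt: str


@dataclass
class Transcript:
    text: str
    language: str  # language detected in the incoming voice note


def speech_to_speech(
    audio: bytes,
    config: S2SConfig,
    stt: Callable[[bytes], Transcript],
    rag: Callable[[str, S2SConfig], str],
    tts: Callable[[str, str], bytes],
) -> bytes:
    """Run STT, then RAG, then TTS, so the caller makes one call instead of three."""
    transcript = stt(audio)                  # voice note -> text + language
    answer = rag(transcript.text, config)    # answer from knowledge base + prompt
    return tts(answer, transcript.language)  # reply in the same language as the input


# Stand-in stages so the sketch runs without any external service.
def fake_stt(audio: bytes) -> Transcript:
    return Transcript(text=audio.decode("utf-8"), language="hi")


def fake_rag(question: str, config: S2SConfig) -> str:
    return f"{config.prompt} {question}"


def fake_tts(text: str, language: str) -> bytes:
    return f"[{language}] {text}".encode("utf-8")


reply = speech_to_speech(
    b"question",
    S2SConfig(knowledge_base_id="kb-1", prompt="Answer:"),
    fake_stt,
    fake_rag,
    fake_tts,
)
```

Passing the stages in as callables keeps the sketch independent of any particular provider; the actual endpoint would presumably fix these internally and expose only the audio input and the config.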

**Additional context**
[Glific S2S requirements](https://docs.google.com/document/d/1bP-pJDJ8tKb_86XFPFq1uuZokFm1TqLSiB8KUowMAp4/edit?usp=sharing)
[S2S PRD](https://docs.google.com/document/d/1bP-pJDJ8tKb_86XFPFq1uuZokFm1TqLSiB8KUowMAp4/edit?usp=sharing)

