Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature Request] Enable offline Speech-to-Text and Text-to-Speech #1237

Open
1 of 4 tasks
JRMeyer opened this issue Feb 16, 2022 · 6 comments
Open
1 of 4 tasks

[Feature Request] Enable offline Speech-to-Text and Text-to-Speech #1237

JRMeyer opened this issue Feb 16, 2022 · 6 comments

Comments

@JRMeyer
Copy link

JRMeyer commented Feb 16, 2022

👋 hi there!

I'm submitting a...

  • Bug report
  • Feature request
  • Documentation issue or request
  • Other... Please describe:

Expected Behavior

Would be great to be able to test and debug a voice bot without an internet connection. Offline STT and TTS (from @coqui-ai) would make this possible using the existing UX from the new jovo debugger

Current Behavior

Currently there's no offline STT or TTS

@jankoenig
Copy link
Member

Hi there. Thank you.

This is not on our immediate roadmap, but would be a great community contribution.

Coqui STT could be implemented as Jovo ASR integration.

@JRMeyer
Copy link
Author

JRMeyer commented Feb 17, 2022

Hi @jankoenig -- just looked into the integration with Lex, and it would be considerably different with Coqui because the user would have their own server running. For example, the user might be running a simple server on their local desktop or they might have spun up a server on their AWS cloud, and using endpoints there. In either case, the API syntax and integration would be identical, but there would be an expectation that the user spins up the server themselves. Not too difficult, but I'm not sure if that's something the Jovo crowd would be interested in.

I think the biggest value add for Jovo users would be to be able to test out their voicebots locally, without having an ASR backend running on one of the providers (like Lex).

Thoughts?

@jankoenig
Copy link
Member

This could work similar to our Snips NLU integration where people also have to run their own servers.

An integration like this would also be useful for our web starters:

@JRMeyer
Copy link
Author

JRMeyer commented Feb 18, 2022

Yeah, I think a general setup mirroring the Snips approach would work nicely. You know of anyone in your community who might like to hack on this? We're happy to offer support/guidance for using the Coqui tools.

@rubenaeg
Copy link
Contributor

I think I could give this a spin :)

@rmtuckerphx
Copy link
Contributor

@JRMeyer Are there any developer docs on the Coqui APIs for STT and TTS using Node.js or REST?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants