voicechat2
Open source voice chat infra that rivals GPT-4o
Screenshots
![z.png](https://steemitimages.com/640x0/https://s3-us-west-2.amazonaws.com/steemhunt/production/steemhunt/2024-08-05/f386af2a-z.png)
Hunter's comment
AI voice chat infrastructure that uses WebSockets. It can achieve voice-to-voice latency as low as 300ms (what GPT-4o does) without a unified voice codec. Everything runs on a single high-end consumer GPU.
On an 7900-class AMD RDNA3 card, voice-to-voice latency is in the 1 second range:
Whisper large-v2 (Q5)
Llama 3 8B (Q4_K_M)
tts_models/en/vctk/vits (Coqui TTS default VITS models)
On a 4090, using Faster Whisper with faster-distil-whisper-large-v2 we can cut the latency down to as low as 300ms:
These installation instructions are for Ubuntu LTS and assume you've setup your ROCm or CUDA already.
I recommend you use conda or (my preferred), mamba for environment management. It will make your life easier.
Link
https://github.com/lhl/voicechat2?ref=producthunt
![Steemhunt.com](https://steemitimages.com/640x0/https://i.imgur.com/jB2axnW.png)
This is posted on Steemhunt - A place where you can dig products and earn STEEM.
View on Steemhunt.com
Nice Open source voice chat infra that rivals GPT-4o.
Downvoting a post can decrease pending rewards and make it less visible. Common reasons:
Submit
Very cool Open source voice chat infra that rivals GPT-4o.
Downvoting a post can decrease pending rewards and make it less visible. Common reasons:
Submit
Upvoted! Thank you for supporting witness @jswit.
Downvoting a post can decrease pending rewards and make it less visible. Common reasons:
Submit
Downvoting a post can decrease pending rewards and make it less visible. Common reasons:
Submit
Congratulations!
We have upvoted your post for your contribution within our community.
Thanks again and look forward to seeing your next hunt!
Want to chat? Join us on:
Downvoting a post can decrease pending rewards and make it less visible. Common reasons:
Submit