I'm using https://github.com/rhasspy/piper mostly to create some audiobooks and read some posts/news, but the voices available are not always comfortable to listen to.

Do you guys have any recommendations for a voice changer to process these audio files?
Preferably it'll have a CLI so I can include it in my pipeline to process RSS feeds, but I don't mind having to work through a UI.
Bonus points if it can process audio streams.

[-] catloaf@lemm.ee 22 points 3 months ago

That's called text to speech, not a voice changer. A voice changer is the thing in the Darth Vader halloween masks.

There's been discussion on TTS programs here recently: https://lemm.ee/search?q=tts&type=All&listingType=All&communityId=185&page=1&sort=TopAll

Or you can search via your local instance/interface.

[-] pe1uca@lemmy.pe1uca.dev 11 points 3 months ago

Text to speech is what piper is doing.
What I'm looking for is called a voice changer, since I want to change a voice that has already read something.

That's exactly what I want: "the thing in the Darth Vader halloween masks" but for Linux, preferably via a CLI that ingests audio files and lets me configure the voice however I want, not only Darth Vader.
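
For what it's worth, the crudest version of that idea (a plain pitch change, not real voice conversion) can be sketched in a few lines of Python using only the standard library. This is an illustration under assumptions, not something taken from piper or any tool mentioned in this thread: the script name `pitch.py`, the `shift_pitch` function and the `factor` parameter are all made up for the example. It keeps the samples as they are and only rewrites the sample rate in the WAV header, so a factor below 1 sounds deeper and slower, and a factor above 1 sounds higher and faster.

```python
import sys
import wave


def shift_pitch(src_path, dst_path, factor):
    """Crude pitch change: copy the samples, rewrite the sample rate.

    Hypothetical helper for illustration only. Pitch and speed change
    together; this is NOT voice conversion such as RVC.
    """
    if factor <= 0:
        raise ValueError("factor must be positive")

    # Read every frame and the original format from the source file.
    with wave.open(src_path, "rb") as src:
        params = src.getparams()
        frames = src.readframes(params.nframes)

    # Players read the rate from the header, so changing it changes pitch.
    new_rate = max(1, round(params.framerate * factor))

    with wave.open(dst_path, "wb") as dst:
        dst.setnchannels(params.nchannels)
        dst.setsampwidth(params.sampwidth)
        dst.setframerate(new_rate)
        dst.writeframes(frames)

    return new_rate


# Only act as a CLI when exactly three arguments are given:
#   python pitch.py in.wav out.wav 0.85
if __name__ == "__main__" and len(sys.argv) == 4:
    rate = shift_pitch(sys.argv[1], sys.argv[2], float(sys.argv[3]))
    print(f"wrote {sys.argv[2]} at {rate} Hz")
```

Saved as `pitch.py` (a hypothetical name), it would run as `python pitch.py in.wav out.wav 0.85` on a WAV file that piper has already written. Since pitch and speed move together, it's only a starting point; changing who the speaker sounds like needs actual voice conversion.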

[-] catloaf@lemm.ee 20 points 3 months ago

Oh, I see. I think it would still be easier to either use a different voice in piper (the GitHub page talks about this) or use a different TTS program entirely.

[-] bastion@feddit.nl 4 points 3 months ago

So, all of the awkward pauses, the lack of inflection - you're saying keep those, just change who it sounds like is speaking?

[-] exu@feditown.com 7 points 3 months ago

In case you wanted to try other TTS providers, here's a leaderboard based on user votes.

https://huggingface.co/spaces/TTS-AGI/TTS-Arena

[-] Bookmeat@lemmy.world 2 points 3 months ago
[-] pe1uca@lemmy.pe1uca.dev 2 points 3 months ago

I don't want to manage piper voices; I can handle that directly in my file system, as I only have a few.
The issue is that none of the ones I've found are good for me, so what I need is something to change the voice once it has been generated by piper.

[-] Bookmeat@lemmy.world 2 points 3 months ago

There are a few voices included with pied, which is why I suggested it.

[-] pythia@lemmy.dbzer0.com 2 points 3 months ago

What you're looking for is called RVC. It's integrated into some voice-cloning GitHub projects, but I don't use it. Here, for example: https://github.com/codename0og/rvc-realtime-voice-changer

[-] xcjs@programming.dev 1 points 3 months ago* (last edited 3 months ago)

Coincidentally, I just found this other thread that mentions EasyEffects: https://programming.dev/post/17612973

You might be able to use a virtual device to get it working for your use case.

this post was submitted on 01 Aug 2024
44 points (100.0% liked)