Making a realistic female voice in real time

Preface
First, you need at least a mid-range video card; a weak card will introduce a noticeable delay.

What will be required?
The program itself, download link and tutorial are below:

Link: https://github.com/w-okada/voice-changer

Download tutorial:
On the page that opens, scroll down and click the Hugging Face link, as in the screenshot:

8b18993704afc20c9f3db0c4c105abc565cc67f1dfa88191d9a3cbbf224dd323.webp


Next we see a bunch of files:

e8fabf4eb3683a35b95ee81a653751af9bca99779249bd49b7871a07ebc2a69e.webp


If you have a Mac, then download the Mac version.
If you have an AMD video card, then download the onnxDirectML version (the newest).
If you have an Nvidia video card, then download the onnxgpu version (the newest).
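
The three rules above can be written down as a small lookup. This is only a sketch of the advice in this post: the function name `pick_build` and its arguments are made up for illustration, and it returns the keyword to look for in the file names, not an exact file name.

```python
from typing import Optional


def pick_build(os_name: str, gpu_vendor: str) -> Optional[str]:
    """Return the build keyword to look for in the file list.

    Hypothetical helper: it only encodes the three rules from the post
    and returns None for setups the post does not cover.
    """
    os_name = os_name.strip().lower()
    gpu_vendor = gpu_vendor.strip().lower()

    if os_name in ("mac", "macos", "darwin"):
        return "mac"
    if gpu_vendor == "amd":
        return "onnxdirectML"
    if gpu_vendor == "nvidia":
        return "onnxgpu"
    return None  # the post gives no recommendation for this case


if __name__ == "__main__":
    print(pick_build("Windows", "Nvidia"))
```

In every case, take the newest file that matches the keyword.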

Installation
Unpack the archive, find the file "start_http.bat" in the folder and run it.
The required files will start downloading, and eventually the control panel will open:

c8df38755bcfe9877b542f86ac780d587c82fe1066f9625224f54d9fbe08711e.webp


To output sound from the program, you will need Virtual Audio Cable (VAC).

Link to Virtual Audio Cable (VAC): https://drive.google.com/file/d/1G4_9XM2HKj-ZUPp1mFjY8dkHDZE7TVTJ/view

VT: https://www.virustotal.com/gui/file/d9cc50239e5bad10f689c7c9e82ef52dd99ac6fafeb62dcc6678c493b2d05141
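
The last part of the VirusTotal link is a SHA-256 hash, so you can check that the file you downloaded is the same one that was scanned. Below is a minimal sketch; it assumes the hash in the link belongs to the downloaded file itself, and the file name in the usage example is hypothetical.

```python
import hashlib

# Hash copied from the VirusTotal link above.
EXPECTED_SHA256 = "d9cc50239e5bad10f689c7c9e82ef52dd99ac6fafeb62dcc6678c493b2d05141"


def sha256_of(path: str, chunk_size: int = 1 << 20) -> str:
    """Compute the SHA-256 of a file, reading it in chunks to keep memory use low."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path: str, expected: str = EXPECTED_SHA256) -> bool:
    """Return True if the file's hash equals the expected one (case-insensitive)."""
    return sha256_of(path) == expected.strip().lower()
```

Usage: `matches_expected("vac_download.zip")`, where `vac_download.zip` is a placeholder for whatever name the file has on your disk. If it returns False, the file is not the one that was scanned.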

You do not need to unpack the archive. To install, launch this file:

9fae5a41c1b70aa37a8d81be37d1690e61d0560df852c2360ee4dc98e0b5825c.webp


In the installer, click Next at every step and accept everything.
IMPORTANT: DURING INSTALLATION THE CABLE WILL SET ITSELF AS THE DEFAULT SOUND OUTPUT DEVICE. DO NOT FORGET TO CHANGE IT BACK TO YOUR HEADPHONES, AND ALSO DO NOT FORGET TO SET LINE 1 AS THE INPUT DEVICE IN DISCORD.

Interface

8150b2de2cafff024a9f3967f7c86a836d43a27b7120a97351e46cb3f8faff9f.webp


In Quality, set a value from 192 to 512 (there is no point in going higher); in Bitrate, set 4096.
In Tone, set +12 if you are testing a female voice model and -12 if a male one.
Then click Start and profit. The voice model works!
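
To get a feel for what the Tone value does: assuming Tone is measured in semitones (the post does not say so, this is an assumption), +12 is one octave up and doubles the pitch, and -12 is one octave down and halves it. A small sketch of that arithmetic, with made-up function names:

```python
def semitone_ratio(semitones: float) -> float:
    """Frequency ratio for a shift of the given number of semitones
    (12-tone equal temperament: each semitone multiplies pitch by 2**(1/12))."""
    return 2.0 ** (semitones / 12.0)


def shifted_pitch(base_hz: float, semitones: float) -> float:
    """Pitch in Hz after shifting base_hz by the given number of semitones."""
    return base_hz * semitone_ratio(semitones)


if __name__ == "__main__":
    # 110 Hz is just an example input, not a measured voice.
    for tone in (-12, 0, 12):
        print(tone, shifted_pitch(110.0, tone))
```

If the result sounds too high or too low for your own voice, intermediate values follow the same formula.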

Conclusion
We spent only 15 minutes and got a great voice-changing result. I will be glad to answer any questions about the article in the comments.

You can listen to the result in this video:
 