RVC AI

RVC AI: Retrieval-based Voice Conversion



[vc_headings linewidth=”0″ borderwidth=”1″ borderclr=”#000000″ title=”RVC AI” google_fonts=”font_family:Comfortaa%3A300%2Cregular%2C700|font_style:700%20bold%20regular%3A700%3Anormal” titlesize=”60″ titleclr=”#000000″]

Retrieval-based-Voice-Conversion-WebUI

[/vc_headings]


RVC AI (Retrieval-based Voice Conversion) is a technique that uses a deep neural network to convert one speaker's voice into another's. It is based on VITS, an end-to-end text-to-speech model. RVC can produce realistic, expressive voice conversions with little training data and modest computational resources.

[vc_headings style=”theme4″ borderclr=”#000000″ style2=”image” title=”Features” google_fonts=”font_family:Comfortaa%3A300%2Cregular%2C700|font_style:700%20bold%20regular%3A700%3Anormal” lineheight=”3″ titlesize=”40″ titleclr=”#000000″ image_id=”2871″][/vc_headings]

✅Minimize tone leakage by replacing source features with training-set features found through top-1 retrieval;

✅Train easily and quickly, even on low-end graphics cards;

✅Achieve decent results with little data (at least 10 minutes of low-noise speech recommended);

✅Support model fusion to alter timbres (use the ckpt processing tab -> ckpt merge);

✅User-friendly WebUI;

✅Use the UVR5 model to quickly separate vocals and instruments.
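The first feature in the list, top-1 retrieval, can be illustrated with a small sketch. This is not RVC's actual implementation. It is a brute-force illustration in plain Python, and the function name `top1_retrieve` and the toy 2-dimensional feature vectors are assumptions made for this example.

```python
from math import dist  # Euclidean distance, available since Python 3.8


def top1_retrieve(source_frames, training_frames):
    """Replace each source frame with its nearest training-set frame."""
    if not training_frames:
        raise ValueError("training_frames must not be empty")
    converted = []
    for frame in source_frames:
        # Brute-force nearest-neighbour search (top-1 retrieval).
        nearest = min(training_frames, key=lambda t: dist(frame, t))
        converted.append(list(nearest))
    return converted


# Toy 2-dimensional "features"; real feature vectors are much larger.
training = [[0.0, 0.0], [1.0, 1.0], [5.0, 5.0]]
source = [[0.9, 1.2], [4.0, 4.5]]
print(top1_retrieve(source, training))  # [[1.0, 1.0], [5.0, 5.0]]
```

Because every output frame comes from the training set, characteristics of the source speaker that are encoded in the original features have less chance to leak into the converted voice.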

[vc_headings style=”theme4″ borderclr=”#000000″ style2=”image” title=”Preparing Environment” google_fonts=”font_family:Comfortaa%3A300%2Cregular%2C700|font_style:700%20bold%20regular%3A700%3Anormal” lineheight=”3″ titlesize=”40″ titleclr=”#000000″ image_id=”2854″][/vc_headings]

To begin, install PyTorch and its core dependencies. If you already have them installed, you can skip this step. For more information, see:

https://pytorch.org/get-started/locally/

Use the following command to install the required packages:

pip install torch torchvision torchaudio

Windows users with an Nvidia Ampere architecture card (RTX 30xx) need to install the PyTorch build that matches their CUDA version. See the experience shared in this GitHub issue:

https://github.com/liujing04/Retrieval-based-Voice-Conversion-WebUI/issues/21

Use the following command to install PyTorch built for a specific CUDA version (here CUDA 11.7) on Windows with an Nvidia Ampere architecture card:

pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu117
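After installing, it can help to confirm which PyTorch build is active. The following is a minimal sketch, not part of RVC; the helper name `describe_torch_install` is hypothetical. It only uses `torch.__version__`, `torch.version.cuda`, and `torch.cuda.is_available()`, and it degrades gracefully when PyTorch is missing.

```python
import importlib.util


def describe_torch_install():
    """Return a one-line summary of the local PyTorch installation."""
    if importlib.util.find_spec("torch") is None:
        return "PyTorch is not installed"
    import torch

    # torch.version.cuda is None for CPU-only builds.
    cuda_build = torch.version.cuda or "none (CPU-only build)"
    gpu_ok = torch.cuda.is_available()
    return (
        f"PyTorch {torch.__version__}, built for CUDA {cuda_build}, "
        f"GPU available: {gpu_ok}"
    )


print(describe_torch_install())
```

If the summary reports a CPU-only build on a machine with an RTX 30xx card, reinstalling with the `--index-url` command above is the likely fix.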

Next, install Poetry, the dependency management tool. If you already have it installed, you can skip this step. Installation instructions are available at: https://python-poetry.org/docs/#installation

Use the following command to install Poetry:

curl -sSL https://install.python-poetry.org | python3 -

Finally, install the dependencies required for the project with the following command:

poetry install

Alternatively, you can install the dependencies with pip. Note that faiss 1.7.2 raises Segmentation Fault: 11 on macOS; if you install faiss manually with pip, use pip install faiss-cpu==1.7.0 instead.

pip install -r requirements.txt
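
The faiss caveat above can be checked programmatically. This is a sketch under stated assumptions: the helper names `faiss_warning` and `installed_faiss_version` are hypothetical, and the package is assumed to be installed under the distribution name `faiss-cpu`, as in the pip command above.

```python
import platform
from importlib import metadata


def faiss_warning(system, faiss_version):
    """Return a warning string for the known-bad combination, else None."""
    if system == "Darwin" and faiss_version == "1.7.2":
        return "faiss 1.7.2 may segfault on macOS; try faiss-cpu==1.7.0"
    return None


def installed_faiss_version():
    """Return the installed faiss-cpu version, or None if it is absent."""
    try:
        return metadata.version("faiss-cpu")
    except metadata.PackageNotFoundError:
        return None


# platform.system() returns "Darwin" on macOS.
print(faiss_warning(platform.system(), installed_faiss_version()))
```

Keeping the decision logic in `faiss_warning` separate from the lookup makes it easy to check the rule on any machine, whether or not faiss is installed.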
[vc_headings style=”theme4″ borderclr=”#000000″ style2=”image” title=”Other Pre-Models Preparation” google_fonts=”font_family:Comfortaa%3A300%2Cregular%2C700|font_style:700%20bold%20regular%3A700%3Anormal” lineheight=”3″ titlesize=”40″ titleclr=”#000000″ image_id=”2854″][/vc_headings]

RVC AI depends on several pre-trained models for inference and training. You can download them from the project's Hugging Face space.

These are the pre-trained models and other files that RVC uses:

hubert_base.pt

./pretrained 

./uvr5_weights

The v2 models take a 768-dimensional input from the 12-layer HuBERT and use 3 period discriminators, whereas the earlier version takes a 256-dimensional input from the 9-layer HuBERT plus final_proj. To use the v2 models, you need to download these additional pre-trained files:

./pretrained_v2

# If you are using Windows, you may also need this file; skip it if FFmpeg is already installed
ffmpeg.exe
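
Before launching, it can be useful to verify that the files listed above are in place. The sketch below assumes they live directly under the project root, as the `./` prefixes suggest; the helper names `missing_assets` and `ffmpeg_available` are hypothetical and not part of RVC.

```python
import shutil
from pathlib import Path

# Names taken from the list above; pretrained_v2 is only needed for v2 models.
REQUIRED = ["hubert_base.pt", "pretrained", "uvr5_weights"]
OPTIONAL_V2 = ["pretrained_v2"]


def missing_assets(root, want_v2=False):
    """Return the expected names that do not exist under root."""
    root = Path(root)
    names = REQUIRED + (OPTIONAL_V2 if want_v2 else [])
    return [name for name in names if not (root / name).exists()]


def ffmpeg_available(root):
    """True if FFmpeg is on PATH or an ffmpeg.exe sits in root."""
    on_path = shutil.which("ffmpeg") is not None
    return on_path or (Path(root) / "ffmpeg.exe").exists()


if __name__ == "__main__":
    print("Missing:", missing_assets(".", want_v2=True))
    print("FFmpeg found:", ffmpeg_available("."))
```

Run it from the project root; an empty "Missing" list means every expected file and folder was found.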

Then use this command to start the WebUI:

python infer-web.py

Windows users can also download RVC-beta.7z and extract it to run RVC directly. To launch the WebUI, run go-web.bat.



Belmechri

I am an IT engineer, content creator, and proud father with a passion for innovation and excellence. In both my personal and professional life, I strive for excellence and am committed to finding innovative solutions to complex problems.