24

VITS-fast-fine-tuning 训练自己的声音

这个代码库将指导您在不到1小时的时间内将您自己的角色声音,甚至是您自己的声音,添加到现有的VITS TTS模型中,使其能够执行以下任务。

Refer to

Install

pyenv virtualenv 3.10.6 vits
pyenv activate vits
git clone https://github.com/Plachtaa/VITS-fast-fine-tuning.git
pip install -r requirements.txt
pip install torch==2.1.0 torchvision==0.16.0 torchaudio==2.1.0 --index-url https://download.pytorch.org/whl/cu118
 
Uninstalling torchaudio-2.1.0:
Successfully uninstalled torchaudio-2.1.0
Successfully installed cmake-3.25.0 lit-15.0.7 torch-2.0.0+cu118 torchaudio-2.0.1+cu118 torchvision-0.15.1+cu118 triton-2.0.0
 
cp exam1.c /usr/ wang/
python scripts/video2audio.py python scripts/denoise_audio
python scripts/long_audio_transcribe.py --languages "{C}" --whisper_size large
python scripts/short_audio_transcribe.py --languages "{C}" --whisper_size large
python scripts/resample.pypython preprocess_v2.py --languages "{C}"
 
python finetune_speaker_v2.py -m ./OUTPUT_MODEL --max_epochs "{3000}" --drop_speaker_embed True
tensorboard --logdir=./OUTPUT_MODEL
cp ./configs/modified_finetune_speaker.json ./finetune_speaker.json
python VC_inference.py --model_dir ./OUTPUT_MODEL/G_latest.pth --share True

Installation Process

You can follow the next steps to install the repo.