You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@@ -30,6 +30,10 @@ If you find a better voice match for `tts-1` or `tts-1-hd`, please let me know s
30
30
31
31
## Recent Changes
32
32
33
+
Version 0.17.0, 2024-07-01
34
+
35
+
* Automatic language detection, thanks [@RodolfoCastanheira](https://github.com/RodolfoCastanheira)
36
+
33
37
Version 0.16.0, 2024-06-29
34
38
35
39
* Multi-client safe version. Audio generation is synchronized in a single process. The estimated 'realtime' factor of XTTS on a GPU is roughly 1/3, this means that multiple streams simultaneously, or `speed` over 2, may experience audio underrun (delays or pauses in playback). This makes multiple clients possible and safe, but in practice 2 or 3 simultaneous streams is the maximum without audio underrun.
@@ -58,7 +62,7 @@ Version 0.14.0, 2024-06-26
58
62
Version 0.13.0, 2024-06-25
59
63
60
64
* Added [Custom fine-tuned XTTS model support](#custom-fine-tuned-model-support)
61
-
* Initial prebuilt arm64 image support (Apple M-series, Raspberry Pi - MPS is not supported in XTTS/torch), thanks @JakeStevenson, @hchasens
65
+
* Initial prebuilt arm64 image support (Apple M-series, Raspberry Pi - MPS is not supported in XTTS/torch), thanks [@JakeStevenson](https://github.com/JakeStevenson), [@hchasens](https://github.com/hchasens)
62
66
* Initial attempt at AMD GPU (ROCm 5.7) support
63
67
* Parler-tts support removed
64
68
* Move the *.default.yaml to the root folder
@@ -88,7 +92,7 @@ Version 0.11.0, 2024-05-29
88
92
89
93
Version: 0.10.1, 2024-05-05
90
94
91
-
* Remove `runtime: nvidia` from docker-compose.yml, this assumes nvidia/cuda compatible runtime is available by default. thanks @jmtatsch
95
+
* Remove `runtime: nvidia` from docker-compose.yml, this assumes nvidia/cuda compatible runtime is available by default. thanks [@jmtatsch](https://github.com/jmtatsch)
92
96
93
97
Version: 0.10.0, 2024-04-27
94
98
@@ -252,13 +256,9 @@ tts-1:
252
256
model: voices/en_US-ryan-high.onnx
253
257
speaker: # default speaker
254
258
```
255
-
3. New models will be downloaded as needed, or you can download them in advance with `download_voices_tts-1.sh`. For example:
259
+
3. New models will be downloaded as needed, of you can download them in advance with `download_voices_tts-1.sh`. For example:
Multilingual cloning support was added in version 0.11.0 and is available only with the XTTS v2 model. To use multilingual voices with piper simply download a language specific voice.
301
301
302
-
Coqui XTTSv2 has support for 16 languages: English (`en`), Spanish (`es`), French (`fr`), German (`de`), Italian (`it`), Portuguese (`pt`), Polish (`pl`), Turkish (`tr`), Russian (`ru`), Dutch (`nl`), Czech (`cs`), Arabic (`ar`), Chinese (`zh-cn`), Japanese (`ja`), Hungarian (`hu`) and Korean (`ko`).
302
+
Coqui XTTSv2 has support for multiple languages: English (`en`), Spanish (`es`), French (`fr`), German (`de`), Italian (`it`), Portuguese (`pt`), Polish (`pl`), Turkish (`tr`), Russian (`ru`), Dutch (`nl`), Czech (`cs`), Arabic (`ar`), Chinese (`zh-cn`), Hungarian (`hu`), Korean (`ko`), Japanese (`ja`), and Hindi (`hi`). When not set, an attempt will be made to automatically detect the language, falling back to English (`en`).
303
303
304
304
Unfortunately the OpenAI API does not support language, but you can create your own custom speaker voice and set the language for that.
0 commit comments