You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@@ -30,6 +30,19 @@ If you find a better voice match for `tts-1` or `tts-1-hd`, please let me know s
30
30
31
31
## Recent Changes
32
32
33
+
Version 0.17.2, 2024-07-01
34
+
35
+
* fix -min image (re: langdetect)
36
+
37
+
Version 0.17.1, 2024-07-01
38
+
39
+
* fix ROCm (add langdetect to requirements-rocm.txt)
40
+
* Fix zh-cn for xtts
41
+
42
+
Version 0.17.0, 2024-07-01
43
+
44
+
* Automatic language detection, thanks [@RodolfoCastanheira](https://github.com/RodolfoCastanheira)
45
+
33
46
Version 0.16.0, 2024-06-29
34
47
35
48
* Multi-client safe version. Audio generation is synchronized in a single process. The estimated 'realtime' factor of XTTS on a GPU is roughly 1/3, this means that multiple streams simultaneously, or `speed` over 2, may experience audio underrun (delays or pauses in playback). This makes multiple clients possible and safe, but in practice 2 or 3 simultaneous streams is the maximum without audio underrun.
@@ -58,7 +71,7 @@ Version 0.14.0, 2024-06-26
58
71
Version 0.13.0, 2024-06-25
59
72
60
73
* Added [Custom fine-tuned XTTS model support](#custom-fine-tuned-model-support)
61
-
* Initial prebuilt arm64 image support (Apple M-series, Raspberry Pi - MPS is not supported in XTTS/torch), thanks @JakeStevenson, @hchasens
74
+
* Initial prebuilt arm64 image support (Apple M-series, Raspberry Pi - MPS is not supported in XTTS/torch), thanks [@JakeStevenson](https://github.com/JakeStevenson), [@hchasens](https://github.com/hchasens)
62
75
* Initial attempt at AMD GPU (ROCm 5.7) support
63
76
* Parler-tts support removed
64
77
* Move the *.default.yaml to the root folder
@@ -88,7 +101,7 @@ Version 0.11.0, 2024-05-29
88
101
89
102
Version: 0.10.1, 2024-05-05
90
103
91
-
* Remove `runtime: nvidia` from docker-compose.yml, this assumes nvidia/cuda compatible runtime is available by default. thanks @jmtatsch
104
+
* Remove `runtime: nvidia` from docker-compose.yml, this assumes nvidia/cuda compatible runtime is available by default. thanks [@jmtatsch](https://github.com/jmtatsch)
92
105
93
106
Version: 0.10.0, 2024-04-27
94
107
@@ -318,7 +331,7 @@ tts-1-hd:
318
331
319
332
Multilingual cloning support was added in version 0.11.0 and is available only with the XTTS v2 model. To use multilingual voices with piper simply download a language specific voice.
320
333
321
-
Coqui XTTSv2 has support for 16 languages: English (`en`), Spanish (`es`), French (`fr`), German (`de`), Italian (`it`), Portuguese (`pt`), Polish (`pl`), Turkish (`tr`), Russian (`ru`), Dutch (`nl`), Czech (`cs`), Arabic (`ar`), Chinese (`zh-cn`), Japanese (`ja`), Hungarian (`hu`) and Korean (`ko`).
334
+
Coqui XTTSv2 has support for multiple languages: English (`en`), Spanish (`es`), French (`fr`), German (`de`), Italian (`it`), Portuguese (`pt`), Polish (`pl`), Turkish (`tr`), Russian (`ru`), Dutch (`nl`), Czech (`cs`), Arabic (`ar`), Chinese (`zh-cn`), Hungarian (`hu`), Korean (`ko`), Japanese (`ja`), and Hindi (`hi`). When not set, an attempt will be made to automatically detect the language, falling back to English (`en`).
322
335
323
336
Unfortunately the OpenAI API does not support language, but you can create your own custom speaker voice and set the language for that.
parser.add_argument('--openai-model', action='store', default="tts-1-hd", help="Set the openai model for the voice")
17
17
parser.add_argument('--xtts-model', action='store', default="xtts", help="Set the xtts model for the voice (if using a custom model, also set model_path)")
18
18
parser.add_argument('--model-path', action='store', default=None, help="Set the path for a custom xtts model")
0 commit comments