-
Notifications
You must be signed in to change notification settings - Fork 22
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Does the prosody codes[0] work? #10
Comments
what kind of test did you performed on prosody code? |
I took two different sound bytes and changed the line z2 = model.encoder(codes[0], codes[1], timbre2, use_p_code=False, n_c=1) line 103 to z2 = model.encoder(codes2[0], codes[1], timbre, use_p_code=False, n_c=1) this should make it so it only outputs the file with prosody changed between the two different sound files right? But if you look at the resulting files they appear to be identical for the the unedited reconstruction vs the new one that should have different prosody. I also tried changing the code to this, which changed content and prosody. z2 = model.encoder(codes[0], codes2[1], timbre, use_p_code=False, n_c=1) Sorry if I got anything wrong I'm a novice at this but isnt the prosody kind of like the emotion and timing of the speech? I did add some lines to pad both audio files to same length, but I don't think that should affect the prosody. def main(args):
|
Thanks for your experiment. It was very helpful for us to understand what exactly the prosody component stands for. |
I tried to test the code some specifically for prosody but it seemed like the prosody was tied to codes[1] with the content?
The text was updated successfully, but these errors were encountered: