-
Notifications
You must be signed in to change notification settings - Fork 19
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Time-domain loss #8
Comments
@alexdemartos, I am also skeptic about time domain loss, but we expect muffled or metallic artifacts before discriminator kicks actually that's why we use discriminator to remove those artifacts. |
@rishikksh20 and @alexdemartos: anyone has plots of how the time-domain loss is varying with training? |
Also @rishikksh20: is the phase loss computed across different time scales (as in the paper)? I guess not. |
@Miralan implements the Time domain losses hope she might gives good input on this, I haven't rechecked this code for while as I am busy with other model implementation. Otherwise I will check myself within a week and let you know regarding this. |
OK, thanks @rishikksh20. No worries, in my work (which is different from MelGAN) time-loss seems to behave as expected. So, I feel the code is OK as such. |
Did my implemention of time domian loss have any problems? |
Hi @Miralan: No it was OK. Only question I had was: |
Thanks for your advice, I am sorry for that it seems like that I forgot to implement the case of frame size=1 and hop size=1.And phase loss can also apply on multi-scale, but I don't think it has much impact @rishikksh20 . |
@Miralan yes I think that not have much impact on overall quality, it just helps to removes minute artifacts and jitters. |
Also @Miralan have you worked with Wavenet architecture ? |
No, I just used it on GAN based parallel vocoder. |
Thanks for including multi-scale phase loss. Testing your code (https://github.com/rishikksh20/TFGAN/blob/main/utils/timeloss.py) with
WORKS! However, when I run the training in my framework it throws an error:
Any idea why it is? |
It seems like that out of memory and you can try to use batchsize 2 instead of 32. |
@Miralan works :-) |
Hi,
I am just wondering wether the time-domain loss is working properly. I just noticed that the samples before the discriminator comes into play (>200K steps) are a bit muffled/noisy. After that, the GAN training scheme seems to be helping the audio quality and reducing this effect.
TFGAN_samples.zip
What is your experience? Is this muffled noise to be expected before the discriminator network comes in?
Thanks in advance, and thank you for sharing your work.
The text was updated successfully, but these errors were encountered: