I trained the matcha_tts model on speech from KeQing (a character in the game GenShin), but the speech synthesized with the trained model is silent.
However, the synthesized output does have a mel spectrogram.
(Image: the synthesized mel spectrogram)
Has this ever happened to you? What do you think is the cause?
Thank you.
Could you try replacing the vocoder with BigVGAN, or even Griffin-Lim, for testing? I haven't faced this issue before; HiFi-GAN usually works just fine for the audio I have tested it with.
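Alongside a vocoder swap, it may help to measure how silent the generated file actually is, since an all-zero waveform and a very quiet one point to different problems. The following is a minimal sketch using only the Python standard library. It assumes the output was saved as an integer PCM WAV file (16, 24, or 32 bit); the function names and the `1e-3` threshold are hypothetical choices for illustration and are not part of matcha_tts.

```python
import math
import wave


def wav_level_stats(path):
    """Return (peak, rms) of an integer PCM WAV file, scaled to the range [0, 1]."""
    with wave.open(path, "rb") as wf:
        width = wf.getsampwidth()  # bytes per sample
        raw = wf.readframes(wf.getnframes())
    if width not in (2, 3, 4):
        raise ValueError("this sketch only handles 16/24/32-bit integer PCM")
    full_scale = float(1 << (8 * width - 1))
    # Decode little-endian signed samples (all channels pooled together).
    samples = [
        int.from_bytes(raw[i:i + width], "little", signed=True)
        for i in range(0, len(raw) - width + 1, width)
    ]
    if not samples:
        return 0.0, 0.0
    peak = max(abs(s) for s in samples) / full_scale
    rms = math.sqrt(sum(s * s for s in samples) / len(samples)) / full_scale
    return peak, rms


def classify_level(peak, threshold=1e-3):
    """Label a peak level; the threshold is an arbitrary illustrative value."""
    if peak == 0.0:
        return "all-zero"
    if peak < threshold:
        return "near-silent"
    return "audible"
```

For example, `peak, rms = wav_level_stats("utterance.wav")` followed by `classify_level(peak)`, where `utterance.wav` stands in for the synthesized file. If the mel spectrogram looks plausible but the result is "all-zero" or "near-silent", that would be consistent with the problem sitting in the vocoder stage rather than in the acoustic model, which is what the vocoder swap is meant to test.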