Training documenation #11
Replies: 3 comments 3 replies
-
Yes, I am planning on releasing more information on how to train new models soon (hopefully in the next few weeks). Because there is so much data involved (easily 100's of GBs), I won't be able to share everything in exactly the same way that it is being used to produce the models. But I plan to share example notebooks demonstrating the data generation, preparation, and model training with a small set of example data to at least demonstrate the concept. In the meantime, I'm happy to answer any questions you have about training the models. You can also see a little bit of detail here on the text-to-speech models used for training data generation, and some more detail about the data preparation in the model documentation here. |
Beta Was this translation helpful? Give feedback.
-
@Technerder , sorry for the delay, but I'm now close to releasing more information on training new models. I'm still finishing the example notebooks and documentation, but in the meantime I've released the synthetic data generation code in a separate repository here: https://github.com/dscripka/synthetic_speech_dataset_generation Feel free to experiment with that code, and if you run into any problems create an issue on that repo and I'll try to help resolve it. |
Beta Was this translation helpful? Give feedback.
-
@Technerder there is now an example notebook in the repo that demonstrates training custom models: https://github.com/dscripka/openWakeWord/blob/main/notebooks/training_models.ipynb Feel free to post questions/comments on the custom model training process in this thread. |
Beta Was this translation helpful? Give feedback.
-
Are there any plans to release documentation relating to the training of custom models?
Beta Was this translation helpful? Give feedback.
All reactions