-
Notifications
You must be signed in to change notification settings - Fork 1.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
question about standardization #55
Comments
Hi @yuffon I think we just subtract the training means and stds in all testing, this is standard in machine learning. |
Yes, this is common in ML. What if the input image is not from the 10 categories in cifar10 dataset? |
I don't think one should preprocess the images by its own mean and stds, that causes different input to be changed by different amounts. CIFAR is not a good dataset to transfer from, I think if you do transfer learning it's better to start with ImageNet models. In this case, you still subtract imageNet training data's mean and stds. |
In TensorFlow's official tutorial Resnet repository, they use per image standardization. |
If I make a new network that trained on cifar10, but I want to test the model on image out of the cifar10 dataset and observe the network behavior on the out-of-distribution data. How should I normalize the data? |
I think the common practice is to just normalize the data as the way it is normalized during training (and using the training stats). |
I find that in the training and testing phase, the dataset is standardized as a whole batch, including computing the mean and variance from the entire cifar10 dataset.
However, when the model is deployed, the image is feed individually, how should we preprocess the image?
What mean and variance should we use when the input image is of the same category but not included in the cifar10 dataset?
The text was updated successfully, but these errors were encountered: