NVIDIA Mixtral 8x7B quantized model #8

itayhubara · 2024-12-10T10:27:49Z

I can't reproduce the steps that led NVIDIA from the reference model to the FP8 model. They simply give the model weights in the container. More specifically, how can I know that they didn't finetune the model (retrain the weights) to achieve the target accuracy.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

NVIDIA Mixtral 8x7B quantized model #8

NVIDIA Mixtral 8x7B quantized model #8

itayhubara commented Dec 10, 2024

NVIDIA Mixtral 8x7B quantized model #8

NVIDIA Mixtral 8x7B quantized model #8

Comments

itayhubara commented Dec 10, 2024