Model Configurations
dhansmair edited this page Sep 13, 2022 · 14 revisions
| | flamingo 3B | flamingo-mini (ours) | flamingo-tiny (ours) |
|---|---|---|---|
| language model | chinchilla | OPT-350m | OPT-125m |
| # params | 1.4B | | |
| # layers | 24 | | |
| # heads | 16 | | |
| embedding size | 2048 | | |
| number of tokens | 32000 | 50256 | 50256 |
| vision encoder | NFNet-F6 | ViT-L | ViT-L |
| output shape | --- | 257 x 1024 | 257 x 1024 |
| **resampler** | | | |
| # params | | | |
| # heads | 16 | | |
| # layers | 6 | 6 | 6 |
| hidden size | 1536 | | |
| KV size | 128 | | |
| # latents | 64 | 64 | 64 |
| activation function | sq. ReLU | sq. ReLU | sq. ReLU |
| **xattn dense** | | | |
| # params | | | |
| # heads | 16 | | |
| # layers | 24 | | |
| hidden size | 2048 | | |
| KV size | 128 | | |
| activation function | sq. ReLU | sq. ReLU | sq. ReLU |
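
For use in scripts, the table can be mirrored as plain Python data. The following is a minimal sketch, not the repository's actual configuration API: the class names `BlockConfig` and `ModelConfig`, all field names, and the `CONFIGS` dictionary are hypothetical and introduced only for illustration. Cells the table leaves empty are stored as `None` rather than guessed. The helper `squared_relu` assumes that "sq. ReLU" stands for squared ReLU, i.e. `max(0, x) ** 2`.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


def squared_relu(x: float) -> float:
    """Squared ReLU, max(0, x) ** 2 (assumed meaning of 'sq. ReLU')."""
    return max(0.0, x) ** 2


@dataclass(frozen=True)
class BlockConfig:
    """Hypothetical container for the 'resampler' and 'xattn dense' rows."""
    n_layers: Optional[int] = None
    n_heads: Optional[int] = None
    hidden_size: Optional[int] = None
    kv_size: Optional[int] = None
    n_latents: Optional[int] = None  # only listed for the resampler
    activation: str = "sq. ReLU"     # same value in every column


@dataclass(frozen=True)
class ModelConfig:
    """Hypothetical container for one column of the table."""
    language_model: str
    n_tokens: int
    vision_encoder: str
    vision_output_shape: Optional[Tuple[int, int]]
    resampler: BlockConfig
    xattn_dense: BlockConfig
    lm_params: Optional[str] = None
    lm_layers: Optional[int] = None
    lm_heads: Optional[int] = None
    embedding_size: Optional[int] = None


# None marks a cell that is empty (or '---') in the table above.
CONFIGS = {
    "flamingo 3B": ModelConfig(
        language_model="chinchilla",
        n_tokens=32000,
        vision_encoder="NFNet-F6",
        vision_output_shape=None,
        resampler=BlockConfig(n_layers=6, n_heads=16, hidden_size=1536,
                              kv_size=128, n_latents=64),
        xattn_dense=BlockConfig(n_layers=24, n_heads=16, hidden_size=2048,
                                kv_size=128),
        lm_params="1.4B",
        lm_layers=24,
        lm_heads=16,
        embedding_size=2048,
    ),
    "flamingo-mini": ModelConfig(
        language_model="OPT-350m",
        n_tokens=50256,
        vision_encoder="ViT-L",
        vision_output_shape=(257, 1024),
        resampler=BlockConfig(n_layers=6, n_latents=64),
        xattn_dense=BlockConfig(),
    ),
    "flamingo-tiny": ModelConfig(
        language_model="OPT-125m",
        n_tokens=50256,
        vision_encoder="ViT-L",
        vision_output_shape=(257, 1024),
        resampler=BlockConfig(n_layers=6, n_latents=64),
        xattn_dense=BlockConfig(),
    ),
}
```

A lookup such as `CONFIGS["flamingo-mini"].resampler.n_latents` then returns the value from the corresponding table cell, and a `None` result signals that the table does not yet specify it.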
TODO