A collection of Variational AutoEncoders (VAEs) implemented in pytorch with focus on reproducibility. The aim of this project is to provide a quick and simple working example for many of the cool VAE models out there. All the models are trained on the CelebA dataset for consistency and comparison. The architecture of all the models are kept as similar as possible with the same layers, except for cases where the original paper necessitates a radically different architecture (Ex. VQ VAE uses Residual layers and no Batch-Norm, unlike other models). Here are the results of each model.
- Python >= 3.5
- PyTorch >= 1.6
- Pytorch Lightning >= 0.9.0
- Pytorch vision >= 0.7.0
- CUDA enabled computing device
$ Update Later
$ cd PyTorch-VAE
$ python run.py -c configs/<config-file-name.yaml>
Config file template
model_params:
name: "<name of VAE model>"
in_channels: 3
latent_dim:
. # Other parameters required by the model
.
.
exp_params:
data_path: "<path to the celebA dataset>"
img_size: 64 # Models are designed to work for this size
batch_size: 64 # Better to have a square number
LR: 0.005
weight_decay:
. # Other arguments required for training, like scheduler etc.
.
.
trainer_params:
gpus: 1
max_nb_epochs: 50
gradient_clip_val: 1.5
.
.
.
logging_params:
save_dir: "logs/"
name: "<experiment name>"
manual_seed:
View TensorBoard Logs
$ cd logs/<experiment name>/version_<the version you want>
$ tensorboard --logdir tf
Model | Paper | Reconstruction | Samples |
---|---|---|---|
VAE (Code, Config) | Link | ||
Conditional VAE (Code, Config) | Link | ||
WAE - MMD (RBF Kernel) (Code, Config) | Link | ||
WAE - MMD (IMQ Kernel) (Code, Config) | Link | ||
Beta-VAE (Code, Config) | Link | ||
Disentangled Beta-VAE (Code, Config) | Link | ||
Beta-TC-VAE (Code, Config) | Link | ||
IWAE (K = 5) (Code, Config) | Link | ||
MIWAE (K = 5, M = 3) (Code, Config) | Link | ||
DFCVAE (Code, Config) | Link | ||
MSSIM VAE (Code, Config) | Link | ||
Categorical VAE (Code, Config) | Link | ||
Joint VAE (Code, Config) | Link | ||
Info VAE (Code, Config) | Link | ||
LogCosh VAE (Code, Config) | Link | ||
SWAE (200 Projections) (Code, Config) | Link | ||
VQ-VAE (K = 512, D = 64) (Code, Config) | Link | N/A | |
DIP VAE (Code, Config) | Link |
If you have trained a better model, using these implementations, by fine-tuning the hyper-params in the config file, I would be happy to include your result (along with your config file) in this repo, citing your name 😊.
Additionally, if you would like to contribute some models, please submit a PR.
Apache License 2.0
Permissions | Limitations | Conditions |
---|---|---|
✔️ Commercial use | ❌ Trademark use | ⓘ License and copyright notice |
✔️ Modification | ❌ Liability | ⓘ State changes |
✔️ Distribution | ❌ Warranty | |
✔️ Patent use | ||
✔️ Private use |
@misc{Subramanian2020,
author = {Subramanian, A.K},
title = {PyTorch-VAE},
year = {2020},
publisher = {GitHub},
journal = {GitHub repository},
howpublished = {\url{https://github.com/AntixK/PyTorch-VAE}}
}