
Masked Autoencoders: A PyTorch Implementation

This is a PyTorch/GPU re-implementation of the paper Masked Autoencoders Are Scalable Vision Learners:

@Article{MaskedAutoencoders2021,
  author  = {Kaiming He and Xinlei Chen and Saining Xie and Yanghao Li and Piotr Doll{\'a}r and Ross Girshick},
  journal = {arXiv:2111.06377},
  title   = {Masked Autoencoders Are Scalable Vision Learners},
  year    = {2021},
}
  • The original implementation was in TensorFlow+TPU. This re-implementation is in PyTorch+GPU.

  • This repo is a modification on the DeiT repo. Installation and preparation follow that repo.

  • This repo is based on timm==0.3.2, for which a fix is needed to work with PyTorch 1.8.1+.
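A minimal sketch of the kind of workaround typically applied (the file path and version check here are assumptions, not the repo's official patch) edits timm/models/layers/helpers.py so that it no longer imports container_abcs from torch._six on newer PyTorch:

```python
# timm/models/layers/helpers.py (timm==0.3.2) -- assumed patch location.
# torch._six.container_abcs is gone in newer PyTorch releases, so fall back
# to the standard library there.
import torch

TORCH_MAJOR, TORCH_MINOR = (int(v) for v in torch.__version__.split(".")[:2])

if TORCH_MAJOR == 1 and TORCH_MINOR < 8:
    from torch._six import container_abcs
else:
    import collections.abc as container_abcs
```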

Catalog

  • Visualization demo
  • Pre-trained checkpoints + fine-tuning code
  • Pre-training code

Visualization demo

Run our interactive visualization demo using a Colab notebook (no GPU needed).
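As a rough sketch of what the demo does (the models_mae module, mae_vit_large_patch16 constructor, checkpoint filename, and 'model' key below are assumptions based on the released code; the notebook is the authoritative reference), a pre-trained MAE can reconstruct a heavily masked image like this:

```python
# Sketch only: assumes the repo's models_mae module and a downloaded
# visualization checkpoint (names are assumptions; see the demo notebook).
import torch
import models_mae

model = models_mae.mae_vit_large_patch16()            # MAE with a ViT-Large encoder
ckpt = torch.load("mae_visualize_vit_large.pth", map_location="cpu")
model.load_state_dict(ckpt["model"], strict=False)
model.eval()

img = torch.randn(1, 3, 224, 224)                     # stand-in for a normalized image
with torch.no_grad():
    loss, pred, mask = model(img, mask_ratio=0.75)    # randomly mask 75% of patches
    recon = model.unpatchify(pred)                    # patch predictions -> image tensor
print(recon.shape)                                    # torch.Size([1, 3, 224, 224])
```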

Fine-tuning with pre-trained checkpoints

The following table provides the pre-trained checkpoints used in the paper, converted from TF/TPU to PT/GPU:

                          ViT-Base    ViT-Large   ViT-Huge
pre-trained checkpoint    download    download    download
md5                       8cad7c      b8b06e      9bdbb0
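As a quick sanity check after downloading, the truncated md5 values above can be compared against the local file; a minimal sketch (the filename below is an assumption; use whatever name the download saves to):

```python
# Verify a downloaded checkpoint against the (truncated) md5 from the table,
# then load it as a regular PyTorch checkpoint.
import hashlib
import torch

ckpt_path = "mae_pretrain_vit_base.pth"   # assumed filename for the ViT-Base download
expected_prefix = "8cad7c"                # ViT-Base md5 entry from the table above

with open(ckpt_path, "rb") as f:
    digest = hashlib.md5(f.read()).hexdigest()
assert digest.startswith(expected_prefix), f"unexpected md5: {digest}"

checkpoint = torch.load(ckpt_path, map_location="cpu")
print(list(checkpoint.keys()))            # inspect what the checkpoint contains
```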

The fine-tuning instructions are in FINETUNE.md.

By fine-tuning these pre-trained models, we rank #1 in these classification tasks (detailed in the paper):

                                   ViT-B   ViT-L   ViT-H   ViT-H448   prev best
ImageNet-1K (no external data)     83.6    85.9    86.9    87.8       87.1

The following are evaluations of the same model weights (fine-tuned on the original ImageNet-1K):

                                   ViT-B   ViT-L   ViT-H   ViT-H448   prev best
ImageNet-Corruption (error rate)   51.7    41.8    33.8    36.8       42.5
ImageNet-Adversarial               35.9    57.1    68.2    76.7       35.8
ImageNet-Rendition                 48.3    59.9    64.4    66.5       48.7
ImageNet-Sketch                    34.5    45.3    49.6    50.9       36.0

The following are transfer learning results, obtained by fine-tuning the pre-trained MAE on the target dataset:

                                   ViT-B   ViT-L   ViT-H   ViT-H448   prev best
iNaturalist 2017                   70.5    75.7    79.3    83.4       75.4
iNaturalist 2018                   75.4    80.1    83.0    86.8       81.2
iNaturalist 2019                   80.5    83.4    85.7    88.3       84.1
Places205                          63.9    65.8    65.9    66.8       66.0
Places365                          57.9    59.4    59.8    60.3       58.0

Pre-training

The pre-training instructions are in PRETRAIN.md.
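For orientation, the recipe from the paper is to mask a large random subset of image patches (75% by default) and train the autoencoder to reconstruct the missing ones; a toy sketch of that random-masking step (an illustration of the idea, not the repo's implementation):

```python
# Toy illustration of MAE-style random patch masking (not the repo's code):
# keep a random 25% of patch tokens; the loss is computed only on masked patches.
import torch

def random_masking(patches: torch.Tensor, mask_ratio: float = 0.75):
    """patches: (batch, num_patches, dim) -> kept patches, binary mask, restore ids."""
    n, l, d = patches.shape
    len_keep = int(l * (1 - mask_ratio))

    noise = torch.rand(n, l)                       # one random score per patch
    ids_shuffle = torch.argsort(noise, dim=1)      # low-noise patches are kept
    ids_restore = torch.argsort(ids_shuffle, dim=1)

    ids_keep = ids_shuffle[:, :len_keep]
    kept = torch.gather(patches, 1, ids_keep.unsqueeze(-1).repeat(1, 1, d))

    mask = torch.ones(n, l)                        # 1 = masked, 0 = kept
    mask[:, :len_keep] = 0
    mask = torch.gather(mask, 1, ids_restore)      # unshuffle to the original patch order
    return kept, mask, ids_restore

x = torch.randn(2, 196, 768)                       # e.g. 14x14 patches of a 224x224 image
kept, mask, _ = random_masking(x)
print(kept.shape, mask.sum(dim=1))                 # (2, 49, 768); 147 patches masked per image
```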

License

This project is under the CC-BY-NC 4.0 license. See LICENSE for details.

