Masked Visual Pre-training for Motor Control
This is a PyTorch implementation of the paper Masked Visual Pre-training for Motor Control. It contains the benchmark suite, pre-trained models, and the training code to reproduce the results from the paper.
Installation
Please see INSTALL.md for installation instructions.
Pre-trained visual encoders
We provide the pre-trained visual encoders used in the paper. The checkpoints are in the same format as mae and timm:
backbone | objective | data | md5 | download |
---|---|---|---|---|
ViT-S | MAE | in-the-wild | model | |
ViT-S | MAE | ImageNet | model | |
ViT-S | Supervised | ImageNet | model | |
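
After downloading a checkpoint, it can be worth checking the file against the value in the md5 column before loading it. The snippet below is a minimal sketch using only the Python standard library. It assumes the md5 column lists the hex MD5 digest of each checkpoint file. The helper names `file_md5` and `verify_checkpoint` are illustrative and are not part of this repository.

```python
import hashlib


def file_md5(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as f:
        # Read fixed-size chunks so large checkpoints are never fully in memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_checkpoint(path, expected_md5):
    """Return True if the file's MD5 matches the expected hex digest."""
    # Ignore surrounding whitespace and letter case in the published value.
    return file_md5(path) == expected_md5.strip().lower()
```

A call might look like `verify_checkpoint("path/to/checkpoint.pth", "<md5 from the table>")`, where both the path and the digest are placeholders to fill in with the downloaded file and the value listed for that model.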
By default,