A Simple Strong Baseline for TextVQA and TextCaps
Simple is not Easy
Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps[AAAI2021]
Citation
If you use ssbaseline in your work, please cite:
@article{zhu2020simple,
title={Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps},
author={Zhu, Qi and Gao, Chenyu and Wang, Peng and Wu, Qi},
journal={arXiv preprint arXiv:2012.05153},
year={2020}
}
Installation
First install the repo using
git clone https://github.com/ZephyrZhuQi/ssbaseline.git ~/ssbaseline
cd ~/ssbaseline
python setup.py build develop
Getting Data
We provide SBD-Trans OCR for TextVQA and ST-VQA datasets. The corresponding OCR Faster R-CNN features and Recog-CNN features are also released.
Pretrained Models
We release the following pretrained models for ssbaseline on TextVQA.
For the TextVQA dataset, we release: ssbaseline trained with ST-VQA as additional data (our