CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning

cvlab-columbia/CATER

 
 



[project page] [paper]

If this code helps with your work, please cite:

Rohit Girdhar and Deva Ramanan. CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning. In International Conference on Learning Representations (ICLR), 2020.

@inproceedings{girdhar2020cater,
    title = {{CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning}},
    author = {Girdhar, Rohit and Ramanan, Deva},
    booktitle = {ICLR},
    year = 2020
}

Dataset

A pre-generated sample of the dataset used in the paper is provided here. To generate your own version of the dataset, follow the instructions in the generate folder.

Baselines

We provide code and basic setup instructions for some of the baselines in the baselines folder.

Acknowledgements

This code was built upon the CLEVR codebase and various video recognition codebases for baselines (especially Non-Local). Many thanks to those authors for making their code available!

License

CATER is Apache 2.0 licensed, as found in the LICENSE file.
