Probing Image-Language Transformers for Verb Understanding
