Measuring CLEVRness




Blackbox testing of Visual Reasoning Models