TWLV-I: A New Video Foundation Model that Constructs Robust Visual Representations for both Motion and Appearance-based Videos

TWLV-I: A New Video Foundation Model that Constructs Robust Visual Representations for both Motion and Appearance-based Videos