RT-2: New model translates vision and language into action

RT-2: New model translates vision and language into action