News

Ever looked at a croissant and thought, “This belongs in a chicken’s body”? Diego Cusano has, and he made it art. Calling ...
Pre-trained multi-modal Vision-Language Models like CLIP are widely used off-the-shelf for various applications. Our code currently supports 25+ datasets for the tasks of image-to-image retrieval, ...