Abstract: Pre-trained vision-language models (VLMs) and language models (LMs) have recently garnered significant attention due to their remarkable ability to represent textual concepts, opening up new ...
Models can now be quickly used without using this repository using the following code. This can be set up also in other repositories. It has not been widely tested, so we welcome any bug reports / ...