Introduction
Performing vision tasks effectively requires a strong, general-purpose vision backbone. A single backbone of this kind can support many downstream tasks, such as:
Image-to-image similarity: For comparison or retrieval.
Vision adapters: Add a...