Document CLIP model source in embedder.py and README

Model downloads automatically from HuggingFace Hub on first use. No manual download required. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-05-20 12:04:10 +02:00
parent 64ee10c182
commit 4a82352391
3 changed files with 15 additions and 0 deletions
@@ -439,6 +439,15 @@ pip3 install -r pgvector-demo/backend/requirements.txt --break-system-packages
 pip3 install -r oravector-demo/backend/requirements.txt --break-system-packages
 ```

+**CLIP model** — not included in the repository. It is downloaded automatically from
+Hugging Face Hub on first use (~600 MB, cached in `~/.cache/huggingface/hub/`):
+
+> `sentence-transformers/clip-ViT-B-32`
+> https://huggingface.co/sentence-transformers/clip-ViT-B-32
+
+No manual download is required — `sentence-transformers` handles this transparently
+when `index_images.py` or a backend is started for the first time.
+
 ### 1. PostgreSQL

 **Start the container:**
@@ -6,6 +6,9 @@ _model = None
 def _get_model():
    # Lazy load: the CLIP model is ~600 MB and takes several seconds to initialise.
    # Loading on first call avoids the cost at import time and during indexing warmup.
+    # Downloaded automatically from Hugging Face Hub on first use:
+    # https://huggingface.co/sentence-transformers/clip-ViT-B-32
+    # Cached in ~/.cache/huggingface/hub/
    global _model
    if _model is None:
        _model = SentenceTransformer("clip-ViT-B-32")
@@ -6,6 +6,9 @@ _model = None
 def _get_model():
    # Lazy load: the CLIP model is ~600 MB and takes several seconds to initialise.
    # Loading on first call avoids the cost at import time and during indexing warmup.
+    # Downloaded automatically from Hugging Face Hub on first use:
+    # https://huggingface.co/sentence-transformers/clip-ViT-B-32
+    # Cached in ~/.cache/huggingface/hub/
    global _model
    if _model is None:
        _model = SentenceTransformer("clip-ViT-B-32")