Measure the Cosine Similarity drift between the original CLIP and the P4 version.
Evaluate on MS-COCO and Flickr30K for Image-to-Text and Text-to-Image tasks.
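Retrieval on these benchmarks is usually reported as Recall@K. Here is a minimal sketch of that metric, assuming you have already computed an image-by-text similarity matrix for each model. The function names and the toy matrix are illustrative only, and the one-match-per-query setup is a simplification, since both datasets pair each image with several captions.

```python
from typing import List, Sequence


def recall_at_k(similarity: Sequence[Sequence[float]], k: int) -> float:
    """Fraction of queries whose ground-truth match is ranked in the top k.

    Assumes similarity[i][j] scores query i against candidate j, and that
    candidate i is the single correct match for query i.
    """
    hits = 0
    for i, row in enumerate(similarity):
        # Rank candidate indices by descending score and keep the first k.
        top_k = sorted(range(len(row)), key=lambda j: row[j], reverse=True)[:k]
        hits += i in top_k
    return hits / len(similarity)


def transpose(matrix: Sequence[Sequence[float]]) -> List[List[float]]:
    """Swap rows and columns so text queries index the rows."""
    return [list(col) for col in zip(*matrix)]


# Toy image-by-text similarity matrix (placeholder numbers, not results).
sim = [
    [0.9, 0.1, 0.3],
    [0.2, 0.4, 0.8],
    [0.1, 0.7, 0.6],
]

i2t_r1 = recall_at_k(sim, k=1)             # Image-to-Text
t2i_r1 = recall_at_k(transpose(sim), k=1)  # Text-to-Image
print(f"I2T R@1 = {i2t_r1:.3f}, T2I R@1 = {t2i_r1:.3f}")
```

Running the same function on the FP16 and 4-bit similarity matrices gives directly comparable numbers for both retrieval directions.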
How does 4-bit quantization affect the embedding space compared to FP16?
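One way to quantify this is a per-sample cosine drift between embeddings of the same inputs from both models. The sketch below defines drift as 1 minus cosine similarity, which is an assumption (other definitions are possible), and uses placeholder vectors in place of real CLIP outputs.

```python
import math
from typing import List, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def cosine_drift(
    reference: Sequence[Sequence[float]],
    quantized: Sequence[Sequence[float]],
) -> List[float]:
    """Per-sample drift, defined here as 1 - cos(reference_i, quantized_i).

    Row i of both inputs must be the embedding of the same image or caption.
    """
    return [1.0 - cosine_similarity(r, q) for r, q in zip(reference, quantized)]


# Placeholder embeddings standing in for FP16 and 4-bit model outputs.
fp16_embeddings = [[1.0, 0.0], [0.0, 2.0], [3.0, 4.0]]
p4_embeddings = [[1.0, 0.0], [0.0, 1.0], [4.0, 3.0]]

drift = cosine_drift(fp16_embeddings, p4_embeddings)
mean_drift = sum(drift) / len(drift)
print(f"mean drift = {mean_drift:.4f}, max drift = {max(drift):.4f}")
```

Reporting the mean and the maximum (or the full distribution) separately helps show whether quantization shifts all embeddings slightly or only a few inputs badly.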
Does the model struggle more with abstract concepts (art/logos) vs. natural images?
The model is roughly 1/3 the size of base models; argue its viability for "Always-on" AI features.
🌟 This model is built for speed. Your paper should lean heavily into the Efficiency-Accuracy Trade-off curve.
Highlight the reduction in model weight (e.g., from ~300MB to ~30MB).
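To state the size reduction precisely, a small helper can turn the two file sizes into a compression ratio and a percentage saved. The sizes below are the approximate figures quoted above, and the function name is hypothetical. Replace the inputs with the measured sizes of your checkpoints.

```python
from typing import Tuple


def size_reduction(before_mb: float, after_mb: float) -> Tuple[float, float]:
    """Return (compression ratio, percent of the original size saved)."""
    ratio = before_mb / after_mb
    percent_saved = (before_mb - after_mb) / before_mb * 100.0
    return ratio, percent_saved


# Approximate sizes from the text: ~300MB before, ~30MB after.
ratio, percent_saved = size_reduction(300.0, 30.0)
print(f"{ratio:.1f}x smaller, {percent_saved:.0f}% of the original size saved")
```

Pairing this ratio with the retrieval scores for each model gives one point per model on the efficiency-accuracy curve.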