Interpreting Contrastive Embeddings in Specific Domains with Fuzzy Rules

Javier Fumanal-Idocin, Mohammadreza Jamalifard, Javier Andreu-Perez|March 12, 2026arXiv

Key Takeaway

CLIP embeddings work well for general tasks, but you need domain-specific interpretation tools like fuzzy rules to understand and improve their performance on specialized text like medical or legal documents.

Summary

This paper shows how to interpret what CLIP embeddings learn in specific domains like medical records and film reviews. The researchers use fuzzy rules to map domain-specific features into CLIP's vector space, making it easier to understand which text features matter most for classification tasks in specialized fields.

multimodal

Key Terms

clip fuzzy-rules zero-shot-learning semantic-embedding