Sentence (Document) Embeddings
- Bag of Tricks for Efficient Text Classification - fastText paper (one of them)
- Sentence-BERT Sentence Embeddings using Siamese BERT-Networks
- sentence-transformers/all-MiniLM-L6-v2
- seems to be a popular model (have seen it around; maybe was the one used by Rafal Wilinski who made the Claude semantic retrieval interface via MCP - noted this down in Semantic Querying of Obsidian)
- a sentence-transformers model
- maps sentences & paragraphs to a 384 dimensional dense vector space and can be used for tasks like clustering or semantic search
- from Nils Reimers (now Director of Machine Learning at Cohere; creator of SBERT)