OpenAI Text Embedding Model

From GM-RKB
Jump to navigation Jump to search

An OpenAI Text Embedding Model is a text-item embedding model that is an OpenAI model.



References

2023

2022

  • https://openai.com/blog/new-and-improved-embedding-model
    • NOTES:
      • It announces a new embedding model called text-embedding-ada-002 that replaces 5 previous models. The new model outperforms previous models on text search, code search, and sentence similarity tasks.
      • The new model has a longer context length (8192 vs 2048 tokens), smaller embedding size (1536 vs 12288 dimensions), and is 99.8% cheaper than the previous davinci-001 model.
      • It simplifies the /embeddings API by merging multiple models into one that handles text search, sentence similarity, and code search well.
      • Examples are given of companies like Kalendar AI and Notion using embeddings to improve search and recommendations.
      • Limitations are that the new model doesn't outperform on text classification as much as the old text-similarity-davinci-001 model.