Back to news
Generative AI
Mar 24, 2026

New Gemini Feature Enables Direct Video Embedding for Fast Search Capabilities

Mar 24, 2026
AI Summary

Gemini Embedding 2 allows for the direct projection of raw video into a vector space, enabling efficient video search without transcription. A new command-line interface has been developed to index and search video footage using natural language queries.

  • Gemini Embedding 2 can embed raw video into a 768-dimensional vector space alongside text, eliminating the need for transcription or frame captioning.
  • Users can perform queries such as 'green car cutting me off' that directly compare to video clips at the vector level.
  • A command-line interface has been created to index hours of footage into ChromaDB, allowing for natural language searches and automatic trimming of matching clips.
  • The cost for indexing video footage is approximately $2.50 per hour, with still-frame detection reducing costs for idle segments, making it more economical for security camera footage.
video searchembeddingnatural language processingvector spacecli tool