Kind of having my mind blown by embeddings based semantic search.
I have an archive of The Economist's The Intelligence podcast transcribed using Whisper.cpp. Indexed all March'23 episodes using @simon 's #llm (https://simonwillison.net/2023/Sep/4/llm-embeddings/)
Searched the index for "Tesla" - and the first hit is the 20th Mar episode that doesn't mention the word "Tesla" even once but is all about mobility that mentions cars and electric cars several times.