The Site-Search Paradox: Why The Big Box Always Wins
26 Mar 2026
smashingmagazine.com

Success in modern UX isn’t about having the most content. It’s about having the most findable content. Yet even with more data and better tools than ever, internal search often fails, leaving users to rely on global search engines to find a single page on a local site. Why does the “Big Box” still win, and how can we bring users back?

When you type a query into a search engine, something has to decide which documents are actually relevant, and how to rank them. BM25 (Best Matching 25), the ranking function behind search systems like Elasticsearch and Lucene, has been the dominant answer to that question for decades. It scores documents by looking at three things: […]

ANN (Approximate Nearest Neighbors) is at the core of fast vector search, itself central to GenAI, especially GPT and other LLMs. My new methodology, abbreviated as PANN, has many other app…

Faiss: A library for efficient similarity search
14 Jun 2025
engineering.fb.com

In the first two parts of this series we have discussed two fundamental algorithms in information retrieval: inverted file index and…

Hierarchical Navigable Small World (HNSW) is a state-of-the-art algorithm for approximate nearest-neighbour search. Under the…

Hierarchical Navigable Small World graphs (HNSW) is an algorithm that allows for efficient nearest neighbor search, and the Sentence…

Similarity Search, Part 1: kNN & Inverted File Index
27 Jul 2023
towardsdatascience.com

Similarity search is a popular problem where, given a query Q, we need to find the documents most similar to it among all documents D.

Understand how to hash data and reflect its similarity by constructing random hyperplanes

Similarity Search, Part 7: LSH Compositions
27 Jul 2023
towardsdatascience.com

Dive into combinations of LSH functions to guarantee a more reliable search