cover image

The shift away from Google search signals a broader recalibration: marketers are chasing commerce media that can prove it converts.

cover image

When you type a query into a search engine, something has to decide which documents are actually relevant — and how to rank them. BM25 (Best Matching 25), the algorithm powering search engines like Elasticsearch and Lucene, has been the dominant answer to that question for decades.  It scores documents by looking at three things: […]

cover image

40 Google features to find exactly what you need, the alternative search engines that do things Google won't, and the reference desk framework underneath all of it.

cover image
5 competitive gates hidden inside 'rank and display'
17 Mar 2026
searchengineland.com

The annotation, recruitment, grounding, display, and won gates determine which content AI engines trust and recommend. Here’s how it works.

cover image

ANN — Approximate Nearest Neighbors —  is at the core of fast vector search, itself central to GenAI, especially GPT and LLM. My new methodology, abbreviated as PANN, has many other app…

cover image
The Psychology of AI SERPs and Shopping
29 Dec 2025
practicalecommerce.com

The rise of "zero-click" searches may signal the coming effect of AI shopping on product discovery and decision-making.

cover image

We have hundreds of easter-egg logos (featuring our friendly mascot Dax Brown) that surface when you make certain queries on our search engine. Our subreddit is trying to catch ‘em all. They’ve certainly caught a lot, currently 504, but we keep adding more so it’s a moving target. The total as of this post is 594. I’m the one personally adding them in my spare time just for fun and I recently did a

cover image

Government organizations love to distribute documents as PDF files. They are easy to forward and to print. The problem is when you want to find and access them later among millions of other files. …

cover image
How to Remove a Web Page from Google
20 Oct 2025
practicalecommerce.com

The reasons for deindexing a page haven't much changed over the years, although the tools and tactics have evolved.

cover image
Maze generation algorithm - Wikipedia
16 Sep 2025
en.m.wikipedia.org
cover image

Enough is enough: I’ve jettisoned Google in favor of a search engine that doesn’t treat me like a product.

cover image

Google DeepMind has announced a new artificial intelligence system, AlphaEarth Foundations, designed to analyze the Earth's surface by integrating vast

cover image

The Pew Research Center analysis shows how hard AI is hitting web traffic.

cover image

🔍 Search anyone's digital footprint across 300+ websites - ibnaleem/gosearch

cover image

Your complete playbook for transforming how you research with AI's most powerful search engine

From BM25 to RAG: Everything I learned about vector databases, embedding models, and vector search - and everything in between.

cover image

Forget AI. Google just created a version of its search engine free of the extra junk it has added over the past decade-plus. You just need one URL parameter.

cover image

Despite rapid growth, AI chatbots are yet to make a dent to search engines says fresh data. Take a look here for more!

cover image

Model architectures, data generation, training paradigms, and unified frameworks inspired by LLMs.

cover image

Better search results with no ads. Welcome to Kagi (pronounced kah-gee), a paid search engine that gives power back to the user.

cover image
Branded search and SEO: What you need to know
2 Mar 2025
searchengineland.com

Branded search is more than just your company name on Google. Here's how to capture intent and rank for the queries that matter.

cover image

The free resource helps you see the name associated with any number in seconds flat.

cover image

A detailed analysis of ChatGPT search and Google's performance across 62 queries, with scoring metrics and practical examples.

cover image
Common Crawl Web Graph Statistics
18 Dec 2024
commoncrawl.github.io

Visualisations and metrics from the Common Crawl Web Graph dataset

cover image

BM25 is a widely used algorithm for full text search. I wanted to understand how it works, so here is my attempt at understanding by re-explaining.

cover image

You can use images to do Google Searches too.

cover image

Perplexity is an up-and-coming AI company that has broad ambition to compete with Google in the search market by providing answers to user queries with AI as its core technology.  They’ve been…

cover image
A Roaringish phrase search algorithm
3 Jul 2024
softwaredoug.com

How phrase search works in search array by intersecting roaring-like numpy arrays.

cover image
What AI Engineers Need to Know about Search
28 Jun 2024
softwaredoug.com

All that lexical search context you need to build that RAG app

cover image

In the era of vast data, information retrieval is crucial for search engines, recommender systems, and any application that needs to find documents based on their content. The process involves three key challenges: relevance assessment, document ranking, and efficiency. The recently introduced Python library that implements the BM25 algorithm, BM25S addresses the challenge of efficient and effective information retrieval, particularly the need for ranking documents in response to user queries. The goal is to enhance the speed and memory efficiency of the BM25 algorithm, a standard method for ranking documents by their relevance to a query. Current methods for implementing

cover image

Frustrated by their Google searches, people are funneling their queries to a site that isn’t a search engine at all.

cover image

A collection of 2,311 blogs about every topic

cover image
May 2024 Crawl Archive Now Available
4 Jun 2024
commoncrawl.org

The crawl archive for May 2024 is now available. The data was crawled between May 18th and May 31st, and contains 2.7 billion web pages (or 377 TiB of uncompressed content). This is our 100th crawl!

cover image

A Google document has leaked online that aims to include thousands of APIs aiming to rank better on Google Search for ranking.

cover image

Learn what you always wish you knew about Google's algorithms.

cover image

Performance testing shows integrating Tantivy’s full-text search engine library into vector search significantly improves speed and performance.

cover image

The search engine war is heating up. ChatGPT may introduce its search engine, which will rival Google, on Monday. Although

cover image

For Perplexity, the partnership with SoundHound marks the addition of another strong distribution channel expanding the reach of its LLM-driven search capabilities.

cover image

Perplexity's AI-powered search experience challenges Google's model by delivering conversational answers, citing sources and more.

cover image

The Google Search URL parameters are important to understand whether you are maximizing the...

cover image

A deep-dive into how far fewer companies than you think have taken a hold of Google's search results.

cover image
Perplexity
1 Mar 2024
perplexity.ai

Perplexity is a free AI-powered answer engine that provides accurate, trusted, and real-time answers to any question.

cover image

In part six of our series, we'll use Litesearch, the last piece of the puzzle in LiteStack.

cover image
ChatGPT Prompts for Customer Personas
23 Jan 2024
practicalecommerce.com

Identify and target personas of keywords, competitors, Reddit discussions, and more.

cover image

Here are ways to remove webpages and online posts harmful to your brand – from privacy claims and copyright notices to legal measures.

cover image
Reddit Is a Marketing Goldmine
16 Jan 2024
practicalecommerce.com

Reddit posts reveal brand awareness, sentiment, and more. Increasingly they appear in Google search results.

cover image

Learn how indexing, algorithms, deep learning systems, human raters, click and query data, and more shape Google's Search results.

cover image

Adam Silver – interaction designer - London, UK

cover image

In the first two parts of this series we have discussed two fundamental algorithms in information retrieval: inverted file index and…

cover image

Hierarchical Navigable Small World (HNSW) is a state-of-the-art algorithm used for an approximate search of nearest neighbours. Under the…

cover image

Hierarchical Navigable Small World graphs (HNSW) is an algorithm that allows for efficient nearest neighbor search, and the Sentence…

cover image
Similarity Search, Part 1: kNN & Inverted File Index
27 Jul 2023
towardsdatascience.com

Similarity search is a popular problem where given a query Q we need to find the most similar documents to it among all the documents D.

cover image
Similarity Search, Part 2: Product Quantization
27 Jul 2023
towardsdatascience.com

Learn a powerful technique to effectively compress large data

cover image

Explore how similarity information can be incorporated into hash function

cover image

Understand how to hash data and reflect its similarity by constructing random hyperplanes

cover image
Similarity Search, Part 7: LSH Compositions
27 Jul 2023
towardsdatascience.com

Dive into combinations of LSH functions to guarantee a more reliable search

Thanks to gems, it is easy to implement a search engine into Rails applications. Of course, you have to choose which gem to use. While there are countless options, four stand out as the best. You will be happy with any of these options.

cover image

Learn what vector search is and the metrics pertinent to decide the distance (or similarity) between objects.

cover image

Learn about designing advanced search features. Explore key elements of search UI and build a user-friendly search input.

cover image

The Similarity Engine's use cases include item-to-item similarity for text and image modality and user-to-item personalized recommendations based on a user’s historical behavior data.

cover image

Try those 9 techniques and improve discoverability in your product. Make your users happy and create a smooth ux for them.

cover image

General Partner Connie Chan on how leading brands are using AI and other technology to combine the serendipitous discovery of offline shopping with the infinite options of online shopping. Today, most of the Western world revolves around search-based online commerce. This means that most shoppers type directly what they want into a store search bar,...

A different approach to fuzzy finding
19 Mar 2023
nathancraddock.com
cover image

SEOs have already started analyzing Yandex's search ranking factors, which include PageRank and several other link-related factors

cover image
DuckDuckGo Search Syntax | DuckDuckGo Help Pages
26 Jan 2023
help.duckduckgo.com

Learn how to use syntax on DuckDuckGo Private Search to get the search results you want.

cover image

Open-source vector database built for GenAI applications. Install with pip, perform high-speed searches, and scale to tens of billions of vectors.

cover image

Whitepages | Find accurate phone numbers, addresses and emails from the most trusted U.S. white pages phone directory and address lookup since 1997.

cover image

Over the past three years Pinterest has experimented with several visual search and recommendation services, including Related Pins (2014), Similar Looks (2015), Flashlight (2016) and Lens (2017)....

cover image

Pure python implementation of product quantization for nearest neighbor search - matsui528/nanopq

cover image
Product Quantization for Similarity Search
14 Oct 2022
towardsdatascience.com

How to compress and fit a humongous set of vectors in memory for similarity search with asymmetric distance computation (ADC)

cover image

The best indexing approach for billion-sized vector datasets

cover image
Similarity Search with IVFPQ
14 Oct 2022
towardsdatascience.com

Find out how the inverted file index (IVF) is implemented alongside product quantization (PQ) for a fast and efficient approximate nearest…

cover image

Efficient vector quantization for machine learning optimizations (eps. vector quantized variational autoencoders), better than straight…

cover image

Gain powerful insights to inform your marketing efforts. Use the following advanced Google search operators and commands to your advantage.

cover image

Semantics = theory of meaning, yet most define semantic search with a focus on intent. “Meaning” is not the same as “intention.” Learn more.

cover image
25 Free Tools to Test Your Website
27 Sep 2022
practicalecommerce.com

This all-new update to our popular resource includes tools to evaluate page speed, security, accessibility, regulatory compliance, code, and more.

cover image

TikTok won’t kill Google yet, but it’s a new and fun way to think about search.

cover image

Need to find a restaurant or figure out how to do something? Young people are turning to TikTok to search for answers. Google has noticed.

cover image

530 votes, 63 comments. My co-founder and I, a senior Amazon research scientist and AWS SDE respectively, launched Marqo a little over a week ago - a…

YaCy – your own search engine | Hacker News
18 Sep 2022
news.ycombinator.com
cover image
40,000 Recipes for Murder
10 Sep 2022
wnycstudios.org

Two scientists inadvertently open the Pandora’s Box of WMDs. What now?

cover image
How to Search in Less Command
8 Sep 2022
linuxhandbook.com

The less command is excellent for reading large text files. It also allows you to search for text in it. Here's what you need to know about searching in less.

cover image

A detailed primer on Roaring bitmaps explaining what they are, how they're different from traditional bitmaps, and how they work internally.

cover image
Build a Custom Search Engine
9 Aug 2022
dev.to

Build a personalised search engine with Google's search API. Just a heads up that this is not...

Neil Patel's Digital Marketing Blog
19 Jul 2022
blog.kissmetrics.com

Your #1 resource for digital marketing tips, trends, and strategy to help you build a successful online business.

cover image

Trademarkia provides a free, fast, user friendly search of USPTO registered trademarks. Also, with trademark registration you can protect your brand in 180+ countries. Fast and easy. Starts at $99.

cover image

They're super powerful.

cover image
SEO How-to, Part 2: Understanding Search Engines
18 Jul 2022
practicalecommerce.com

Nearly half of shoppers turn to Google before deciding what to buy and where to buy it. Understanding how Google and other search engines work can help you choose which optimization strategies to apply. This post is the second installment in my "SEO How-to" series.

cover image

I watched online as a college classmate went from disgrace to redemption in months. That’s when I found myself deep in the world of black-ops reputation management.

cover image

If you're finding performance bottlenecks with full-text search in your database, it may be time to switch to Elasticsearch. In this tutorial, Ianis introduces Elasticsearch and shows us how to implement an efficient ...

cover image

Without driving yourself crazy

cover image

Way back in November of 2003, when I was a much younger man and the world had yet to fall head over heels in love with Google, I wrote a post called The Database of Intentions. It was an attempt to…

cover image
The Personal Brand Is Dead
24 Jun 2022
theatlantic.com

Gen Z would rather be anonymous online.

cover image
Hacker News
23 Jun 2022
github.com

Educational material to learn about Goggles and how to create your own.

cover image

Brave Search, the browser developer's privacy-centric Internet search engine, is celebrating its first anniversary after surpassing 2.5 billion queries and seeing almost 5,000% growth in a year.

cover image

PimEyes is a paid service that finds photos of a person from across the internet, including some the person may not want exposed. “We’re just a tool provider,” its owner said.

cover image
Similarity-Based Image Search for Visual Art
28 May 2022
towardsdatascience.com

Evaluating similarity of visual art from both human perceptual & quantitative judgments

cover image
The Future of Search Is Boutique | Future
18 May 2022
future.a16z.com

The way to improve search is not to mimic Google, but instead to build boutique search engines that index, curate, and organize things in new ways.

cover image

Reddit announced it's rolling out the ability to search comments, alongside a few other search-related features.

cover image

I analyzed thousands of searches by people who were diagnosed with cancer. Their queries offer valuable lessons that could improve the way doctors treat patients.

cover image
7 things I wish every search box did
23 Jan 2022
blog.intercom.com

It’s one thing to say “let’s have search” and draw a box with a magnifying glass on the right. It’s a whole other task to implement good search.

cover image

This article shows how to optimize a Full Text Search implementation with Rails and PostgreSQL, taking a single query from 130ms to 7ms.

cover image
Forestry.io CMS
17 Jan 2022
forestry.io

The Forestry.io team is now focused on building TinaCMS. If you wish to migrate your Forestry site to Tina, follow the guide below.

GitHub Awesome Search
17 Jan 2022
github.motakasoft.com
cover image

Reliably and securely take data from any source, in any format, then search, analyze, and visualize it in real time.

cover image

Making a static HTML website have dynamic search

cover image

Do websites created with reactive frameworks get indexed by Google and other search engines? Is it compulsory to set up pre-rendering, as your SEO consultants suggest? Or are they wrong? In this article, Paolo Mioni will talk mostly about Vue.js, since it is the framework he’s used most, and with which he has direct experiences in terms of indexing by the search engines on major projects, but most of what will be covered is valid for other frameworks, too.

cover image

Be a strategic thinker by recognizing opportunities at scale with seemingly small and insignificant data.

Curated SEO Tools
11 Dec 2021
curatedseotools.com

Curated SEO Tools: Best SEO Tools for Marketers

We build and maintain an open repository of web crawl data that can be accessed and analyzed by anyone.

cover image

The social media site is the bane of non-users’ lives, hijacking image search results with a pushy sign-up screen. It needs to stop.

cover image
How to build a search engine with Ruby on Rails
1 Oct 2021
blog.testdouble.com

Implementing search in your Rails app can be vexing. Here's a great pattern to use that combines the best parts of ActiveRecord and Postgres.

cover image

A concise overview of Elasticsearch concepts and principles

cover image

What would a totally new search engine architecture look like? Who better than Julien Lemoine, Co-founder & CTO of Algolia, to describe what the future of search will look like. This is the first article in a series. Search engines, and more generally, information retrieval systems, play a central role in

Hora | Hora Search Everywhere
1 Aug 2021
horasearch.com
cover image

This is part 3 of a series on bot programming originally published on the Coder One blog. Part 1:...

cover image

Think building a booking system is not your cup of tea? I bet, you will change your mind after...

cover image

We are excited to announce that this year’s NeurIPS 2021 Conference will host a first-of-its-kind competition in large scale approximate…

cover image

From the release of the page experience algorithm, there is no longer any preferential treatment for AMP in Google’s search results, Top Stories carousel and the Google News.

cover image

A cloud-native vector database, storage for next generation AI applications - milvus-io/milvus

cover image

Let's be clear about something right at the start: If you're not optimizing your site search to convert more visitors into buyers, you're missing out on

cover image

Googling is one of the most important skills for every developer. Let me show you how to get better at Googling.

cover image

Elasticsearch is your ticket to a better website search. Learn how fast, relevant search improves customer experience and website performance.

cover image

A Gentle Guide to how Beam Search enhances predictions, in Plain English

cover image
Evaluating Search Algorithms
2 Apr 2021
shopify.engineering

The three-step framework Shopify's Data Science & Engineering team built for evaluating new search algorithms.

Full-text search is everywhere. From finding a book on Scribd, a movie on Netflix, toilet paper on Amazon, or anything else on the web through Google (like [how to do your job as a software engineer](https://localghost.dev/2019/09/everything-i-googled-in-a-week-as-a-professional-software-engineer/)), you've searched vast amounts of unstructured data multiple times today. What's even more amazing, is that you've even though you searched millions (or [billions](https://www.worldwidewebsize.com/)) of records, you got a response in milliseconds. In this post, we are going to build a basic full-text search engine that can search across millions of documents and rank them according to their relevance to the query in milliseconds, in less than 150 lines of code!

cover image

Neeva is currently in beta with several thousand users.

cover image

Speech and natural language processing (NLP) have become the foundation for most of the AI development in the enterprise today, as textual data represents a significant portion of unstructured content.

Free People Search | WhitePages
25 Dec 2020
whitepages.com
cover image
How come Algolia is so Fast?
18 Dec 2020
algolia.com

We created Algolia to answer the shortcomings of database full text search. It's a SaaS API dedicated to solving app and web developers' struggles.

tl;dr: Use advanced Google Search to find any webpage, emails, info, or secrets cost: $0 time: 2 minutes Software engineers have long joked about how much of their job is simply Googling things Now you can do the same, but for free Below, I'll cover dorking, the use of …

judicial search
29 Nov 2020
judyrecords.com

Instantly search 740 million+ United States court cases.

cover image

Introduction A simple search form is great, but one with advanced search options can have...

cover image

In Part 1 of this series, we introduced the concept of embedding vectors. In Part 2, we discussed how embedding vectors can be used in…

cover image
DuckDuckGo is growing fast
3 Nov 2020
bleepingcomputer.com

DuckDuckGo, the privacy-focused search engine, announced that August 2020 ended in over 2 billion total searches via its search platform.

cover image

A brief introduction to Elasticsearch

cover image
Part 7: Fast Pattern Searching with STUMPY
3 Nov 2020
towardsdatascience.com

Finding Similar Subsequences for Known Patterns

Apache Tika – Apache Tika
15 Sep 2020
tika.apache.org

Delivering accurate insights is the core function of any data scientist. Navigating the development road toward this goal can sometimes be tricky, especially when cross-collaboration is required, and these lessons learned from building a search application will help you negotiate the demands between accuracy and speed.

cover image

Reverse image search is one of the most well-known and easiest digital investigative techniques, with two-click functionality of choosing “Search Google for image” in many web browsers. This method has also seen widespread use in popular culture, perhaps most notably in the MTV show Catfish, which exposes people in online relationships who use stolen photographs […]

cover image
Nine voice search stats to close out 2019
19 Feb 2020
searchengineland.com

A look back at some of the year's key voice search and virtual assistant metrics.

cover image

2.1K votes, 110 comments. 1.3M subscribers in the Python community. The official Python community for Reddit! Stay up to date with the latest news…

cover image

Effortlessly test Elasticsearch queries using this test environment, including test data and terrific GUIs.

All | Search powered by Algolia
23 Dec 2019
hn.algolia.com

Hacker News Search, millions articles and comments at your fingertips.

cover image

A whirlwind tour of the big ideas powering our web search

cover image

Learn how to create and delete indexes, how to load data in them and perform basic queries.

cover image

Google, Microsoft, et al continue to perfect their search engines – but too often search is not enough. The watchword

cover image

Discover a basic model for adding Programmable Search Engine elements to your web page.

cover image

The tech giant doesn’t have to be dismantled. Sharing its crown jewel might reshape the internet.

cover image

Mixnode allows you to execute SQL against the web.

cover image
10 Hidden URLs To Help You Rule The Web
23 Sep 2018
gizmodo.com.au

Dive into cutting-edge tech, reviews and the latest trends with the expert team at Gizmodo. Your ultimate source for all things tech.

cover image

The fast-fashion retailer has debuted, and quickly expanded, an AI-based visual search and navigation tool for its mobile and e-commerce business.

cover image

Build an 'auto-tagging' image search service using Algolia and Google Cloud's Vision API

HN Search powered by Algolia
5 May 2018
hn.algolia.com

Hacker News Search, millions articles and comments at your fingertips.

cover image

In today’s digital age, information is more accessible than ever before. Open Source Intelligence (OSINT) collects and analyzes publicly…

cover image

Unpack the key features and marketing insights of SearchGPT, OpenAI’s innovative search tool and its potential to rival Google’s dominance.