search

cover image

Forget AI. Google just created a version of its search engine free of the extra junk it has added over the past decade-plus. You just need one URL parameter.

cover image

Despite rapid growth, AI chatbots are yet to make a dent to search engines says fresh data. Take a look here for more!

cover image

Model architectures, data generation, training paradigms, and unified frameworks inspired by LLMs.

cover image

Better search results with no ads. Welcome to Kagi (pronounced kah-gee), a paid search engine that gives power back to the user.

cover image

Branded search is more than just your company name on Google. Here's how to capture intent and rank for the queries that matter.

cover image

A detailed analysis of ChatGPT search and Google's performance across 62 queries, with scoring metrics and practical examples.

cover image

The free resource helps you see the name associated with any number in seconds flat.

cover image

Visualisations and metrics from the Common Crawl Web Graph dataset

cover image

BM25 is a widely used algorithm for full text search. I wanted to understand how it works, so here is my attempt at understanding by re-explaining.

cover image

You can use images to do Google Searches too.

cover image

Unpack the key features and marketing insights of SearchGPT, OpenAI’s innovative search tool and its potential to rival Google’s dominance.

cover image

In today’s digital age, information is more accessible than ever before. Open Source Intelligence (OSINT) collects and analyzes publicly…

cover image

Perplexity is an up-and-coming AI company that has broad ambition to compete with Google in the search market by providing answers to user queries with AI as its core technology.  They’ve been…

cover image

How phrase search works in search array by intersecting roaring-like numpy arrays.

cover image

All that lexical search context you need to build that RAG app

cover image

In the era of vast data, information retrieval is crucial for search engines, recommender systems, and any application that needs to find documents based on their content. The process involves three key challenges: relevance assessment, document ranking, and efficiency. The recently introduced Python library that implements the BM25 algorithm, BM25S addresses the challenge of efficient and effective information retrieval, particularly the need for ranking documents in response to user queries. The goal is to enhance the speed and memory efficiency of the BM25 algorithm, a standard method for ranking documents by their relevance to a query. Current methods for implementing

cover image

Frustrated by their Google searches, people are funneling their queries to a site that isn’t a search engine at all.

cover image

A collection of 2,311 blogs about every topic

cover image

The crawl archive for May 2024 is now available. The data was crawled between May 18th and May 31st, and contains 2.7 billion web pages (or 377 TiB of uncompressed content). This is our 100th crawl!

cover image

A Google document has leaked online that aims to include thousands of APIs aiming to rank better on Google Search for ranking.

cover image

Learn what you always wish you knew about Google's algorithms.

cover image

Performance testing shows integrating Tantivy’s full-text search engine library into vector search significantly improves speed and performance.

cover image

The search engine war is heating up. ChatGPT may introduce its search engine, which will rival Google, on Monday. Although

cover image

For Perplexity, the partnership with SoundHound marks the addition of another strong distribution channel expanding the reach of its LLM-driven search capabilities.

cover image

Perplexity's AI-powered search experience challenges Google's model by delivering conversational answers, citing sources and more.

cover image

The Google Search URL parameters are important to understand whether you are maximizing the...

cover image

A deep-dive into how far fewer companies than you think have taken a hold of Google's search results.

cover image

Perplexity is a free AI-powered answer engine that provides accurate, trusted, and real-time answers to any question.

cover image

In part six of our series, we'll use Litesearch, the last piece of the puzzle in LiteStack.

cover image

Identify and target personas of keywords, competitors, Reddit discussions, and more.

cover image

Here are ways to remove webpages and online posts harmful to your brand – from privacy claims and copyright notices to legal measures.

cover image

Reddit posts reveal brand awareness, sentiment, and more. Increasingly they appear in Google search results.

cover image

Learn how indexing, algorithms, deep learning systems, human raters, click and query data, and more shape Google's Search results.

cover image

Adam Silver – interaction designer - London, UK

cover image

Hierarchical Navigable Small World (HNSW) is a state-of-the-art algorithm used for an approximate search of nearest neighbours. Under the…

cover image

In the first two parts of this series we have discussed two fundamental algorithms in information retrieval: inverted file index and…

cover image

Hierarchical Navigable Small World graphs (HNSW) is an algorithm that allows for efficient nearest neighbor search, and the Sentence…

cover image

Similarity search is a popular problem where given a query Q we need to find the most similar documents to it among all the documents D.

cover image

Learn a powerful technique to effectively compress large data

cover image

Explore how similarity information can be incorporated into hash function

cover image

Understand how to hash data and reflect its similarity by constructing random hyperplanes

cover image

Dive into combinations of LSH functions to guarantee a more reliable search

Thanks to gems, it is easy to implement a search engine into Rails applications. Of course, you have to choose which gem to use. While there are countless options, four stand out as the best. You will be happy with any of these options.

cover image

Learn what vector search is and the metrics pertinent to decide the distance (or similarity) between objects.

cover image

Learn about designing advanced search features. Explore key elements of search UI and build a user-friendly search input.

cover image

The Similarity Engine's use cases include item-to-item similarity for text and image modality and user-to-item personalized recommendations based on a user’s historical behavior data.

cover image

Try those 9 techniques and improve discoverability in your product. Make your users happy and create a smooth ux for them.

cover image

General Partner Connie Chan on how leading brands are using AI and other technology to combine the serendipitous discovery of offline shopping with the infinite options of online shopping. Today, most of the Western world revolves around search-based online commerce. This means that most shoppers type directly what they want into a store search bar,...

cover image

SEOs have already started analyzing Yandex's search ranking factors, which include PageRank and several other link-related factors

cover image

Learn how to use syntax on DuckDuckGo Private Search to get the search results you want.

cover image

Open-source vector database built for GenAI applications. Install with pip, perform high-speed searches, and scale to tens of billions of vectors.

cover image

Whitepages | Find accurate phone numbers, addresses and emails from the most trusted U.S. white pages phone directory and address lookup since 1997.

cover image

Over the past three years Pinterest has experimented with several visual search and recommendation services, including Related Pins (2014), Similar Looks (2015), Flashlight (2016) and Lens (2017)....

cover image

Pure python implementation of product quantization for nearest neighbor search - matsui528/nanopq

cover image

How to compress and fit a humongous set of vectors in memory for similarity search with asymmetric distance computation (ADC)

cover image

The best indexing approach for billion-sized vector datasets

cover image

Find out how the inverted file index (IVF) is implemented alongside product quantization (PQ) for a fast and efficient approximate nearest…

cover image

Efficient vector quantization for machine learning optimizations (eps. vector quantized variational autoencoders), better than straight…

cover image

Gain powerful insights to inform your marketing efforts. Use the following advanced Google search operators and commands to your advantage.

cover image

Semantics = theory of meaning, yet most define semantic search with a focus on intent. “Meaning” is not the same as “intention.” Learn more.

cover image

This all-new update to our popular resource includes tools to evaluate page speed, security, accessibility, regulatory compliance, code, and more.

cover image

TikTok won’t kill Google yet, but it’s a new and fun way to think about search.

cover image

Need to find a restaurant or figure out how to do something? Young people are turning to TikTok to search for answers. Google has noticed.

cover image

530 votes, 63 comments. My co-founder and I, a senior Amazon research scientist and AWS SDE respectively, launched Marqo a little over a week ago - a…

cover image

Two scientists inadvertently open the Pandora’s Box of WMDs. What now?

cover image

The less command is excellent for reading large text files. It also allows you to search for text in it. Here's what you need to know about searching in less.

cover image

A detailed primer on Roaring bitmaps explaining what they are, how they're different from traditional bitmaps, and how they work internally.

cover image

Build a personalised search engine with Google's search API. Just a heads up that this is not...

Your #1 resource for digital marketing tips, trends, and strategy to help you build a successful online business.

cover image

Trademarkia provides a free, fast, user friendly search of USPTO registered trademarks. Also, with trademark registration you can protect your brand in 180+ countries. Fast and easy. Starts at $99.

cover image

Nearly half of shoppers turn to Google before deciding what to buy and where to buy it. Understanding how Google and other search engines work can help you choose which optimization strategies to apply. This post is the second installment in my "SEO How-to" series.

cover image

I watched online as a college classmate went from disgrace to redemption in months. That’s when I found myself deep in the world of black-ops reputation management.

cover image

If you're finding performance bottlenecks with full-text search in your database, it may be time to switch to Elasticsearch. In this tutorial, Ianis introduces Elasticsearch and shows us how to implement an efficient ...

cover image

Way back in November of 2003, when I was a much younger man and the world had yet to fall head over heels in love with Google, I wrote a post called The Database of Intentions. It was an attempt to…

cover image

Gen Z would rather be anonymous online.

cover image

search.com or Google: Why we suck at naming products and companies (PCA13) - Download as a PDF or view online for free

cover image

Educational material to learn about Goggles and how to create your own.

cover image

Brave Search, the browser developer's privacy-centric Internet search engine, is celebrating its first anniversary after surpassing 2.5 billion queries and seeing almost 5,000% growth in a year.

cover image

PimEyes is a paid service that finds photos of a person from across the internet, including some the person may not want exposed. “We’re just a tool provider,” its owner said.

cover image

Evaluating similarity of visual art from both human perceptual & quantitative judgments

cover image

The way to improve search is not to mimic Google, but instead to build boutique search engines that index, curate, and organize things in new ways.

cover image

Reddit announced it's rolling out the ability to search comments, alongside a few other search-related features.

cover image

I analyzed thousands of searches by people who were diagnosed with cancer. Their queries offer valuable lessons that could improve the way doctors treat patients.

cover image

It’s one thing to say “let’s have search” and draw a box with a magnifying glass on the right. It’s a whole other task to implement good search.

cover image

This article shows how to optimize a Full Text Search implementation with Rails and PostgreSQL, taking a single query from 130ms to 7ms.

cover image

The Forestry.io team is now focused on building TinaCMS. If you wish to migrate your Forestry site to Tina, follow the guide below.

cover image

Reliably and securely take data from any source, in any format, then search, analyze, and visualize it in real time.

cover image

Making a static HTML website have dynamic search

cover image

Do websites created with reactive frameworks get indexed by Google and other search engines? Is it compulsory to set up pre-rendering, as your SEO consultants suggest? Or are they wrong? In this article, Paolo Mioni will talk mostly about Vue.js, since it is the framework he’s used most, and with which he has direct experiences in terms of indexing by the search engines on major projects, but most of what will be covered is valid for other frameworks, too.

cover image

Be a strategic thinker by recognizing opportunities at scale with seemingly small and insignificant data.

Curated SEO Tools: Best SEO Tools for Marketers

We build and maintain an open repository of web crawl data that can be accessed and analyzed by anyone.

cover image

The social media site is the bane of non-users’ lives, hijacking image search results with a pushy sign-up screen. It needs to stop.

cover image

Implementing search in your Rails app can be vexing. Here's a great pattern to use that combines the best parts of ActiveRecord and Postgres.

cover image

A concise overview of Elasticsearch concepts and principles

cover image

What would a totally new search engine architecture look like? Who better than Julien Lemoine, Co-founder & CTO of Algolia, to describe what the future of search will look like. This is the first article in a series. Search engines, and more generally, information retrieval systems, play a central role in

cover image

This is part 3 of a series on bot programming originally published on the Coder One blog. Part 1:...

cover image

Think building a booking system is not your cup of tea? I bet, you will change your mind after...

cover image

We are excited to announce that this year’s NeurIPS 2021 Conference will host a first-of-its-kind competition in large scale approximate…

cover image

From the release of the page experience algorithm, there is no longer any preferential treatment for AMP in Google’s search results, Top Stories carousel and the Google News.

cover image

A cloud-native vector database, storage for next generation AI applications - milvus-io/milvus

cover image

Let's be clear about something right at the start: If you're not optimizing your site search to convert more visitors into buyers, you're missing out on

cover image

Googling is one of the most important skills for every developer. Let me show you how to get better at Googling.

cover image

Elasticsearch is your ticket to a better website search. Learn how fast, relevant search improves customer experience and website performance.

cover image

A Gentle Guide to how Beam Search enhances predictions, in Plain English

cover image

The three-step framework Shopify's Data Science & Engineering team built for evaluating new search algorithms.

Full-text search is everywhere. From finding a book on Scribd, a movie on Netflix, toilet paper on Amazon, or anything else on the web through Google (like [how to do your job as a software engineer](https://localghost.dev/2019/09/everything-i-googled-in-a-week-as-a-professional-software-engineer/)), you've searched vast amounts of unstructured data multiple times today. What's even more amazing, is that you've even though you searched millions (or [billions](https://www.worldwidewebsize.com/)) of records, you got a response in milliseconds. In this post, we are going to build a basic full-text search engine that can search across millions of documents and rank them according to their relevance to the query in milliseconds, in less than 150 lines of code!

cover image

Neeva is currently in beta with several thousand users.

cover image

Speech and natural language processing (NLP) have become the foundation for most of the AI development in the enterprise today, as textual data represents a significant portion of unstructured content.

cover image

We created Algolia to answer the shortcomings of database full text search. It's a SaaS API dedicated to solving app and web developers' struggles.

tl;dr: Use advanced Google Search to find any webpage, emails, info, or secrets cost: $0 time: 2 minutes Software engineers have long joked about how much of their job is simply Googling things Now you can do the same, but for free Below, I'll cover dorking, the use of …

cover image

Introduction A simple search form is great, but one with advanced search options can have...

cover image

In Part 1 of this series, we introduced the concept of embedding vectors. In Part 2, we discussed how embedding vectors can be used in…

cover image

DuckDuckGo, the privacy-focused search engine, announced that August 2020 ended in over 2 billion total searches via its search platform.

cover image

A brief introduction to Elasticsearch

cover image

Finding Similar Subsequences for Known Patterns

Delivering accurate insights is the core function of any data scientist. Navigating the development road toward this goal can sometimes be tricky, especially when cross-collaboration is required, and these lessons learned from building a search application will help you negotiate the demands between accuracy and speed.

cover image

Reverse image search is one of the most well-known and easiest digital investigative techniques, with two-click functionality of choosing “Search Google for image” in many web browsers. This method has also seen widespread use in popular culture, perhaps most notably in the MTV show Catfish, which exposes people in online relationships who use stolen photographs […]

cover image

A look back at some of the year's key voice search and virtual assistant metrics.

cover image

2.1K votes, 110 comments. 1.3M subscribers in the Python community. The official Python community for Reddit! Stay up to date with the latest news…

cover image

Learn how to create and delete indexes, how to load data in them and perform basic queries.

Hacker News Search, millions articles and comments at your fingertips.

cover image

A whirlwind tour of the big ideas powering our web search

cover image

Effortlessly test Elasticsearch queries using this test environment, including test data and terrific GUIs.

cover image

Google, Microsoft, et al continue to perfect their search engines – but too often search is not enough. The watchword

cover image

Discover a basic model for adding Programmable Search Engine elements to your web page.

cover image

The tech giant doesn’t have to be dismantled. Sharing its crown jewel might reshape the internet.

cover image

Dive into cutting-edge tech, reviews and the latest trends with the expert team at Gizmodo. Your ultimate source for all things tech.

cover image

The fast-fashion retailer has debuted, and quickly expanded, an AI-based visual search and navigation tool for its mobile and e-commerce business.

cover image

Build an 'auto-tagging' image search service using Algolia and Google Cloud's Vision API

Hacker News Search, millions articles and comments at your fingertips.