llm-rag

How BM25 and RAG Retrieve Information Differently? - MarkTechPost

23 Mar 2026

marktechpost.com

When you type a query into a search engine, something has to decide which documents are actually relevant — and how to rank them. BM25 (Best Matching 25), the algorithm powering search engines like Elasticsearch and Lucene, has been the dominant answer to that question for decades. It scores documents by looking at three things: […]

The Critical Detail About AI That Separates Good Investments from Expensive Mistakes

20 Feb 2026

linkedin.com

You've heard the pitch. "We're going to add AI to the platform.

Comparing Memory Systems for LLM Agents: Vector, Graph, and Event Logs

10 Nov 2025

marktechpost.com

Learn how different memory systems affect multi-agent planning. Comparing Memory Systems for LLM Agents highlights key performance metrics.

Shirin Khosravi Jam on Substack

13 Jul 2025

substack.com

I taught myself how to build RAG + AI Agents in production. Been running them live for over a year now. Here are 4 steps + the only resources you really need to do the same. … Ugly truth: most “AI Engineers” shouting on social media haven’t built a single real production AI Agent or RAG system. If you want to be different - actually build and ship these systems: here’s a laser-focused roadmap from my own journey. .. 🚀 𝗦𝘁𝗮𝗿𝘁 𝘄𝗶𝘁𝗵 𝗳𝘂𝗻𝗱𝗮𝗺𝗲𝗻𝘁𝗮𝗹𝘀 Because no matter how fast LLM/GenAI evolves, your ML & software foundations keep you relevant. ✅ Hands-On ML with TensorFlow & Keras: https://lnkd.in/dWrf5pbS ✅ ISLR: https://lnkd.in/djGPVVwJ ✅ Machine Learning for Beginners by Microsoft (free curriculum): https://lnkd.in/d8kZA3es … 1️⃣ 𝗠𝗮𝘀𝘁𝗲𝗿 𝗟𝗟𝗠𝘀 & 𝗚𝗲𝗻𝗔𝗜 𝗦𝘆𝘀𝘁𝗲𝗺𝘀 → Learn to build & deploy LLMs, understand system design tradeoffs, and handle real constraints. 📚 Must-reads: ✅ Designing ML Systems – Chip Huyen: https://lnkd.in/guN-UhXA ✅ The LLM Engineering Handbook – Iusztin & Labonne: https://lnkd.in/gyA4vFXz ✅ Build a LLM (From Scratch) – Raschka: https://lnkd.in/gXNa-SPb ✅ Hands-On LLMs GitHub: https://lnkd.in/eV4qrgNW … 2️⃣ 𝗚𝗼 𝗯𝗲𝘆𝗼𝗻𝗱 𝘁𝗵𝗲 𝗵𝘆𝗽𝗲 𝗼𝗻 𝗔𝗜 𝗔𝗴𝗲𝗻𝘁𝘀 → Most demos = “if user says hello, return hello.” Actual agents? Handle memory, tools, workflows, costs. ✅ AI Agents for Beginners (GitHub): https://lnkd.in/eik2btmq ✅ GenAI Agents – build step by step: https://lnkd.in/dnhwk75V ✅ OpenAI’s guide to agents: https://lnkd.in/guRfXsFK ✅ Anthropic’s Building Effective Agents: https://lnkd.in/gRWKANS4 … 3️⃣ 𝗥𝗔𝗚 𝗶𝘀 𝗻𝗼𝘁 𝗷𝘂𝘀𝘁 𝗮 𝘃𝗲𝗰𝘁𝗼𝗿 𝗗𝗕 Real Retrieval-Augmented Generation requires: → Chunking, hybrid BM25 + vectors, reranking → Query routing & fallback → Evaluating retrieval quality, not just LLM output ✅ RAG Techniques repo: https://lnkd.in/dD4S8Cq2 ✅ Advanced RAG: https://lnkd.in/g2ZHwZ3w ✅ Cost-efficient retrieval with Postgres/OpenSearch/Qdrant ✅ Monitoring with Langfuse / Comet … 4️⃣ 𝗚𝗲𝘁 𝘀𝗲𝗿𝗶𝗼𝘂𝘀 𝗼𝗻 𝗦𝗼𝗳𝘁𝘄𝗮𝗿𝗲 & 𝗜𝗻𝗳𝗿𝗮 → FastAPI, async Python, Pydantic → Docker, CI/CD, blue-green deploys → ETL orchestration (Airflow, Step Functions) → Logs + metrics (CloudWatch, Prometheus) ✅ Move to production: https://lnkd.in/dnnkrJbE ✅ Made with ML (full ML+infra): https://lnkd.in/e-XQwXqS ✅ AWS GenAI path: https://lnkd.in/dmhR3uPc … 5️⃣ 𝗪𝗵𝗲𝗿𝗲 𝗱𝗼 𝗜 𝗹𝗲𝗮𝗿𝗻 𝗳𝗿𝗼𝗺? → Stanford CS336 / CS236 / CS229 (Google it) → MIT 6.S191, Karpathy’s Zero to Hero: https://lnkd.in/dT7vqqQ5 → Google Kaggle GenAI sprint: https://lnkd.in/ga5X7tVJ → NVIDIA’s end-to-end LLM stack: https://lnkd.in/gCtDnhni → DeepLearning.AI’s short courses: https://lnkd.in/gAYmJqS6 … 💥 𝗞𝗲𝗲𝗽 𝗶𝘁 𝗿𝗲𝗮𝗹: Don’t fall for “built in 5 min, dead in 10 min” demos. In prod, it’s about latency, cost, maintainability, guardrails. ♻️ Let's repost to help more people on this journey 💚

Building a RAG Application Using LlamaIndex - KDnuggets

21 Apr 2025

kdnuggets.com

Enhance language models with real-time document retrieval and dynamic knowledge integration using retrieval-augmented generation and LlamaIndex.

How to Build a Graph RAG App

30 Dec 2024

towardsdatascience.com

Using knowledge graphs and AI to retrieve, filter, and summarize medical journal articles

RAG: Still Relevant in the Era of Long Context Models

21 May 2024

thenewstack.io

While RAG will remain a staple of production applications, Gemini 1.5 Pro and similar models will help enterprise data science teams.

Meet RAGFlow: An Open-Source RAG (Retrieval-Augmented Generation) Engine Ba

8 Apr 2024

marktechpost.com

In the ever-evolving landscape of artificial intelligence, businesses face the perpetual challenge of harnessing vast amounts of unstructured data. Meet RAGFlow, a groundbreaking open-source AI project that promises to revolutionize how companies extract insights and answer complex queries with an unprecedented level of truthfulness and accuracy. What Sets RAGFlow Apart RAGFlow is an innovative engine that leverages Retrieval-Augmented Generation (RAG) technology to provide a powerful solution for information retrieval. Unlike traditional keyword searches, RAGFlow combines large language models (LLMs) with deep document understanding to extract relevant information from a vast amount of data. Intelligent template-based chunking and visualized text chunking

RAG Step-by-Step: Open Source Edition

8 Apr 2024

dev.to

In a previous post, I demonstrated how to implement RAG using the Load-Transform-Embed-Store...

Top 9 Different Types of Retrieval-Augmented Generation (RAGs)

24 Sep 2014

marktechpost.com

Retrieval-Augmented Generation (RAG) is a machine learning framework that combines the advantages of both retrieval-based and generation-based models. The RAG framework is highly regarded for its ability to handle large amounts of information and produce coherent, contextually accurate responses. It leverages external data sources by retrieving relevant documents or facts and then generating an answer or output based on the retrieved information and the user query. This blend of retrieval and generation leads to better-informed outputs that are more accurate and comprehensive than models that rely solely on generation. The evolution of RAG has led to various types and approaches,

Building a Simple RAG Application Using LlamaIndex - MachineLearningMastery.com

24 Aug 2014

machinelearningmastery.com

[caption align=

llm-rag — my Raindrop.io articles