LLM articles

  • 2025-04-20   To Make Language Models Work Better, Researchers Sidestep...   (www.quantamagazine.org)
  • 2025-04-18   OpenAI Releases a Practical Guide to Building LLM Agents ...   (www.marktechpost.com)
  • 2025-04-17   LLM Post-Training: A Deep Dive into Reasoning Large Langu...   (arxiv.org)
  • 2025-04-16   How To Build An Agent | Amp   (ampcode.com)
  • 2025-04-13   humanlayer/12-factor-agents   (github.com)
  • 2025-04-11   12-factor-agents: Principles to build LLM-powered softwar...   (lobste.rs)
  • 2025-04-08   The “S” in MCP Stands for Security - Elena Cross - Medium   (elenacross7.medium.com)
  • 2025-04-07   Topic 33: Slim Attention, KArAt, XAttention and Multi-Tok...   (huggingface.co)
  • 2025-04-06   The Llama 4 herd: The beginning of a new era of natively ...   (ai.meta.com)
  • 2025-04-06   Model Context Protocol (MCP) an overview   (www.philschmid.de)
  • 2025-04-06   Use MCP servers in VS Code (Preview)   (code.visualstudio.com)
  • 2025-04-05   A Code Implementation to Building a Context-Aware AI Assi...   (www.marktechpost.com)
  • 2025-04-02   LLM Benchmarking: Fundamental Concepts | NVIDIA Technical...   (developer.nvidia.com)
  • 2025-04-02   A Comprehensive Guide to LLM Routing: Tools and Frameworks   (www.marktechpost.com)
  • 2025-03-26   10 Must-Know Python Libraries for LLMs in 2025   (machinelearningmastery.com)
  • 2025-03-25   Introducing 4o Image Generation   (openai.com)
  • 2025-03-25   What is the hallucination index?   (dataconomy.com)
  • 2025-03-23   Quickstart | Mistral AI Large Language Models   (docs.mistral.ai)
  • 2025-03-22   Improving Recommender Systems & Search in the Age of LLMs   (eugeneyan.com)
  • 2025-03-18   Mistral Small 3.1 runs on a MacBook and beats giants - Da...   (dataconomy.com)
  • 2025-03-16   https://www.r-bloggers.com/2025/03/the-ellmer-package-for...   (www.r-bloggers.com)
  • 2025-03-13   What is catastrophic forgetting? - Dataconomy   (dataconomy.com)
  • 2025-03-13   Top 7 Open-Source LLMs in 2025 - KDnuggets   (www.kdnuggets.com)
  • 2025-03-12   What are model cards? - Dataconomy   (dataconomy.com)
  • 2025-03-11   How I use LLMs to help me write code   (open.substack.com)
  • 2025-03-08   The State of LLM Reasoning Models   (open.substack.com)
  • 2025-03-06   Mistral OCR | Mistral AI   (mistral.ai)
  • 2025-02-26   Claude 3.7 Sonnet and Claude Code   (www.anthropic.com)
  • 2025-02-26   The Deep Research problem — Benedict Evans   (www.ben-evans.com)
  • 2025-02-24   5 Principles for Writing Effective Prompts (2025 Update)   (blog.tobiaszwingmann.com)
  • 2025-02-24   Greg Brockman shared this template for prompting   (www.linkedin.com)
  • 2025-02-21   LLM Leaderboard   (artificialanalysis.ai)
  • 2025-02-17   Here Are My Go-To AI Tools   (open.substack.com)
  • 2025-02-17   A Step-by-Step Guide to Setting Up a Custom BPE Tokenizer...   (www.marktechpost.com)
  • 2025-02-15   We Were Wrong About GPUs   (fly.io)
  • 2025-02-03   5 AI Agent Frameworks Compared - KDnuggets   (www.kdnuggets.com)
  • 2025-02-02   Creating an AI Agent-Based System with LangGraph: Adding ...   (www.marktechpost.com)
  • 2025-02-01   aidanmclaughlin/AidanBench: Aidan Bench attempts to measu...   (github.com)
  • 2025-01-29   Multi-Head Latent Attention and Other KV Cache Tricks   (www.pyspur.dev)
  • 2025-01-29   Qwen AI Introduces Qwen2.5-Max: A large MoE LLM Pretraine...   (www.marktechpost.com)
  • 2025-01-28   On MLA   (planetbanatt.net)
  • 2025-01-27   The Illustrated DeepSeek-R1   (newsletter.languagemodels.co)
  • 2025-01-26   DeepSeek-R1 vs. OpenAI’s o1: A New Step in Open Source an...   (www.marktechpost.com)
  • 2025-01-25   AI hallucinations can’t be stopped — but these techniques...   (www.nature.com)
  • 2025-01-23   How Chinese A.I. Start-Up DeepSeek Is Competing With Open...   (www.nytimes.com)
  • 2025-01-18   Microsoft Presents a Comprehensive Framework for Securing...   (www.marktechpost.com)
  • 2025-01-17   This Rumor About GPT-5 Changes Everything   (open.substack.com)
  • 2025-01-14   The 2025 AI Engineering Reading List   (www.latent.space)
  • 2025-01-12   Agents   (huyenchip.com)
  • 2025-01-12   100 Must-Read Generative AI Papers from 2024   (open.substack.com)
  • 2025-01-09   7 Next-Generation Prompt Engineering Techniques - Machine...   (machinelearningmastery.com)
  • 2025-01-08   How to use NotebookLM for personalized knowledge synthesis   (open.substack.com)
  • 2025-01-07   An Opinionated Evals Reading List — Apollo Research   (www.apolloresearch.ai)
  • 2024-12-24   Gemini 2.0 Flash "Thinking Mode"   (open.substack.com)
  • 2024-12-22   LLM Research Papers: The 2024 List   (magazine.sebastianraschka.com)
  • 2024-12-22   Why AI language models choke on too much text   (arstechnica.com)
  • 2024-12-21   rasbt/LLMs-from-scratch: Implement a ChatGPT-like LLM in ...   (github.com)
  • 2024-12-21   Slim-Llama: An Energy-Efficient LLM ASIC Processor Suppor...   (www.marktechpost.com)
  • 2024-12-21   OpenAI Unveils o3 System That Reasons Through Math, Scien...   (www.nytimes.com)
  • 2024-12-19   Building effective agents \ Anthropic   (www.anthropic.com)
  • 2024-12-18   Blt patches scale better than tokens   (dl.fbaipublicfiles.com)
  • 2024-12-16   Meta AI Proposes Large Concept Models (LCMs): A Semantic ...   (www.marktechpost.com)
  • 2024-12-15   How LLMs Store and Use Knowledge? This AI Paper Introduce...   (www.marktechpost.com)
  • 2024-12-13   LangChain vs OpenAI API: When Simplicity Meets Scalabilit...   (blogs.adityabh.is-a.dev)
  • 2024-12-09   What are Hallucinations in LLMs and 6 Effective Strategie...   (www.marktechpost.com)
  • 2024-12-07   Countless.dev | AI Model Comparison   (countless.dev)
  • 2024-12-07   CPU-GPU I/O-Aware LLM Inference Reduces Latency in GPUs b...   (www.marktechpost.com)
  • 2024-12-05   Treemap   (aiworld.eu)
  • 2024-12-05   AI Hallucinations: Why Large Language Models Make Things ...   (www.kapa.ai)
  • 2024-11-28   Four Cutting-Edge Methods for Evaluating AI Agents and En...   (www.marktechpost.com)
  • 2024-11-26   eugeneyan/llm-paper-notes: Notes from the Latent Space pa...   (github.com)
  • 2024-11-21   Understanding Multimodal LLMs   (magazine.sebastianraschka.com)
  • 2024-11-17   Something weird is happening with LLMs and chess   (open.substack.com)
  • 2024-10-31   LLM Chunking, Indexing, Scoring and Agents, in a Nutshell...   (www.datasciencecentral.com)
  • 2024-10-28   Developing a computer use model   (www.anthropic.com)
  • 2024-10-19   5 LLM Tools I Can’t Live Without   (www.kdnuggets.com)
  • 2024-08-04   dpo-from-scratch.ipynb   (github.com)
  • 2024-08-04   What We Learned from a Year of Building with LLMs (Part I)   (www.oreilly.com)
  • 2024-07-24   Customize Generative AI Models for Enterprise Application...   (developer.nvidia.com)
  • 2024-07-24   Llama 3.1 Released: Meta’s New Open-Source AI Model that ...   (www.marktechpost.com)
  • 2024-07-24   Meta Llama 3.1 405b is outperforming private models with ...   (dataconomy.com)
  • 2024-07-15   Claude 3.5 Sonnet   (www.anthropic.com)
  • 2024-07-13   Do large language models understand the world?   (www.amazon.science)
  • 2024-07-04   Building an LLM Router for High-Quality and Cost-Effectiv...   (www.anyscale.com)
  • 2024-07-03   From bare metal to a 70B model: infrastructure set-up and...   (imbue.com)
  • 2024-07-02   StarCoder2-15B: A Powerful LLM for Code Generation, Summa...   (nvda.ws)
  • 2024-06-22   Some Commonly Used Advanced Prompt Engineering Techniques...   (www.marktechpost.com)
  • 2024-06-20   Key Metrics for Evaluating Large Language Models (LLMs)   (www.marktechpost.com)
  • 2024-06-20   Firecrawl: A Powerful Web Scraping Tool for Turning Websi...   (www.marktechpost.com)
  • 2024-06-19   Let's reproduce GPT-2 (124M)   (m.youtube.com)
  • 2024-06-12   “The” Midjourney model personalization guide   (dataconomy.com)
  • 2024-06-12   How to use Perplexity in your PM work   (www.lennysnewsletter.com)
  • 2024-06-11   [2406.01506] The Geometry of Categorical and Hierarchical...   (arxiv.org)
  • 2024-06-11   What We Learned from a Year of Building with LLMs (Part II)   (www.oreilly.com)
  • 2024-06-11   Sharpening LLMs: The Sharpest Tools and Essential Techniq...   (www.marktechpost.com)
  • 2024-06-11   List of Activities and Their Corresponding Suitable LLMs ...   (www.marktechpost.com)
  • 2024-05-24   Demystifying Vision-Language Models: An In-Depth Exploration   (www.marktechpost.com)
  • 2024-05-21   naklecha/llama3-from-scratch   (github.com)
  • 2024-05-21   Abacus AI Releases Smaug-Llama-3-70B-Instruct: The New Be...   (www.marktechpost.com)
  • 2024-05-12   ChuXin: A Fully Open-Sourced Language Model with a Size o...   (www.marktechpost.com)
  • 2024-05-11   You Only Cache Once: Decoder-Decoder Architectures ...   (arxiv.org)
  • 2024-05-11   Anthropic AI Launches a Prompt Engineering Tool that Gene...   (www.marktechpost.com)
  • 2024-05-11   Cleaning   (docs.unstructured.io)
  • 2024-05-08   [2404.19737] Better & Faster Large Language Models via Mu...   (arxiv.org)
  • 2024-05-07   Researchers at NVIDIA AI Introduce ‘VILA’: A Vision Langu...   (www.marktechpost.com)
  • 2024-05-05   Hugging Face - Documentation   (huggingface.co)
  • 2024-04-25   Understanding Key Terminologies in Large Language Model (...   (www.marktechpost.com)
  • 2024-04-25   Top 15 AI Libraries/Frameworks for Automatically Red-Team...   (www.marktechpost.com)
  • 2024-04-17   anthropics/anthropic-cookbook: A collection of notebooks/...   (github.com)
  • 2024-04-15   Deep Learning Architectures From CNN, RNN, GAN, and Trans...   (www.marktechpost.com)
  • 2024-04-15   Tips for LLM Pretraining and Evaluating Reward Models   (magazine.sebastianraschka.com)
  • 2024-04-14   Lessons after a half-billion GPT tokens - Ken Kantzer's Blog   (kenkantzer.com)
  • 2024-04-13   5 Ways To Use LLMs On Your Laptop   (www.kdnuggets.com)
  • 2024-04-13   Words are flowing out like endless rain: Recapping a busy...   (arstechnica.com)
  • 2024-04-12   Gemini: A Family of Highly Capable Multimodal Models   (dev.to)
  • 2024-04-10   Peter Gostev’s Post   (www.linkedin.com)
  • 2024-04-05   Detecting Hallucinations in Large Language Models with Te...   (dev.to)
  • 2024-04-05   Top Open Source Large Language Models (LLMs) Available Fo...   (www.marktechpost.com)
  • 2024-04-02   LLaMA Now Goes Faster on CPUs   (justine.lol)
  • 2024-04-02   Large language models use a surprisingly simple mechanism...   (news.mit.edu)
  • 2024-04-02   Introducing DBRX: A New State-of-the-Art Open LLM   (www.databricks.com)
  • 2024-04-01   ChatGPT vs Perplexity AI: AI App Comparison   (www.marktechpost.com)
  • 2024-03-29   How Nvidia Blackwell Systems Attack 1 Trillion Parameter ...   (www.nextplatform.com)
  • 2024-03-29   How Chain-of-Thought Reasoning Helps Neural Networks Compute   (www.quantamagazine.org)
  • 2024-03-11   You can now train a 70b language model at home   (www.answer.ai)
  • 2024-03-07   Google Bard is called Gemini now and expands to mobile, p...   (www.axios.com)
  • 2024-03-05   Anthropic’s Post   (www.linkedin.com)
  • 2024-03-05   OpenAI's ChatGPT may have its first true rival in Anthrop...   (qz.com)
  • 2024-02-29   rasbt/LLMs-from-scratch   (github.com)
  • 2024-02-29   Meet RAGxplorer: An interactive AI Tool to Support the Bu...   (www.marktechpost.com)
  • 2024-02-29   Meet Google Lumiere AI, Bard’s video maker cousin   (dataconomy.com)
  • 2024-02-29   I Spent a Week With Gemini Pro 1.5—It’s Fantastic   (every.to)
  • 2024-02-29   The Era of 1-bit LLMs: All Large Language Models ar...   (arxiv.org)
  • 2024-02-29   Sora early access: Your guide to securing a spot   (dataconomy.com)
  • 2024-02-29   Au Large | Mistral AI | Frontier AI in your hands   (mistral.ai)
  • 2024-02-22   Claude   (claude.ai)
  • 2024-02-22   How do transformers work?+Design a Multi-class Sentiment ...   (open.substack.com)
  • 2024-02-22   1708022141659 (JPEG Image, 1280 × 1600 pixels) ...   (media.licdn.com)
  • 2024-02-20   How Well Can LLMs Negotiate? Stanford Researchers Develop...   (www.marktechpost.com)
  • 2024-02-17   Sora   (openai.com)
  • 2024-02-15   Code LoRA from Scratch - a Lightning Studio by sebastian   (lightning.ai)
  • 2024-02-15   Bard is now Gemini and Gemini Advanced is amazing   (dataconomy.com)
  • 2024-02-11   Ask HN: What have you built with LLMs?   (news.ycombinator.com)
  • 2024-02-04   BloombergGPT: A Large Language Model for Finance   (arxiv.org)
  • 2024-01-24   Exploring the Zephyr 7B: A Comprehensive Guide to the Lat...   (www.kdnuggets.com)
  • 2024-01-17   Mastering PDFs: Extracting Sections, Headings, Paragraphs...   (blog.llamaindex.ai)
  • 2024-01-16   Understanding and Coding Self-Attention, Multi-Head Atten...   (magazine.sebastianraschka.com)
  • 2024-01-07   Meet Waymo’s MotionLM: The State-of-the-Art Multi-Agent M...   (www.marktechpost.com)
  • 2024-01-07   How much detail is too much? Midjourney v6 attempts to fi...   (arstechnica.com)
  • 2024-01-07   10 Noteworthy AI Research Papers of 2023   (magazine.sebastianraschka.com)
  • 2023-10-20   7 Steps to Mastering Large Language Models (LLMs)   (www.kdnuggets.com)
  • 2023-10-20   Meta AI Researchers Propose Advanced Long-Context LLMs: A...   (www.marktechpost.com)
  • 2023-10-20   This AI Paper from NVIDIA Explores the Power of Retrieval...   (www.marktechpost.com)
  • 2023-10-20   Finetuning LLMs with LoRA and QLoRA: Insights from Hundre...   (lightning.ai)
  • 2023-10-20   Getting Started with Large Language Models: Key Things to...   (flyte.org)
  • 2023-10-20   Unlocking GPT-4 Summarization with Chain of Density Promp...   (www.kdnuggets.com)
  • 2023-10-20   Building RAG-based LLM Applications for Production (Part 1)   (www.anyscale.com)
  • 2023-10-07   Parallel Processing in Prompt Engineering: The Skeleton-o...   (www.kdnuggets.com)
  • 2023-10-05   [2302.07730] Transformer models: an introduction and catalog   (arxiv.org)
  • 2023-10-03   ChatGPT, Bard, or Bing Chat? Differences Among 3 Generati...   (www.nngroup.com)
  • 2023-10-03   Bard   (bard.google.com)
  • 2023-09-25   Large Language Model Prompt Engineering for Complex Summa...   (devblogs.microsoft.com)
  • 2023-09-25   Open LLM Leaderboard : a Hugging Face Space by HuggingFaceH4   (huggingface.co)
  • 2023-09-25   Llama from scratch   (blog.briankitano.com)
  • 2023-09-25   Asking 60+ LLMs a set of 20 questions   (benchmarks.llmonitor.com)
  • 2023-09-24   OpenAI Unveils DALL·E 3: A Revolutionary Leap in Text-to-...   (www.marktechpost.com)
  • 2023-09-24   Comparison: DALL-E 3 vs Midjourney   (dataconomy.com)
  • 2023-09-12   A Beginner’s Guide to Building LLM-Powered Applications w...   (dev.to)
  • 2023-08-31   iryna-kondr/scikit-llm: Seamlessly integrate LLMs into sc...   (github.com)
  • 2023-08-27   Together AI Unveils Llama-2-7B-32K-Instruct: A Breakthrou...   (www.marktechpost.com)
  • 2023-08-20   Meet Chroma: An AI-Native Open-Source Vector Database For...   (www.marktechpost.com)
  • 2023-08-07   Introducing OpenLLM: Open Source Library for LLMs   (www.kdnuggets.com)
  • 2023-08-07   Abacus AI Introduces A New Open Long-Context Large Langua...   (www.marktechpost.com)
  • 2023-08-06   How to use LLMs for PDF parsing   (nanonets.com)
  • 2023-08-02   LangChain 101: Build Your Own GPT-Powered Applications   (www.kdnuggets.com)
  • 2023-07-28   MPT-30B: Raising the bar for open-source foundation models   (www.mosaicml.com)
  • 2023-07-28   Midjourney pricing plans and free alternatives to try   (dataconomy.com)
  • 2023-07-28   Is Anthropic's Claude 2 model ready to take down GPT-4? W...   (dev.to)
  • 2023-07-24   Emerging Architectures for LLM Applications   (a16z.com)
  • 2023-07-24   ELI5: FlashAttention   (gordicaleksa.medium.com)
  • 2023-07-24   Free Full Stack LLM Bootcamp   (www.kdnuggets.com)
  • 2023-07-23   The Secret Sauce behind 100K context window in LLMs: all ...   (blog.gopenai.com)
  • 2023-07-23   Training LLMs with AMD MI250 GPUs and MosaicML   (www.mosaicml.com)
  • 2023-07-23   Optimizing Memory Usage for Training LLMs and Vision Tran...   (lightning.ai)
  • 2023-07-23   Ecosystem Graphs for Foundation Models   (crfm.stanford.edu)
  • 2023-07-23   Leandro von Werra’s Post   (www.linkedin.com)
  • 2023-07-12   Unraveling the Power of Chain-of-Thought Prompting in Lar...   (www.kdnuggets.com)
  • 2023-07-12   GitHub - Mooler0410/LLMsPracticalGuide: A curated list of...   (github.com)
  • 2023-06-19   Falcon LLM: The New King of Open-Source LLMs   (www.kdnuggets.com)
  • 2023-06-09   LLM Garden | Discover, search, and compare LLMs   (llm.garden)
  • 2023-06-08   iryna-kondr/scikit-llm   (github.com)
  • 2023-05-21   Sonali Pattnaik on LinkedIn: #generativeai #ai | 45 comments   (www.linkedin.com)
  • 2023-05-19   The Non-Silence of the LLMs   (informationisbeautiful.net)
  • 2023-05-19   Super Bard: The AI That Can Do It All and Better   (www.kdnuggets.com)
  • 2023-05-05   Distilling Step-by-Step! Outperforming Larger Language Mo...   (arxiv.org)
  • 2023-05-05   SparseGPT: Massive Language Models Can Be Accurately Prun...   (arxiv.org)
  • 2023-05-05   openlm-research/open_llama: OpenLLaMA, a permissively lic...   (github.com)
  • 2023-05-03   guidance-ai/guidance: A guidance language for controlling...   (github.com)
  • 2023-04-29   Blog | Anyscale   (www.anyscale.com)
  • 2023-04-25   Data Machina #198   (datamachina.substack.com)
  • 2023-04-25   Finetuning Large Language Models   (magazine.sebastianraschka.com)
  • 2023-04-21   Stanford CRFM   (crfm.stanford.edu)
  • 2023-04-21   Eight Things to Know about Large Language Models   (arxiv.org)
  • 2023-04-19   Baby AGI: The Birth of a Fully Autonomous AI   (www.kdnuggets.com)
  • 2023-04-19   Hacker News   (magazine.sebastianraschka.com)
  • 2023-04-14   Prompt Engineering   (lilianweng.github.io)
  • 2023-04-14   A Survey of Large Language Models   (arxiv.org)
  • 2023-04-14   New Ebook: A Beginner’s Guide to Large Language Models   (www.nvidia.com)
  • 2023-02-10   OpenAI Platform   (platform.openai.com)
  • 2014-09-24   Top 9 Different Types of Retrieval-Augmented Generation (...   (www.marktechpost.com)
  • 2014-09-24   FlashSigmoid: A Hardware-Aware and Memory-Efficient Imple...   (www.marktechpost.com)
  • 2014-09-24   Graphiti: A Python Library for Building Temporal Knowledg...   (www.marktechpost.com)
  • 2014-08-24   Building a Simple RAG Application Using LlamaIndex - Mach...   (machinelearningmastery.com)
  • 2009-09-24   LlamaIndex : LlamaIndex   (docs.llamaindex.ai)
  • 2003-09-24   Why GPU Utilization Falls Short: Understanding Streaming ...   (www.marktechpost.com)
  • 2002-10-24   LightLLM: A Lightweight Scalable and High-Speed Python Fr...   (www.marktechpost.com)
  • 2001-10-24   Ten Effective Strategies to Lower Large Language Model (L...   (www.marktechpost.com)