large language models (LLMs) (all)

Also see: streaming, youtube, video

2025-05-05 Physics of Language Models: Part 4.1, Architecture Design... (papers.ssrn.com)

2025-05-04 Dummy’s Guide to Modern LLM Sampling (simonwillison.net)

2025-05-03 Creating an MCP Server Using Go (eltonminetto.dev)

2025-05-03 ollama with docker compose (geshan.com.np)

2025-05-03 ollama APIs (geshan.com.np)

2025-05-03 What is Ollama and how to use it: a quick guide [part 1] (geshan.com.np)

2025-05-03 Ollama commands: How to use Ollama in the command line [P... (geshan.com.np)

2025-05-03 LLM Projects with Python (thecleverprogrammer.com)

2025-05-01 XiaomiMiMo/MiMo: MiMo: Unlocking the Reasoning Potential ... (github.com)

2025-05-01 Chatbot Arena (formerly LMSYS): Free AI Chat to Compare &... (lmarena.ai)

2025-04-30 The Leaderboard Illusion (arxiv.org)

2025-04-26 DeepSeek R2 AI Model Rumors Begin to Swirl Online; Report... (wccftech.com)

2025-04-20 To Make Language Models Work Better, Researchers Sidestep... (www.quantamagazine.org)

2025-04-20 The State of Reinforcement Learning for LLM Reasoning (sebastianraschka.com)

2025-04-18 OpenAI Releases a Practical Guide to Building LLM Agents ... (www.marktechpost.com)

2025-04-17 LLM Post-Training: A Deep Dive into Reasoning Large Langu... (arxiv.org)

2025-04-16 How To Build An Agent | Amp (ampcode.com)

2025-04-13 humanlayer/12-factor-agents (github.com)

2025-04-13 The Rise of Slopsquatting: How AI Hallucinations Are Fuel... (socket.dev)

2025-04-11 12-factor-agents: Principles to build LLM-powered softwar... (lobste.rs)

2025-04-10 An LLM Query Understanding Service (simonwillison.net)

2025-04-10 The Man Out to Prove How Dumb AI Still Is (www.theatlantic.com)

2025-04-08 The “S” in MCP Stands for Security - Elena Cross - Medium (elenacross7.medium.com)

2025-04-08 Topic 31: How to Reduce Memory Use in Reasoning Models (www.turingpost.com)

2025-04-07 Topic 33: Slim Attention, KArAt, XAttention and Multi-Tok... (huggingface.co)

2025-04-07 A look at the ARC-AGI exam designed by French computer sc... (www.techmeme.com)

2025-04-06 The Llama 4 herd: The beginning of a new era of natively ... (ai.meta.com)

2025-04-06 Model Context Protocol (MCP) an overview (www.philschmid.de)

2025-04-06 Use MCP servers in VS Code (Preview) (code.visualstudio.com)

2025-04-05 If Anthropic Succeeds, a Nation of Benevolent AI Geniuses... (www.wired.com)

2025-04-05 A Code Implementation to Building a Context-Aware AI Assi... (www.marktechpost.com)

2025-04-02 LLM Benchmarking: Fundamental Concepts | NVIDIA Technical... (developer.nvidia.com)

2025-04-02 A Comprehensive Guide to LLM Routing: Tools and Frameworks (www.marktechpost.com)

2025-03-29 First Look at Reasoning From Scratch: Chapter 1 (sebastianraschka.com)

2025-03-28 How DeepSeek Rewrote the Transformer [MLA] (www.youtube.com)

2025-03-28 Tracing the thoughts of a large language model (simonwillison.net)

2025-03-27 Anthropic can now track the bizarre inner workings of a l... (www.technologyreview.com)

2025-03-26 10 Must-Know Python Libraries for LLMs in 2025 (machinelearningmastery.com)

2025-03-26 Function calling with Gemma (simonwillison.net)

2025-03-26 Putting Gemini 2.5 Pro through its paces (simonwillison.net)

2025-03-25 Introducing 4o Image Generation (openai.com)

2025-03-25 What is the hallucination index? (dataconomy.com)

2025-03-23 Quickstart | Mistral AI Large Language Models (docs.mistral.ai)

2025-03-22 Improving Recommender Systems & Search in the Age of LLMs (eugeneyan.com)

2025-03-20 Anthropic just gave Claude a superpower: real-time web se... (venturebeat.com)

2025-03-18 Mistral Small 3.1 runs on a MacBook and beats giants - Da... (dataconomy.com)

2025-03-17 Mistral Small 3.1 (simonwillison.net)

2025-03-16 https://www.r-bloggers.com/2025/03/the-ellmer-package-for... (www.r-bloggers.com)

2025-03-13 What is catastrophic forgetting? - Dataconomy (dataconomy.com)

2025-03-13 Top 7 Open-Source LLMs in 2025 - KDnuggets (www.kdnuggets.com)

2025-03-12 What are model cards? - Dataconomy (dataconomy.com)

2025-03-11 How I use LLMs to help me write code (open.substack.com)

2025-03-08 On GPT-4.5 (thezvi.substack.com)

2025-03-08 The State of LLM Reasoning Models (open.substack.com)

2025-03-07 Mistral OCR (simonwillison.net)

2025-03-06 Mistral OCR | Mistral AI (mistral.ai)

2025-03-04 llm-ollama 0.9.0 (simonwillison.net)

2025-02-26 Claude 3.7 Sonnet and Claude Code (www.anthropic.com)

2025-02-26 The Deep Research problem — Benedict Evans (www.ben-evans.com)

2025-02-24 5 Principles for Writing Effective Prompts (2025 Update) (blog.tobiaszwingmann.com)

2025-02-24 Greg Brockman shared this template for prompting (www.linkedin.com)

2025-02-21 LLM Leaderboard (artificialanalysis.ai)

2025-02-17 Here Are My Go-To AI Tools (open.substack.com)

2025-02-17 A Step-by-Step Guide to Setting Up a Custom BPE Tokenizer... (www.marktechpost.com)

2025-02-15 We Were Wrong About GPUs (fly.io)

2025-02-07 Using pip to install a Large Language Model that’s under ... (simonwillison.net)

2025-02-05 Understanding Reasoning LLMs (sebastianraschka.com)

2025-02-03 5 AI Agent Frameworks Compared - KDnuggets (www.kdnuggets.com)

2025-02-02 (WIP) A Little Bit of Reinforcement Learning from Human F... (rlhfbook.com)

2025-02-02 Creating an AI Agent-Based System with LangGraph: Adding ... (www.marktechpost.com)

2025-02-01 aidanmclaughlin/AidanBench: Aidan Bench attempts to measu... (github.com)

2025-01-31 OpenAI o3-mini, now available in LLM (simonwillison.net)

2025-01-29 Multi-Head Latent Attention and Other KV Cache Tricks (www.pyspur.dev)

2025-01-29 Qwen AI Introduces Qwen2.5-Max: A large MoE LLM Pretraine... (www.marktechpost.com)

2025-01-29 Alibaba releases AI model it says surpasses DeepSeek (www.reuters.com)

2025-01-28 On MLA (planetbanatt.net)

2025-01-27 The Illustrated DeepSeek-R1 (newsletter.languagemodels.co)

2025-01-26 DeepSeek-R1 vs. OpenAI’s o1: A New Step in Open Source an... (www.marktechpost.com)

2025-01-25 AI hallucinations can’t be stopped — but these techniques... (www.nature.com)

2025-01-23 Noteworthy LLM Research Papers of 2024 (sebastianraschka.com)

2025-01-23 LLM 0.20 (simonwillison.net)

2025-01-23 How Chinese A.I. Start-Up DeepSeek Is Competing With Open... (www.nytimes.com)

2025-01-20 DeepSeek-R1 and exploring DeepSeek-R1-Distill-Llama-8B (simonwillison.net)

2025-01-18 Microsoft Presents a Comprehensive Framework for Securing... (www.marktechpost.com)

2025-01-18 Lessons From Red Teaming 100 Generative AI Products (simonwillison.net)

2025-01-18 Implementing A Byte Pair Encoding (BPE) Tokenizer From Sc... (sebastianraschka.com)

2025-01-17 This Rumor About GPT-5 Changes Everything (open.substack.com)

2025-01-14 The 2025 AI Engineering Reading List (www.latent.space)

2025-01-12 Agents (huyenchip.com)

2025-01-12 100 Must-Read Generative AI Papers from 2024 (open.substack.com)

2025-01-09 7 Next-Generation Prompt Engineering Techniques - Machine... (machinelearningmastery.com)

2025-01-08 How to use NotebookLM for personalized knowledge synthesis (open.substack.com)

2025-01-07 An Opinionated Evals Reading List — Apollo Research (www.apolloresearch.ai)

2024-12-31 Things we learned out about LLMs in 2024 (simonwillison.net)

2024-12-30 How to Build a Graph RAG App (towardsdatascience.com)

2024-12-24 Gemini 2.0 Flash "Thinking Mode" (open.substack.com)

2024-12-22 LLM Research Papers: The 2024 List (magazine.sebastianraschka.com)

2024-12-22 Why AI language models choke on too much text (arstechnica.com)

2024-12-21 rasbt/LLMs-from-scratch: Implement a ChatGPT-like LLM in ... (github.com)

2024-12-21 Slim-Llama: An Energy-Efficient LLM ASIC Processor Suppor... (www.marktechpost.com)

2024-12-21 OpenAI Unveils o3 System That Reasons Through Math, Scien... (www.nytimes.com)

2024-12-19 Building effective agents \ Anthropic (www.anthropic.com)

2024-12-18 Blt patches scale better than tokens (dl.fbaipublicfiles.com)

2024-12-16 Meta AI Proposes Large Concept Models (LCMs): A Semantic ... (www.marktechpost.com)

2024-12-15 How LLMs Store and Use Knowledge? This AI Paper Introduce... (www.marktechpost.com)

2024-12-13 LangChain vs OpenAI API: When Simplicity Meets Scalabilit... (blogs.adityabh.is-a.dev)

2024-12-12 Transformers Key-Value (KV) Caching Explained (towardsdatascience.com)

2024-12-12 Scaling Laws – O1 Pro Architecture, Reasoning Training In... (semianalysis.com)

2024-12-11 The AI Researchers Pushing Computers to Launch Nightmare ... (www.wsj.com)

2024-12-09 What are Hallucinations in LLMs and 6 Effective Strategie... (www.marktechpost.com)

2024-12-07 Countless.dev | AI Model Comparison (countless.dev)

2024-12-07 CPU-GPU I/O-Aware LLM Inference Reduces Latency in GPUs b... (www.marktechpost.com)

2024-12-05 How to Build a General-Purpose LLM Agent (towardsdatascience.com)

2024-12-05 Treemap (aiworld.eu)

2024-12-05 AI Hallucinations: Why Large Language Models Make Things ... (www.kapa.ai)

2024-11-29 llama.cpp guide - Running LLMs locally, on any hardware, ... (steelph0enix.github.io)

2024-11-28 Four Cutting-Edge Methods for Evaluating AI Agents and En... (www.marktechpost.com)

2024-11-26 eugeneyan/llm-paper-notes: Notes from the Latent Space pa... (github.com)

2024-11-21 Understanding Multimodal LLMs (magazine.sebastianraschka.com)

2024-11-17 Something weird is happening with LLMs and chess (open.substack.com)

2024-11-11 Analyzing the homerun year for LLMs: the top-100 most cit... (www.zeta-alpha.com)

2024-10-31 LLM Chunking, Indexing, Scoring and Agents, in a Nutshell... (www.datasciencecentral.com)

2024-10-28 Developing a computer use model (www.anthropic.com)

2024-10-19 5 LLM Tools I Can’t Live Without (www.kdnuggets.com)

2024-10-19 Claude: Everything you need to know about Anthropic's AI ... (techcrunch.com)

2024-10-17 Nvidia just dropped a new AI model that crushes OpenAI’s ... (venturebeat.com)

2024-08-04 dpo-from-scratch.ipynb (github.com)

2024-08-04 What We Learned from a Year of Building with LLMs (Part I) (www.oreilly.com)

2024-08-01 Towards Monosemanticity: A step towards understanding lar... (towardsdatascience.com)

2024-07-24 Meta unleashes its most powerful AI model, Llama 3.1, wit... (venturebeat.com)

2024-07-24 Customize Generative AI Models for Enterprise Application... (developer.nvidia.com)

2024-07-24 Llama 3.1 Released: Meta’s New Open-Source AI Model that ... (www.marktechpost.com)

2024-07-24 Meta Llama 3.1 405b is outperforming private models with ... (dataconomy.com)

2024-07-20 Understanding Positional Embeddings in Transformers: From... (towardsdatascience.com)

2024-07-15 Claude 3.5 Sonnet (www.anthropic.com)

2024-07-13 Do large language models understand the world? (www.amazon.science)

2024-07-04 Building an LLM Router for High-Quality and Cost-Effectiv... (www.anyscale.com)

2024-07-03 From bare metal to a 70B model: infrastructure set-up and... (imbue.com)

2024-07-02 StarCoder2-15B: A Powerful LLM for Code Generation, Summa... (nvda.ws)

2024-06-27 How Gradient created an open LLM with a million-token con... (venturebeat.com)

2024-06-22 Some Commonly Used Advanced Prompt Engineering Techniques... (www.marktechpost.com)

2024-06-20 Key Metrics for Evaluating Large Language Models (LLMs) (www.marktechpost.com)

2024-06-20 Firecrawl: A Powerful Web Scraping Tool for Turning Websi... (www.marktechpost.com)

2024-06-19 Let's reproduce GPT-2 (124M) (m.youtube.com)

2024-06-19 How to use an open source LLM model locally and remotely (thoughtbot.com)

2024-06-12 “The” Midjourney model personalization guide (dataconomy.com)

2024-06-12 How to use Perplexity in your PM work (www.lennysnewsletter.com)

2024-06-11 [2406.01506] The Geometry of Categorical and Hierarchical... (arxiv.org)

2024-06-11 What We Learned from a Year of Building with LLMs (Part II) (www.oreilly.com)

2024-06-11 Sharpening LLMs: The Sharpest Tools and Essential Techniq... (www.marktechpost.com)

2024-06-11 List of Activities and Their Corresponding Suitable LLMs ... (www.marktechpost.com)

2024-06-11 Three Things to Know About Prompting LLMs (sloanreview.mit.edu)

2024-05-31 Perplexity goes beyond AI search, launches publishing pla... (venturebeat.com)

2024-05-28 The Great AI Chatbot Challenge: ChatGPT vs. Gemini vs. Co... (www.wsj.com)

2024-05-26 The future of foundation models is closed-source (www.thediff.co)

2024-05-24 Demystifying Vision-Language Models: An In-Depth Exploration (www.marktechpost.com)

2024-05-22 AI Is a Black Box. Anthropic Figured Out a Way to Look In... (www.wired.com)

2024-05-21 naklecha/llama3-from-scratch (github.com)

2024-05-21 Abacus AI Releases Smaug-Llama-3-70B-Instruct: The New Be... (www.marktechpost.com)

2024-05-13 Do Enormous LLM Context Windows Spell the End of RAG? (thenewstack.io)

2024-05-13 How Good Are the Latest Open LLMs? And Is DPO Better Than... (sebastianraschka.com)

2024-05-12 ChuXin: A Fully Open-Sourced Language Model with a Size o... (www.marktechpost.com)

2024-05-11 Title:You Only Cache Once: Decoder-Decoder Architectures ... (arxiv.org)

2024-05-11 Anthropic AI Launches a Prompt Engineering Tool that Gene... (www.marktechpost.com)

2024-05-11 Cleaning (docs.unstructured.io)

2024-05-08 [2404.19737] Better & Faster Large Language Models via Mu... (arxiv.org)

2024-05-07 Researchers at NVIDIA AI Introduce ‘VILA’: A Vision Langu... (www.marktechpost.com)

2024-05-05 Hugging Face - Documentation (huggingface.co)

2024-04-25 Understanding Key Terminologies in Large Language Model (... (www.marktechpost.com)

2024-04-25 Top 15 AI Libraries/Frameworks for Automatically Red-Team... (www.marktechpost.com)

2024-04-19 Meta says Llama 3 beats most other models, including Gemi... (www.theverge.com)

2024-04-17 anthropics/anthropic-cookbook: A collection of notebooks/... (github.com)

2024-04-15 Deep Learning Architectures From CNN, RNN, GAN, and Trans... (www.marktechpost.com)

2024-04-15 Tips for LLM Pretraining and Evaluating Reward Models (magazine.sebastianraschka.com)

2024-04-14 Lessons after a half-billion GPT tokens - Ken Kantzer's Blog (kenkantzer.com)

2024-04-13 5 Ways To Use LLMs On Your Laptop (www.kdnuggets.com)

2024-04-13 Words are flowing out like endless rain: Recapping a busy... (arstechnica.com)

2024-04-12 Gemini: A Family of Highly Capable Multimodal Models (dev.to)

2024-04-10 Peter Gostev’s Post (www.linkedin.com)

2024-04-05 Detecting Hallucinations in Large Language Models with Te... (dev.to)

2024-04-05 Top Open Source Large Language Models (LLMs) Available Fo... (www.marktechpost.com)

2024-04-02 LLaMA Now Goes Faster on CPUs (justine.lol)

2024-04-02 Large language models use a surprisingly simple mechanism... (news.mit.edu)

2024-04-02 Introducing DBRX: A New State-of-the-Art Open LLM (www.databricks.com)

2024-04-01 ChatGPT vs Perplexity AI: AI App Comparison (www.marktechpost.com)

2024-03-30 Mamba Explained (thegradient.pub)

2024-03-29 How Nvidia Blackwell Systems Attack 1 Trillion Parameter ... (www.nextplatform.com)

2024-03-29 How Chain-of-Thought Reasoning Helps Neural Networks Compute (www.quantamagazine.org)

2024-03-11 Why and How to Achieve Longer Context Windows for LLMs (towardsdatascience.com)

2024-03-11 Generative AI Design Patterns: A Comprehensive Guide | by... (towardsdatascience.com)

2024-03-11 You can now train a 70b language model at home (www.answer.ai)

2024-03-11 Easily Train a Specialized LLM: PEFT, LoRA, QLoRA, LLaMA-... (towardsdatascience.com)

2024-03-07 Google Bard is called Gemini now and expands to mobile, p... (www.axios.com)

2024-03-05 Anthropic’s Post (www.linkedin.com)

2024-03-05 OpenAI's ChatGPT may have its first true rival in Anthrop... (qz.com)

2024-02-29 rasbt/LLMs-from-scratch (github.com)

2024-02-29 Meet RAGxplorer: An interactive AI Tool to Support the Bu... (www.marktechpost.com)

2024-02-29 Meet Google Lumiere AI, Bard’s video maker cousin (dataconomy.com)

2024-02-29 How To Build an LLM-Powered App To Chat with PapersWithCode (towardsdatascience.com)

2024-02-29 The killer app of Gemini Pro 1.5 is video (simonwillison.net)

2024-02-29 Understanding Direct Preference Optimization (towardsdatascience.com)

2024-02-29 I Spent a Week With Gemini Pro 1.5—It’s Fantastic (every.to)

2024-02-29 Title:The Era of 1-bit LLMs: All Large Language Models ar... (arxiv.org)

2024-02-29 Sora early access: Your guide to securing a spot (dataconomy.com)

2024-02-29 Au Large | Mistral AI | Frontier AI in your hands (mistral.ai)

2024-02-22 Claude (claude.ai)

2024-02-22 Beyond Self-Attention: How a Small Language Model Predict... (shyam.blog)

2024-02-22 How do transformers work?+Design a Multi-class Sentiment ... (open.substack.com)

2024-02-22 1708022141659 (JPEG Image, 1280 × 1600 pixels) ... (media.licdn.com)

2024-02-22 Groq Inference Tokenomics: Speed, But At What Cost? (www.semianalysis.com)

2024-02-20 How Well Can LLMs Negotiate? Stanford Researchers Develop... (www.marktechpost.com)

2024-02-17 Sora (openai.com)

2024-02-15 Code LoRA from Scratch - a Lightning Studio by sebastian (lightning.ai)

2024-02-15 Bard is now Gemini and Gemini Advanced is amazing (dataconomy.com)

2024-02-11 Ask HN: What have you built with LLMs? (news.ycombinator.com)

2024-02-04 Title:BloombergGPT: A Large Language Model for Finance (arxiv.org)

2024-01-24 Exploring the Zephyr 7B: A Comprehensive Guide to the Lat... (www.kdnuggets.com)

2024-01-17 Mastering PDFs: Extracting Sections, Headings, Paragraphs... (blog.llamaindex.ai)

2024-01-16 Understanding and Coding Self-Attention, Multi-Head Atten... (magazine.sebastianraschka.com)

2024-01-16 Dashboard - SciSummary (scisummary.com)

2024-01-07 Meet Waymo’s MotionLM: The State-of-the-Art Multi-Agent M... (www.marktechpost.com)

2024-01-07 How much detail is too much? Midjourney v6 attempts to fi... (arstechnica.com)

2024-01-07 10 Noteworthy AI Research Papers of 2023 (magazine.sebastianraschka.com)

2023-10-20 7 Steps to Mastering Large Language Models (LLMs) (www.kdnuggets.com)

2023-10-20 Meta AI Researchers Propose Advanced Long-Context LLMs: A... (www.marktechpost.com)

2023-10-20 This AI Paper from NVIDIA Explores the Power of Retrieval... (www.marktechpost.com)

2023-10-20 Finetuning LLMs with LoRA and QLoRA: Insights from Hundre... (lightning.ai)

2023-10-20 Getting Started with Large Language Models: Key Things to... (flyte.org)

2023-10-20 Unlocking GPT-4 Summarization with Chain of Density Promp... (www.kdnuggets.com)

2023-10-20 The Ins and Outs of Retrieval-Augmented Generation (RAG) (towardsdatascience.com)

2023-10-20 Building RAG-based LLM Applications for Production (Part 1) (www.anyscale.com)

2023-10-20 RAG vs Finetuning: Which Is the Best Tool to Boost Your L... (towardsdatascience.com)

2023-10-20 A High-Level Overview Of Large Language Model Concepts, U... (smashingmagazine.com)

2023-10-20 Augmenting LLMs with RAG (towardsdatascience.com)

2023-10-07 Parallel Processing in Prompt Engineering: The Skeleton-o... (www.kdnuggets.com)

2023-10-05 [2302.07730] Transformer models: an introduction and catalog (arxiv.org)

2023-10-04 Hey, Computer, Make Me a Font (serce.me)

2023-10-04 SaaS Competitive Advantage Through Elegant LLM Feedback M... (www.tomtunguz.com)

2023-10-03 ChatGPT, Bard, or Bing Chat? Differences Among 3 Generati... (www.nngroup.com)

2023-10-03 Bard (bard.google.com)

2023-10-03 The State of Large Language Models (www.scientificamerican.com)

2023-09-25 10 Ways to Improve the Performance of Retrieval Augmented... (towardsdatascience.com)

2023-09-25 How to Build an LLM from Scratch (towardsdatascience.com)

2023-09-25 Large Language Model Prompt Engineering for Complex Summa... (devblogs.microsoft.com)

2023-09-25 Open LLM Leaderboard : a Hugging Face Space by HuggingFaceH4 (huggingface.co)

2023-09-25 Llama from scratch (blog.briankitano.com)

2023-09-25 Cracking Open the OpenAI (Python) API (towardsdatascience.com)

2023-09-25 Cracking Open the Hugging Face Transformers Library (towardsdatascience.com)

2023-09-25 Asking 60+ LLMs a set of 20 questions (benchmarks.llmonitor.com)

2023-09-24 OpenAI Unveils DALL·E 3: A Revolutionary Leap in Text-to-... (www.marktechpost.com)

2023-09-24 Comparison: DALL-E 3 vs Midjourney (dataconomy.com)

2023-09-17 What OpenAI Really Wants (www.wired.com)

2023-09-12 A Beginner’s Guide to Building LLM-Powered Applications w... (dev.to)

2023-08-31 iryna-kondr/scikit-llm: Seamlessly integrate LLMs into sc... (github.com)

2023-08-31 Prompt Engineering — How to trick AI into solving your pr... (towardsdatascience.com)

2023-08-30 A Beginner’s Guide to LLM Fine-Tuning (towardsdatascience.com)

2023-08-27 Together AI Unveils Llama-2-7B-32K-Instruct: A Breakthrou... (www.marktechpost.com)

2023-08-25 A Practical Introduction to LLMs (towardsdatascience.com)

2023-08-20 Meet Chroma: An AI-Native Open-Source Vector Database For... (www.marktechpost.com)

2023-08-07 How to Extract Text from Any PDF and Image for Large Lang... (towardsdatascience.com)

2023-08-07 Introducing OpenLLM: Open Source Library for LLMs (www.kdnuggets.com)

2023-08-07 Abacus AI Introduces A New Open Long-Context Large Langua... (www.marktechpost.com)

2023-08-06 How to use LLMs for PDF parsing (nanonets.com)

2023-08-06 How to Chat With Any File from PDFs to Images Using Large... (towardsdatascience.com)

2023-08-06 How to Leverage Open-Source LLMs in Your Project (www.turingpost.com)

2023-08-02 LangChain 101: Build Your Own GPT-Powered Applications (www.kdnuggets.com)

2023-07-28 MPT-30B: Raising the bar for open-source foundation models (www.mosaicml.com)

2023-07-28 Midjourney pricing plans and free alternatives to try (dataconomy.com)

2023-07-28 A Deep Dive Into LLaMA, Falcon, Llama 2 and Their Remarka... (www.turingpost.com)

2023-07-28 Chain of Thought Prompting for LLMs (towardsdatascience.com)

2023-07-28 Is Anthropic's Claude 2 model ready to take down GPT-4? W... (dev.to)

2023-07-24 Emerging Architectures for LLM Applications (a16z.com)

2023-07-24 ELI5: FlashAttention (gordicaleksa.medium.com)

2023-07-24 Build Industry-Specific LLMs Using Retrieval Augmented Ge... (towardsdatascience.com)

2023-07-24 Free Full Stack LLM Bootcamp (www.kdnuggets.com)

2023-07-24 Edge 300: Meet Falcon LLM: The Most Powerful Open Source ... (thesequence.substack.com)

2023-07-23 The Secret Sauce behind 100K context window in LLMs: all ... (blog.gopenai.com)

2023-07-23 Observe.ai unveils 30-billion-parameter contact center LL... (venturebeat.com)

2023-07-23 All You Need to Know to Build Your First LLM App (towardsdatascience.com)

2023-07-23 Training LLMs with AMD MI250 GPUs and MosaicML (www.mosaicml.com)

2023-07-23 Optimizing Memory Usage for Training LLMs and Vision Tran... (lightning.ai)

2023-07-23 Deploying Falcon-7B Into Production (towardsdatascience.com)

2023-07-23 Anthropic releases Claude 2, its second-gen AI chatbot (techcrunch.com)

2023-07-23 Google Launches AI-Powered Notes App Called NotebookLM (tech.slashdot.org)

2023-07-23 Ecosystem Graphs for Foundation Models (crfm.stanford.edu)

2023-07-23 Meet LMQL: An Open Source Query Language for LLMs (thesequence.substack.com)

2023-07-23 Leandro von Werra’s Post (www.linkedin.com)

2023-07-23 LLaMA 2: How to access and use Meta’s versatile open-sour... (venturebeat.com)

2023-07-22 Beyond LLaMA: The Power of Open LLMs (towardsdatascience.com)

2023-07-22 Facebook parent Meta unveils LLaMA 2 open-source AI model... (venturebeat.com)

2023-07-22 MosaicML launches MPT-7B-8K, a 7B-parameter open-source L... (venturebeat.com)

2023-07-22 The $1 billion gamble to ensure AI doesn’t destroy humanity (www.thediff.co)

2023-07-12 Unraveling the Power of Chain-of-Thought Prompting in Lar... (www.kdnuggets.com)

2023-07-12 GitHub - Mooler0410/LLMsPracticalGuide: A curated list of... (github.com)

2023-06-19 Introduction to the Open LLM Falcon-40B: Performance, Tra... (towardsdatascience.com)

2023-06-19 Falcon LLM: The New King of Open-Source LLMs (www.kdnuggets.com)

2023-06-18 Meet FinGPT: An Open-Source Financial Large Language Mode... (www-marktechpost-com.cdn.ampproject.org)

2023-06-09 LMM Garden | Discover, search, and compare LLMs (llm.garden)

2023-06-08 iryna-kondr/scikit-llm (github.com)

2023-06-02 The Case for Running AI on CPUs Isn’t Dead Yet (spectrum.ieee.org)

2023-05-28 The Art of Prompt Design: Prompt Boundaries and Token Hea... (towardsdatascience.com)

2023-05-21 Sonali Pattnaik on LinkedIn: #generativeai #ai | 45 comments (www.linkedin.com)

2023-05-19 The Non-Silence of the LLMs (informationisbeautiful.net)

2023-05-19 Super Bard: The AI That Can Do It All and Better (www.kdnuggets.com)

2023-05-18 Edge 291: Reinforcement Learning with Human Feedback (thesequence.substack.com)

2023-05-12 Google dives into the ‘supercomputer’ game by knitting to... (venturebeat.com)

2023-05-05 Distilling Step-by-Step! Outperforming Larger Language Mo... (arxiv.org)

2023-05-05 SparseGPT: Massive Language Models Can Be Accurately Prun... (arxiv.org)

2023-05-05 openlm-research/open_llama: OpenLLaMA, a permissively lic... (github.com)

2023-05-03 guidance-ai/guidance: A guidance language for controlling... (github.com)

2023-04-29 Blog | Anyscale (www.anyscale.com)

2023-04-29 Parameter-Efficient LLM Finetuning With Low-Rank Adaptati... (sebastianraschka.com)

2023-04-29 Edge 286: Vicuna, the LLaMA-Based Model that Matches Chat... (thesequence.substack.com)

2023-04-26 Grounding Large Language Models in a Cognitive Foundation... (thegradient.pub)

2023-04-25 Data Machina #198 (datamachina.substack.com)

2023-04-25 Finetuning Large Language Models (magazine.sebastianraschka.com)

2023-04-21 The LLama Effect: How an Accidental Leak Sparked a Series... (thesequence.substack.com)

2023-04-21 Stanford CRFM (crfm.stanford.edu)

2023-04-21 Meta has built a massive new language AI—and it’s giving ... (www.technologyreview.com)

2023-04-21 Eight Things to Know about Large Language Models (arxiv.org)

2023-04-19 Baby AGI: The Birth of a Fully Autonomous AI (www.kdnuggets.com)

2023-04-19 Hacker News (magazine.sebastianraschka.com)

2023-04-17 📝 Guest Post: How to Enhance the Usefulness of Large Lang... (thesequence.substack.com)

2023-04-14 Prompt Engineering (lilianweng.github.io)

2023-04-14 A Survey of Large Language Models (arxiv.org)

2023-04-14 New Ebook: A Beginner’s Guide to Large Language Models (www.nvidia.com)

2023-04-13 Maximizing the Potential of LLMs: A Guide to Prompt Engin... (www.ruxu.dev)

2023-04-13 The Magic of LLMs — Prompt Engineering (towardsdatascience.com)

2023-04-12 📝 Guest Post: Caching LLM Queries for Improved Performanc... (thesequence.substack.com)

2023-02-10 OpenAI Platform (platform.openai.com)

2014-09-24 Graphiti: A Python Library for Building Temporal Knowledg... (www.marktechpost.com)

2014-09-24 Top 9 Different Types of Retrieval-Augmented Generation (... (www.marktechpost.com)

2014-09-24 FlashSigmoid: A Hardware-Aware and Memory-Efficient Imple... (www.marktechpost.com)

2014-08-24 Building a Simple RAG Application Using LlamaIndex - Mach... (machinelearningmastery.com)

2009-09-24 LlamaIndex : LlamaIndex (docs.llamaindex.ai)

2003-09-24 Why GPU Utilization Falls Short: Understanding Streaming ... (www.marktechpost.com)

2002-10-24 Nvidia just dropped a bombshell: Its new AI model is open... (venturebeat.com)

2002-10-24 LightLLM: A Lightweight Scalable and High-Speed Python Fr... (www.marktechpost.com)

2001-10-24 Ten Effective Strategies to Lower Large Language Model (L... (www.marktechpost.com)

Perfectly Awesome

large language models (LLMs) (all)