large language models (LLMs) (all)

Also see: streaming, youtube, video

  • 2025-03-23   Quickstart | Mistral AI Large Language Models   (docs.mistral.ai)
  • 2025-03-22   Improving Recommender Systems & Search in the Age of LLMs   (eugeneyan.com)
  • 2025-03-20   Anthropic just gave Claude a superpower: real-time web se...   (venturebeat.com)
  • 2025-03-18   Mistral Small 3.1 runs on a MacBook and beats giants - Da...   (dataconomy.com)
  • 2025-03-17   Mistral Small 3.1   (simonwillison.net)
  • 2025-03-16   https://www.r-bloggers.com/2025/03/the-ellmer-package-for...   (www.r-bloggers.com)
  • 2025-03-13   What is catastrophic forgetting? - Dataconomy   (dataconomy.com)
  • 2025-03-13   Top 7 Open-Source LLMs in 2025 - KDnuggets   (www.kdnuggets.com)
  • 2025-03-12   What are model cards? - Dataconomy   (dataconomy.com)
  • 2025-03-11   How I use LLMs to help me write code   (open.substack.com)
  • 2025-03-08   On GPT-4.5   (thezvi.substack.com)
  • 2025-03-08   The State of LLM Reasoning Models   (open.substack.com)
  • 2025-03-07   Mistral OCR   (simonwillison.net)
  • 2025-03-06   Mistral OCR | Mistral AI   (mistral.ai)
  • 2025-03-04   llm-ollama 0.9.0   (simonwillison.net)
  • 2025-02-26   Claude 3.7 Sonnet and Claude Code   (www.anthropic.com)
  • 2025-02-26   The Deep Research problem — Benedict Evans   (www.ben-evans.com)
  • 2025-02-24   5 Principles for Writing Effective Prompts (2025 Update)   (blog.tobiaszwingmann.com)
  • 2025-02-24   Greg Brockman shared this template for prompting   (www.linkedin.com)
  • 2025-02-21   LLM Leaderboard   (artificialanalysis.ai)
  • 2025-02-17   Here Are My Go-To AI Tools   (open.substack.com)
  • 2025-02-17   A Step-by-Step Guide to Setting Up a Custom BPE Tokenizer...   (www.marktechpost.com)
  • 2025-02-15   We Were Wrong About GPUs   (fly.io)
  • 2025-02-07   Using pip to install a Large Language Model that’s under ...   (simonwillison.net)
  • 2025-02-05   Understanding Reasoning LLMs   (sebastianraschka.com)
  • 2025-02-03   5 AI Agent Frameworks Compared - KDnuggets   (www.kdnuggets.com)
  • 2025-02-02   (WIP) A Little Bit of Reinforcement Learning from Human F...   (rlhfbook.com)
  • 2025-02-02   Creating an AI Agent-Based System with LangGraph: Adding ...   (www.marktechpost.com)
  • 2025-02-01   aidanmclaughlin/AidanBench: Aidan Bench attempts to measu...   (github.com)
  • 2025-01-31   OpenAI o3-mini, now available in LLM   (simonwillison.net)
  • 2025-01-29   Multi-Head Latent Attention and Other KV Cache Tricks   (www.pyspur.dev)
  • 2025-01-29   Qwen AI Introduces Qwen2.5-Max: A large MoE LLM Pretraine...   (www.marktechpost.com)
  • 2025-01-29   Alibaba releases AI model it says surpasses DeepSeek   (www.reuters.com)
  • 2025-01-28   On MLA   (planetbanatt.net)
  • 2025-01-27   The Illustrated DeepSeek-R1   (newsletter.languagemodels.co)
  • 2025-01-26   DeepSeek-R1 vs. OpenAI’s o1: A New Step in Open Source an...   (www.marktechpost.com)
  • 2025-01-25   AI hallucinations can’t be stopped — but these techniques...   (www.nature.com)
  • 2025-01-23   Noteworthy LLM Research Papers of 2024   (sebastianraschka.com)
  • 2025-01-23   LLM 0.20   (simonwillison.net)
  • 2025-01-23   How Chinese A.I. Start-Up DeepSeek Is Competing With Open...   (www.nytimes.com)
  • 2025-01-20   DeepSeek-R1 and exploring DeepSeek-R1-Distill-Llama-8B   (simonwillison.net)
  • 2025-01-18   Microsoft Presents a Comprehensive Framework for Securing...   (www.marktechpost.com)
  • 2025-01-18   Lessons From Red Teaming 100 Generative AI Products   (simonwillison.net)
  • 2025-01-18   Implementing A Byte Pair Encoding (BPE) Tokenizer From Sc...   (sebastianraschka.com)
  • 2025-01-17   This Rumor About GPT-5 Changes Everything   (open.substack.com)
  • 2025-01-14   The 2025 AI Engineering Reading List   (www.latent.space)
  • 2025-01-12   Agents   (huyenchip.com)
  • 2025-01-12   100 Must-Read Generative AI Papers from 2024   (open.substack.com)
  • 2025-01-09   7 Next-Generation Prompt Engineering Techniques - Machine...   (machinelearningmastery.com)
  • 2025-01-08   How to use NotebookLM for personalized knowledge synthesis   (open.substack.com)
  • 2025-01-07   An Opinionated Evals Reading List — Apollo Research   (www.apolloresearch.ai)
  • 2025-01-01   LLMS 2023-2024 (Williston) – Dropbox Paper   (www.dropbox.com)
  • 2024-12-31   Things we learned out about LLMs in 2024   (simonwillison.net)
  • 2024-12-30   How to Build a Graph RAG App   (towardsdatascience.com)
  • 2024-12-24   Gemini 2.0 Flash "Thinking Mode"   (open.substack.com)
  • 2024-12-22   LLM Research Papers: The 2024 List   (magazine.sebastianraschka.com)
  • 2024-12-22   Why AI language models choke on too much text   (arstechnica.com)
  • 2024-12-21   rasbt/LLMs-from-scratch: Implement a ChatGPT-like LLM in ...   (github.com)
  • 2024-12-21   Slim-Llama: An Energy-Efficient LLM ASIC Processor Suppor...   (www.marktechpost.com)
  • 2024-12-21   OpenAI Unveils o3 System That Reasons Through Math, Scien...   (www.nytimes.com)
  • 2024-12-19   Building effective agents \ Anthropic   (www.anthropic.com)
  • 2024-12-18   Blt patches scale better than tokens   (dl.fbaipublicfiles.com)
  • 2024-12-16   Meta AI Proposes Large Concept Models (LCMs): A Semantic ...   (www.marktechpost.com)
  • 2024-12-15   How LLMs Store and Use Knowledge? This AI Paper Introduce...   (www.marktechpost.com)
  • 2024-12-13   LangChain vs OpenAI API: When Simplicity Meets Scalabilit...   (blogs.adityabh.is-a.dev)
  • 2024-12-12   Transformers Key-Value (KV) Caching Explained   (towardsdatascience.com)
  • 2024-12-12   Scaling Laws – O1 Pro Architecture, Reasoning Training In...   (semianalysis.com)
  • 2024-12-11   The AI Researchers Pushing Computers to Launch Nightmare ...   (www.wsj.com)
  • 2024-12-09   What are Hallucinations in LLMs and 6 Effective Strategie...   (www.marktechpost.com)
  • 2024-12-07   Countless.dev | AI Model Comparison   (countless.dev)
  • 2024-12-07   CPU-GPU I/O-Aware LLM Inference Reduces Latency in GPUs b...   (www.marktechpost.com)
  • 2024-12-05   How to Build a General-Purpose LLM Agent   (towardsdatascience.com)
  • 2024-12-05   Treemap   (aiworld.eu)
  • 2024-12-05   AI Hallucinations: Why Large Language Models Make Things ...   (www.kapa.ai)
  • 2024-11-29   llama.cpp guide - Running LLMs locally, on any hardware, ...   (steelph0enix.github.io)
  • 2024-11-28   Four Cutting-Edge Methods for Evaluating AI Agents and En...   (www.marktechpost.com)
  • 2024-11-26   eugeneyan/llm-paper-notes: Notes from the Latent Space pa...   (github.com)
  • 2024-11-21   Understanding Multimodal LLMs   (magazine.sebastianraschka.com)
  • 2024-11-17   Something weird is happening with LLMs and chess   (open.substack.com)
  • 2024-11-11   Analyzing the homerun year for LLMs: the top-100 most cit...   (www.zeta-alpha.com)
  • 2024-10-31   LLM Chunking, Indexing, Scoring and Agents, in a Nutshell...   (www.datasciencecentral.com)
  • 2024-10-28   Developing a computer use model   (www.anthropic.com)
  • 2024-10-19   5 LLM Tools I Can’t Live Without   (www.kdnuggets.com)
  • 2024-10-19   Claude: Everything you need to know about Anthropic's AI ...   (techcrunch.com)
  • 2024-10-17   Nvidia just dropped a new AI model that crushes OpenAI’s ...   (venturebeat.com)
  • 2024-08-04   dpo-from-scratch.ipynb   (github.com)
  • 2024-08-04   What We Learned from a Year of Building with LLMs (Part I)   (www.oreilly.com)
  • 2024-08-01   Towards Monosemanticity: A step towards understanding lar...   (towardsdatascience.com)
  • 2024-07-24   Meta unleashes its most powerful AI model, Llama 3.1, wit...   (venturebeat.com)
  • 2024-07-24   Customize Generative AI Models for Enterprise Application...   (developer.nvidia.com)
  • 2024-07-24   Llama 3.1 Released: Meta’s New Open-Source AI Model that ...   (www.marktechpost.com)
  • 2024-07-24   Meta Llama 3.1 405b is outperforming private models with ...   (dataconomy.com)
  • 2024-07-20   Understanding Positional Embeddings in Transformers: From...   (towardsdatascience.com)
  • 2024-07-15   Claude 3.5 Sonnet   (www.anthropic.com)
  • 2024-07-13   Do large language models understand the world?   (www.amazon.science)
  • 2024-07-04   Building an LLM Router for High-Quality and Cost-Effectiv...   (www.anyscale.com)
  • 2024-07-03   From bare metal to a 70B model: infrastructure set-up and...   (imbue.com)
  • 2024-07-02   StarCoder2-15B: A Powerful LLM for Code Generation, Summa...   (nvda.ws)
  • 2024-06-27   How Gradient created an open LLM with a million-token con...   (venturebeat.com)
  • 2024-06-22   Some Commonly Used Advanced Prompt Engineering Techniques...   (www.marktechpost.com)
  • 2024-06-20   Key Metrics for Evaluating Large Language Models (LLMs)   (www.marktechpost.com)
  • 2024-06-20   Firecrawl: A Powerful Web Scraping Tool for Turning Websi...   (www.marktechpost.com)
  • 2024-06-19   Let's reproduce GPT-2 (124M)   (m.youtube.com)
  • 2024-06-19   How to use an open source LLM model locally and remotely   (thoughtbot.com)
  • 2024-06-12   “The” Midjourney model personalization guide   (dataconomy.com)
  • 2024-06-12   How to use Perplexity in your PM work   (www.lennysnewsletter.com)
  • 2024-06-11   [2406.01506] The Geometry of Categorical and Hierarchical...   (arxiv.org)
  • 2024-06-11   What We Learned from a Year of Building with LLMs (Part II)   (www.oreilly.com)
  • 2024-06-11   Sharpening LLMs: The Sharpest Tools and Essential Techniq...   (www.marktechpost.com)
  • 2024-06-11   List of Activities and Their Corresponding Suitable LLMs ...   (www.marktechpost.com)
  • 2024-06-11   Three Things to Know About Prompting LLMs   (sloanreview.mit.edu)
  • 2024-05-31   Perplexity goes beyond AI search, launches publishing pla...   (venturebeat.com)
  • 2024-05-28   The Great AI Chatbot Challenge: ChatGPT vs. Gemini vs. Co...   (www.wsj.com)
  • 2024-05-26   The future of foundation models is closed-source   (www.thediff.co)
  • 2024-05-24   Demystifying Vision-Language Models: An In-Depth Exploration   (www.marktechpost.com)
  • 2024-05-22   AI Is a Black Box. Anthropic Figured Out a Way to Look In...   (www.wired.com)
  • 2024-05-21   naklecha/llama3-from-scratch   (github.com)
  • 2024-05-21   Abacus AI Releases Smaug-Llama-3-70B-Instruct: The New Be...   (www.marktechpost.com)
  • 2024-05-13   Do Enormous LLM Context Windows Spell the End of RAG?   (thenewstack.io)
  • 2024-05-13   How Good Are the Latest Open LLMs? And Is DPO Better Than...   (sebastianraschka.com)
  • 2024-05-12   ChuXin: A Fully Open-Sourced Language Model with a Size o...   (www.marktechpost.com)
  • 2024-05-11   Title:You Only Cache Once: Decoder-Decoder Architectures ...   (arxiv.org)
  • 2024-05-11   Anthropic AI Launches a Prompt Engineering Tool that Gene...   (www.marktechpost.com)
  • 2024-05-11   Cleaning   (docs.unstructured.io)
  • 2024-05-08   [2404.19737] Better & Faster Large Language Models via Mu...   (arxiv.org)
  • 2024-05-07   Researchers at NVIDIA AI Introduce ‘VILA’: A Vision Langu...   (www.marktechpost.com)
  • 2024-05-05   Hugging Face - Documentation   (huggingface.co)
  • 2024-04-25   Understanding Key Terminologies in Large Language Model (...   (www.marktechpost.com)
  • 2024-04-25   Top 15 AI Libraries/Frameworks for Automatically Red-Team...   (www.marktechpost.com)
  • 2024-04-19   Meta says Llama 3 beats most other models, including Gemi...   (www.theverge.com)
  • 2024-04-17   anthropics/anthropic-cookbook: A collection of notebooks/...   (github.com)
  • 2024-04-15   Deep Learning Architectures From CNN, RNN, GAN, and Trans...   (www.marktechpost.com)
  • 2024-04-15   Tips for LLM Pretraining and Evaluating Reward Models   (magazine.sebastianraschka.com)
  • 2024-04-14   Lessons after a half-billion GPT tokens - Ken Kantzer's Blog   (kenkantzer.com)
  • 2024-04-13   5 Ways To Use LLMs On Your Laptop   (www.kdnuggets.com)
  • 2024-04-13   Words are flowing out like endless rain: Recapping a busy...   (arstechnica.com)
  • 2024-04-12   Gemini: A Family of Highly Capable Multimodal Models   (dev.to)
  • 2024-04-10   Peter Gostev’s Post   (www.linkedin.com)
  • 2024-04-05   Detecting Hallucinations in Large Language Models with Te...   (dev.to)
  • 2024-04-05   Top Open Source Large Language Models (LLMs) Available Fo...   (www.marktechpost.com)
  • 2024-04-02   LLaMA Now Goes Faster on CPUs   (justine.lol)
  • 2024-04-02   Large language models use a surprisingly simple mechanism...   (news.mit.edu)
  • 2024-04-02   Introducing DBRX: A New State-of-the-Art Open LLM   (www.databricks.com)
  • 2024-04-01   ChatGPT vs Perplexity AI: AI App Comparison   (www.marktechpost.com)
  • 2024-03-30   Mamba Explained   (thegradient.pub)
  • 2024-03-29   How Nvidia Blackwell Systems Attack 1 Trillion Parameter ...   (www.nextplatform.com)
  • 2024-03-29   How Chain-of-Thought Reasoning Helps Neural Networks Compute   (www.quantamagazine.org)
  • 2024-03-11   Why and How to Achieve Longer Context Windows for LLMs   (towardsdatascience.com)
  • 2024-03-11   Generative AI Design Patterns: A Comprehensive Guide | by...   (towardsdatascience.com)
  • 2024-03-11   You can now train a 70b language model at home   (www.answer.ai)
  • 2024-03-11   Easily Train a Specialized LLM: PEFT, LoRA, QLoRA, LLaMA-...   (towardsdatascience.com)
  • 2024-03-07   Google Bard is called Gemini now and expands to mobile, p...   (www.axios.com)
  • 2024-03-05   Anthropic’s Post   (www.linkedin.com)
  • 2024-03-05   OpenAI's ChatGPT may have its first true rival in Anthrop...   (qz.com)
  • 2024-02-29   rasbt/LLMs-from-scratch   (github.com)
  • 2024-02-29   Meet RAGxplorer: An interactive AI Tool to Support the Bu...   (www.marktechpost.com)
  • 2024-02-29   Meet Google Lumiere AI, Bard’s video maker cousin   (dataconomy.com)
  • 2024-02-29   How To Build an LLM-Powered App To Chat with PapersWithCode   (towardsdatascience.com)
  • 2024-02-29   The killer app of Gemini Pro 1.5 is video   (simonwillison.net)
  • 2024-02-29   Understanding Direct Preference Optimization   (towardsdatascience.com)
  • 2024-02-29   I Spent a Week With Gemini Pro 1.5—It’s Fantastic   (every.to)
  • 2024-02-29   Title:The Era of 1-bit LLMs: All Large Language Models ar...   (arxiv.org)
  • 2024-02-29   Sora early access: Your guide to securing a spot   (dataconomy.com)
  • 2024-02-29   Au Large | Mistral AI | Frontier AI in your hands   (mistral.ai)
  • 2024-02-22   Claude   (claude.ai)
  • 2024-02-22   Beyond Self-Attention: How a Small Language Model Predict...   (shyam.blog)
  • 2024-02-22   How do transformers work?+Design a Multi-class Sentiment ...   (open.substack.com)
  • 2024-02-22   1708022141659 (JPEG Image, 1280 × 1600 pixels) ...   (media.licdn.com)
  • 2024-02-22   Groq Inference Tokenomics: Speed, But At What Cost?   (www.semianalysis.com)
  • 2024-02-20   How Well Can LLMs Negotiate? Stanford Researchers Develop...   (www.marktechpost.com)
  • 2024-02-17   Sora   (openai.com)
  • 2024-02-15   Code LoRA from Scratch - a Lightning Studio by sebastian   (lightning.ai)
  • 2024-02-15   Bard is now Gemini and Gemini Advanced is amazing   (dataconomy.com)
  • 2024-02-11   Ask HN: What have you built with LLMs?   (news.ycombinator.com)
  • 2024-02-04   Title:BloombergGPT: A Large Language Model for Finance   (arxiv.org)
  • 2024-01-24   Exploring the Zephyr 7B: A Comprehensive Guide to the Lat...   (www.kdnuggets.com)
  • 2024-01-17   Mastering PDFs: Extracting Sections, Headings, Paragraphs...   (blog.llamaindex.ai)
  • 2024-01-16   Understanding and Coding Self-Attention, Multi-Head Atten...   (magazine.sebastianraschka.com)
  • 2024-01-16   Dashboard - SciSummary   (scisummary.com)
  • 2024-01-07   Meet Waymo’s MotionLM: The State-of-the-Art Multi-Agent M...   (www.marktechpost.com)
  • 2024-01-07   How much detail is too much? Midjourney v6 attempts to fi...   (arstechnica.com)
  • 2024-01-07   10 Noteworthy AI Research Papers of 2023   (magazine.sebastianraschka.com)
  • 2023-10-20   7 Steps to Mastering Large Language Models (LLMs)   (www.kdnuggets.com)
  • 2023-10-20   Meta AI Researchers Propose Advanced Long-Context LLMs: A...   (www.marktechpost.com)
  • 2023-10-20   This AI Paper from NVIDIA Explores the Power of Retrieval...   (www.marktechpost.com)
  • 2023-10-20   Finetuning LLMs with LoRA and QLoRA: Insights from Hundre...   (lightning.ai)
  • 2023-10-20   Getting Started with Large Language Models: Key Things to...   (flyte.org)
  • 2023-10-20   Unlocking GPT-4 Summarization with Chain of Density Promp...   (www.kdnuggets.com)
  • 2023-10-20   The Ins and Outs of Retrieval-Augmented Generation (RAG)   (towardsdatascience.com)
  • 2023-10-20   Building RAG-based LLM Applications for Production (Part 1)   (www.anyscale.com)
  • 2023-10-20   RAG vs Finetuning: Which Is the Best Tool to Boost Your L...   (towardsdatascience.com)
  • 2023-10-20   A High-Level Overview Of Large Language Model Concepts, U...   (smashingmagazine.com)
  • 2023-10-20   Augmenting LLMs with RAG   (towardsdatascience.com)
  • 2023-10-07   Parallel Processing in Prompt Engineering: The Skeleton-o...   (www.kdnuggets.com)
  • 2023-10-05   [2302.07730] Transformer models: an introduction and catalog   (arxiv.org)
  • 2023-10-04   Hey, Computer, Make Me a Font   (serce.me)
  • 2023-10-04   SaaS Competitive Advantage Through Elegant LLM Feedback M...   (www.tomtunguz.com)
  • 2023-10-03   2302.11382.pdf   (arxiv.org)
  • 2023-10-03   ChatGPT, Bard, or Bing Chat? Differences Among 3 Generati...   (www.nngroup.com)
  • 2023-10-03   Bard   (bard.google.com)
  • 2023-10-03   The State of Large Language Models   (www.scientificamerican.com)
  • 2023-09-25   10 Ways to Improve the Performance of Retrieval Augmented...   (towardsdatascience.com)
  • 2023-09-25   How to Build an LLM from Scratch   (towardsdatascience.com)
  • 2023-09-25   Large Language Model Prompt Engineering for Complex Summa...   (devblogs.microsoft.com)
  • 2023-09-25   Open LLM Leaderboard : a Hugging Face Space by HuggingFaceH4   (huggingface.co)
  • 2023-09-25   Llama from scratch   (blog.briankitano.com)
  • 2023-09-25   Cracking Open the OpenAI (Python) API   (towardsdatascience.com)
  • 2023-09-25   Cracking Open the Hugging Face Transformers Library   (towardsdatascience.com)
  • 2023-09-25   Asking 60+ LLMs a set of 20 questions   (benchmarks.llmonitor.com)
  • 2023-09-24   OpenAI Unveils DALL·E 3: A Revolutionary Leap in Text-to-...   (www.marktechpost.com)
  • 2023-09-24   Comparison: DALL-E 3 vs Midjourney   (dataconomy.com)
  • 2023-09-17   What OpenAI Really Wants   (www.wired.com)
  • 2023-09-12   A Beginner’s Guide to Building LLM-Powered Applications w...   (dev.to)
  • 2023-08-31   iryna-kondr/scikit-llm: Seamlessly integrate LLMs into sc...   (github.com)
  • 2023-08-31   Prompt Engineering — How to trick AI into solving your pr...   (towardsdatascience.com)
  • 2023-08-30   A Beginner’s Guide to LLM Fine-Tuning   (towardsdatascience.com)
  • 2023-08-27   Together AI Unveils Llama-2-7B-32K-Instruct: A Breakthrou...   (www.marktechpost.com)
  • 2023-08-25   A Practical Introduction to LLMs   (towardsdatascience.com)
  • 2023-08-20   Meet Chroma: An AI-Native Open-Source Vector Database For...   (www.marktechpost.com)
  • 2023-08-07   How to Extract Text from Any PDF and Image for Large Lang...   (towardsdatascience.com)
  • 2023-08-07   Introducing OpenLLM: Open Source Library for LLMs   (www.kdnuggets.com)
  • 2023-08-07   Abacus AI Introduces A New Open Long-Context Large Langua...   (www.marktechpost.com)
  • 2023-08-06   How to use LLMs for PDF parsing   (nanonets.com)
  • 2023-08-06   How to Chat With Any File from PDFs to Images Using Large...   (towardsdatascience.com)
  • 2023-08-06   How to Leverage Open-Source LLMs in Your Project   (www.turingpost.com)
  • 2023-08-02   LangChain 101: Build Your Own GPT-Powered Applications   (www.kdnuggets.com)
  • 2023-07-28   MPT-30B: Raising the bar for open-source foundation models   (www.mosaicml.com)
  • 2023-07-28   Midjourney pricing plans and free alternatives to try   (dataconomy.com)
  • 2023-07-28   A Deep Dive Into LLaMA, Falcon, Llama 2 and Their Remarka...   (www.turingpost.com)
  • 2023-07-28   Chain of Thought Prompting for LLMs   (towardsdatascience.com)
  • 2023-07-28   Is Anthropic's Claude 2 model ready to take down GPT-4? W...   (dev.to)
  • 2023-07-24   Emerging Architectures for LLM Applications   (a16z.com)
  • 2023-07-24   ELI5: FlashAttention   (gordicaleksa.medium.com)
  • 2023-07-24   Build Industry-Specific LLMs Using Retrieval Augmented Ge...   (towardsdatascience.com)
  • 2023-07-24   Free Full Stack LLM Bootcamp   (www.kdnuggets.com)
  • 2023-07-24   Edge 300: Meet Falcon LLM: The Most Powerful Open Source ...   (thesequence.substack.com)
  • 2023-07-23   The Secret Sauce behind 100K context window in LLMs: all ...   (blog.gopenai.com)
  • 2023-07-23   Observe.ai unveils 30-billion-parameter contact center LL...   (venturebeat.com)
  • 2023-07-23   All You Need to Know to Build Your First LLM App   (towardsdatascience.com)
  • 2023-07-23   Training LLMs with AMD MI250 GPUs and MosaicML   (www.mosaicml.com)
  • 2023-07-23   Optimizing Memory Usage for Training LLMs and Vision Tran...   (lightning.ai)
  • 2023-07-23   Deploying Falcon-7B Into Production   (towardsdatascience.com)
  • 2023-07-23   Anthropic releases Claude 2, its second-gen AI chatbot   (techcrunch.com)
  • 2023-07-23   Google Launches AI-Powered Notes App Called NotebookLM   (tech.slashdot.org)
  • 2023-07-23   Ecosystem Graphs for Foundation Models   (crfm.stanford.edu)
  • 2023-07-23   Meet LMQL: An Open Source Query Language for LLMs   (thesequence.substack.com)
  • 2023-07-23   Leandro von Werra’s Post   (www.linkedin.com)
  • 2023-07-23   LLaMA 2: How to access and use Meta’s versatile open-sour...   (venturebeat.com)
  • 2023-07-22   Beyond LLaMA: The Power of Open LLMs   (towardsdatascience.com)
  • 2023-07-22   Facebook parent Meta unveils LLaMA 2 open-source AI model...   (venturebeat.com)
  • 2023-07-22   MosaicML launches MPT-7B-8K, a 7B-parameter open-source L...   (venturebeat.com)
  • 2023-07-22   The $1 billion gamble to ensure AI doesn’t destroy humanity   (www.thediff.co)
  • 2023-07-12   Unraveling the Power of Chain-of-Thought Prompting in Lar...   (www.kdnuggets.com)
  • 2023-07-12   GitHub - Mooler0410/LLMsPracticalGuide: A curated list of...   (github.com)
  • 2023-06-19   Introduction to the Open LLM Falcon-40B: Performance, Tra...   (towardsdatascience.com)
  • 2023-06-19   Falcon LLM: The New King of Open-Source LLMs   (www.kdnuggets.com)
  • 2023-06-18   Meet FinGPT: An Open-Source Financial Large Language Mode...   (www-marktechpost-com.cdn.ampproject.org)
  • 2023-06-09   LMM Garden | Discover, search, and compare LLMs   (llm.garden)
  • 2023-06-08   iryna-kondr/scikit-llm   (github.com)
  • 2023-06-02   The Case for Running AI on CPUs Isn’t Dead Yet   (spectrum.ieee.org)
  • 2023-05-28   The Art of Prompt Design: Prompt Boundaries and Token Hea...   (towardsdatascience.com)
  • 2023-05-21   Sonali Pattnaik on LinkedIn: #generativeai #ai | 45 comments   (www.linkedin.com)
  • 2023-05-19   The Non-Silence of the LLMs   (informationisbeautiful.net)
  • 2023-05-19   Super Bard: The AI That Can Do It All and Better   (www.kdnuggets.com)
  • 2023-05-18   Edge 291: Reinforcement Learning with Human Feedback   (thesequence.substack.com)
  • 2023-05-12   Google dives into the ‘supercomputer’ game by knitting to...   (venturebeat.com)
  • 2023-05-05   Distilling Step-by-Step! Outperforming Larger Language Mo...   (arxiv.org)
  • 2023-05-05   SparseGPT: Massive Language Models Can Be Accurately Prun...   (arxiv.org)
  • 2023-05-05   openlm-research/open_llama: OpenLLaMA, a permissively lic...   (github.com)
  • 2023-05-03   guidance-ai/guidance: A guidance language for controlling...   (github.com)
  • 2023-04-29   Blog | Anyscale   (www.anyscale.com)
  • 2023-04-29   Parameter-Efficient LLM Finetuning With Low-Rank Adaptati...   (sebastianraschka.com)
  • 2023-04-29   Edge 286: Vicuna, the LLaMA-Based Model that Matches Chat...   (thesequence.substack.com)
  • 2023-04-26   Grounding Large Language Models in a Cognitive Foundation...   (thegradient.pub)
  • 2023-04-25   Data Machina #198   (datamachina.substack.com)
  • 2023-04-25   Finetuning Large Language Models   (magazine.sebastianraschka.com)
  • 2023-04-21   The LLama Effect: How an Accidental Leak Sparked a Series...   (thesequence.substack.com)
  • 2023-04-21   Stanford CRFM   (crfm.stanford.edu)
  • 2023-04-21   Meta has built a massive new language AI—and it’s giving ...   (www.technologyreview.com)
  • 2023-04-21   Eight Things to Know about Large Language Models   (arxiv.org)
  • 2023-04-19   Baby AGI: The Birth of a Fully Autonomous AI   (www.kdnuggets.com)
  • 2023-04-19   Hacker News   (magazine.sebastianraschka.com)
  • 2023-04-17   📝 Guest Post: How to Enhance the Usefulness of Large Lang...   (thesequence.substack.com)
  • 2023-04-14   Prompt Engineering   (lilianweng.github.io)
  • 2023-04-14   A Survey of Large Language Models   (arxiv.org)
  • 2023-04-14   New Ebook: A Beginner’s Guide to Large Language Models   (www.nvidia.com)
  • 2023-04-13   Maximizing the Potential of LLMs: A Guide to Prompt Engin...   (www.ruxu.dev)
  • 2023-04-13   The Magic of LLMs — Prompt Engineering   (towardsdatascience.com)
  • 2023-04-12   📝 Guest Post: Caching LLM Queries for Improved Performanc...   (thesequence.substack.com)
  • 2023-02-10   OpenAI Platform   (platform.openai.com)
  • 2014-09-24   Graphiti: A Python Library for Building Temporal Knowledg...   (www.marktechpost.com)
  • 2014-09-24   Top 9 Different Types of Retrieval-Augmented Generation (...   (www.marktechpost.com)
  • 2014-09-24   FlashSigmoid: A Hardware-Aware and Memory-Efficient Imple...   (www.marktechpost.com)
  • 2014-08-24   Building a Simple RAG Application Using LlamaIndex - Mach...   (machinelearningmastery.com)
  • 2009-09-24   LlamaIndex : LlamaIndex   (docs.llamaindex.ai)
  • 2003-09-24   Why GPU Utilization Falls Short: Understanding Streaming ...   (www.marktechpost.com)
  • 2002-10-24   Nvidia just dropped a bombshell: Its new AI model is open...   (venturebeat.com)
  • 2002-10-24   LightLLM: A Lightweight Scalable and High-Speed Python Fr...   (www.marktechpost.com)
  • 2001-10-24   Ten Effective Strategies to Lower Large Language Model (L...   (www.marktechpost.com)