Perfectly Awesome
Posts by Date
Topics (tags)
Posts by Title
my Repos
About
large language models (LLMs) (all)
Also see: streaming, youtube, video
2025-04-20
To Make Language Models Work Better, Researchers Sidestep...
(www.quantamagazine.org)
2025-04-18
OpenAI Releases a Practical Guide to Building LLM Agents ...
(www.marktechpost.com)
2025-04-17
LLM Post-Training: A Deep Dive into Reasoning Large Langu...
(arxiv.org)
2025-04-16
How To Build An Agent | Amp
(ampcode.com)
2025-04-13
humanlayer/12-factor-agents
(github.com)
2025-04-11
12-factor-agents: Principles to build LLM-powered softwar...
(lobste.rs)
2025-04-08
The “S” in MCP Stands for Security - Elena Cross - Medium
(elenacross7.medium.com)
2025-04-07
Topic 33: Slim Attention, KArAt, XAttention and Multi-Tok...
(huggingface.co)
2025-04-06
The Llama 4 herd: The beginning of a new era of natively ...
(ai.meta.com)
2025-04-06
Model Context Protocol (MCP) an overview
(www.philschmid.de)
2025-04-06
Use MCP servers in VS Code (Preview)
(code.visualstudio.com)
2025-04-05
A Code Implementation to Building a Context-Aware AI Assi...
(www.marktechpost.com)
2025-04-02
LLM Benchmarking: Fundamental Concepts | NVIDIA Technical...
(developer.nvidia.com)
2025-04-02
A Comprehensive Guide to LLM Routing: Tools and Frameworks
(www.marktechpost.com)
2025-03-26
10 Must-Know Python Libraries for LLMs in 2025
(machinelearningmastery.com)
2025-03-25
Introducing 4o Image Generation
(openai.com)
2025-03-25
What is the hallucination index?
(dataconomy.com)
2025-03-23
Quickstart | Mistral AI Large Language Models
(docs.mistral.ai)
2025-03-22
Improving Recommender Systems & Search in the Age of LLMs
(eugeneyan.com)
2025-03-18
Mistral Small 3.1 runs on a MacBook and beats giants - Da...
(dataconomy.com)
2025-03-16
https://www.r-bloggers.com/2025/03/the-ellmer-package-for...
(www.r-bloggers.com)
2025-03-13
What is catastrophic forgetting? - Dataconomy
(dataconomy.com)
2025-03-13
Top 7 Open-Source LLMs in 2025 - KDnuggets
(www.kdnuggets.com)
2025-03-12
What are model cards? - Dataconomy
(dataconomy.com)
2025-03-11
How I use LLMs to help me write code
(open.substack.com)
2025-03-08
The State of LLM Reasoning Models
(open.substack.com)
2025-03-06
Mistral OCR | Mistral AI
(mistral.ai)
2025-02-26
Claude 3.7 Sonnet and Claude Code
(www.anthropic.com)
2025-02-26
The Deep Research problem — Benedict Evans
(www.ben-evans.com)
2025-02-24
5 Principles for Writing Effective Prompts (2025 Update)
(blog.tobiaszwingmann.com)
2025-02-24
Greg Brockman shared this template for prompting
(www.linkedin.com)
2025-02-21
LLM Leaderboard
(artificialanalysis.ai)
2025-02-17
Here Are My Go-To AI Tools
(open.substack.com)
2025-02-17
A Step-by-Step Guide to Setting Up a Custom BPE Tokenizer...
(www.marktechpost.com)
2025-02-15
We Were Wrong About GPUs
(fly.io)
2025-02-03
5 AI Agent Frameworks Compared - KDnuggets
(www.kdnuggets.com)
2025-02-02
Creating an AI Agent-Based System with LangGraph: Adding ...
(www.marktechpost.com)
2025-02-01
aidanmclaughlin/AidanBench: Aidan Bench attempts to measu...
(github.com)
2025-01-29
Multi-Head Latent Attention and Other KV Cache Tricks
(www.pyspur.dev)
2025-01-29
Qwen AI Introduces Qwen2.5-Max: A large MoE LLM Pretraine...
(www.marktechpost.com)
2025-01-28
On MLA
(planetbanatt.net)
2025-01-27
The Illustrated DeepSeek-R1
(newsletter.languagemodels.co)
2025-01-26
DeepSeek-R1 vs. OpenAI’s o1: A New Step in Open Source an...
(www.marktechpost.com)
2025-01-25
AI hallucinations can’t be stopped — but these techniques...
(www.nature.com)
2025-01-23
How Chinese A.I. Start-Up DeepSeek Is Competing With Open...
(www.nytimes.com)
2025-01-18
Microsoft Presents a Comprehensive Framework for Securing...
(www.marktechpost.com)
2025-01-17
This Rumor About GPT-5 Changes Everything
(open.substack.com)
2025-01-14
The 2025 AI Engineering Reading List
(www.latent.space)
2025-01-12
Agents
(huyenchip.com)
2025-01-12
100 Must-Read Generative AI Papers from 2024
(open.substack.com)
2025-01-09
7 Next-Generation Prompt Engineering Techniques - Machine...
(machinelearningmastery.com)
2025-01-08
How to use NotebookLM for personalized knowledge synthesis
(open.substack.com)
2025-01-07
An Opinionated Evals Reading List — Apollo Research
(www.apolloresearch.ai)
2024-12-24
Gemini 2.0 Flash "Thinking Mode"
(open.substack.com)
2024-12-22
LLM Research Papers: The 2024 List
(magazine.sebastianraschka.com)
2024-12-22
Why AI language models choke on too much text
(arstechnica.com)
2024-12-21
rasbt/LLMs-from-scratch: Implement a ChatGPT-like LLM in ...
(github.com)
2024-12-21
Slim-Llama: An Energy-Efficient LLM ASIC Processor Suppor...
(www.marktechpost.com)
2024-12-21
OpenAI Unveils o3 System That Reasons Through Math, Scien...
(www.nytimes.com)
2024-12-19
Building effective agents \ Anthropic
(www.anthropic.com)
2024-12-18
Blt patches scale better than tokens
(dl.fbaipublicfiles.com)
2024-12-16
Meta AI Proposes Large Concept Models (LCMs): A Semantic ...
(www.marktechpost.com)
2024-12-15
How LLMs Store and Use Knowledge? This AI Paper Introduce...
(www.marktechpost.com)
2024-12-13
LangChain vs OpenAI API: When Simplicity Meets Scalabilit...
(blogs.adityabh.is-a.dev)
2024-12-09
What are Hallucinations in LLMs and 6 Effective Strategie...
(www.marktechpost.com)
2024-12-07
Countless.dev | AI Model Comparison
(countless.dev)
2024-12-07
CPU-GPU I/O-Aware LLM Inference Reduces Latency in GPUs b...
(www.marktechpost.com)
2024-12-05
Treemap
(aiworld.eu)
2024-12-05
AI Hallucinations: Why Large Language Models Make Things ...
(www.kapa.ai)
2024-11-28
Four Cutting-Edge Methods for Evaluating AI Agents and En...
(www.marktechpost.com)
2024-11-26
eugeneyan/llm-paper-notes: Notes from the Latent Space pa...
(github.com)
2024-11-21
Understanding Multimodal LLMs
(magazine.sebastianraschka.com)
2024-11-17
Something weird is happening with LLMs and chess
(open.substack.com)
2024-10-31
LLM Chunking, Indexing, Scoring and Agents, in a Nutshell...
(www.datasciencecentral.com)
2024-10-28
Developing a computer use model
(www.anthropic.com)
2024-10-19
5 LLM Tools I Can’t Live Without
(www.kdnuggets.com)
2024-08-04
dpo-from-scratch.ipynb
(github.com)
2024-08-04
What We Learned from a Year of Building with LLMs (Part I)
(www.oreilly.com)
2024-07-24
Customize Generative AI Models for Enterprise Application...
(developer.nvidia.com)
2024-07-24
Llama 3.1 Released: Meta’s New Open-Source AI Model that ...
(www.marktechpost.com)
2024-07-24
Meta Llama 3.1 405b is outperforming private models with ...
(dataconomy.com)
2024-07-15
Claude 3.5 Sonnet
(www.anthropic.com)
2024-07-13
Do large language models understand the world?
(www.amazon.science)
2024-07-04
Building an LLM Router for High-Quality and Cost-Effectiv...
(www.anyscale.com)
2024-07-03
From bare metal to a 70B model: infrastructure set-up and...
(imbue.com)
2024-07-02
StarCoder2-15B: A Powerful LLM for Code Generation, Summa...
(nvda.ws)
2024-06-22
Some Commonly Used Advanced Prompt Engineering Techniques...
(www.marktechpost.com)
2024-06-20
Key Metrics for Evaluating Large Language Models (LLMs)
(www.marktechpost.com)
2024-06-20
Firecrawl: A Powerful Web Scraping Tool for Turning Websi...
(www.marktechpost.com)
2024-06-19
Let's reproduce GPT-2 (124M)
(m.youtube.com)
2024-06-12
“The” Midjourney model personalization guide
(dataconomy.com)
2024-06-12
How to use Perplexity in your PM work
(www.lennysnewsletter.com)
2024-06-11
[2406.01506] The Geometry of Categorical and Hierarchical...
(arxiv.org)
2024-06-11
What We Learned from a Year of Building with LLMs (Part II)
(www.oreilly.com)
2024-06-11
Sharpening LLMs: The Sharpest Tools and Essential Techniq...
(www.marktechpost.com)
2024-06-11
List of Activities and Their Corresponding Suitable LLMs ...
(www.marktechpost.com)
2024-05-24
Demystifying Vision-Language Models: An In-Depth Exploration
(www.marktechpost.com)
2024-05-21
naklecha/llama3-from-scratch
(github.com)
2024-05-21
Abacus AI Releases Smaug-Llama-3-70B-Instruct: The New Be...
(www.marktechpost.com)
2024-05-12
ChuXin: A Fully Open-Sourced Language Model with a Size o...
(www.marktechpost.com)
2024-05-11
Title:You Only Cache Once: Decoder-Decoder Architectures ...
(arxiv.org)
2024-05-11
Anthropic AI Launches a Prompt Engineering Tool that Gene...
(www.marktechpost.com)
2024-05-11
Cleaning
(docs.unstructured.io)
2024-05-08
[2404.19737] Better & Faster Large Language Models via Mu...
(arxiv.org)
2024-05-07
Researchers at NVIDIA AI Introduce ‘VILA’: A Vision Langu...
(www.marktechpost.com)
2024-05-05
Hugging Face - Documentation
(huggingface.co)
2024-04-25
Understanding Key Terminologies in Large Language Model (...
(www.marktechpost.com)
2024-04-25
Top 15 AI Libraries/Frameworks for Automatically Red-Team...
(www.marktechpost.com)
2024-04-17
anthropics/anthropic-cookbook: A collection of notebooks/...
(github.com)
2024-04-15
Deep Learning Architectures From CNN, RNN, GAN, and Trans...
(www.marktechpost.com)
2024-04-15
Tips for LLM Pretraining and Evaluating Reward Models
(magazine.sebastianraschka.com)
2024-04-14
Lessons after a half-billion GPT tokens - Ken Kantzer's Blog
(kenkantzer.com)
2024-04-13
5 Ways To Use LLMs On Your Laptop
(www.kdnuggets.com)
2024-04-13
Words are flowing out like endless rain: Recapping a busy...
(arstechnica.com)
2024-04-12
Gemini: A Family of Highly Capable Multimodal Models
(dev.to)
2024-04-10
Peter Gostev’s Post
(www.linkedin.com)
2024-04-05
Detecting Hallucinations in Large Language Models with Te...
(dev.to)
2024-04-05
Top Open Source Large Language Models (LLMs) Available Fo...
(www.marktechpost.com)
2024-04-02
LLaMA Now Goes Faster on CPUs
(justine.lol)
2024-04-02
Large language models use a surprisingly simple mechanism...
(news.mit.edu)
2024-04-02
Introducing DBRX: A New State-of-the-Art Open LLM
(www.databricks.com)
2024-04-01
ChatGPT vs Perplexity AI: AI App Comparison
(www.marktechpost.com)
2024-03-29
How Nvidia Blackwell Systems Attack 1 Trillion Parameter ...
(www.nextplatform.com)
2024-03-29
How Chain-of-Thought Reasoning Helps Neural Networks Compute
(www.quantamagazine.org)
2024-03-11
You can now train a 70b language model at home
(www.answer.ai)
2024-03-07
Google Bard is called Gemini now and expands to mobile, p...
(www.axios.com)
2024-03-05
Anthropic’s Post
(www.linkedin.com)
2024-03-05
OpenAI's ChatGPT may have its first true rival in Anthrop...
(qz.com)
2024-02-29
rasbt/LLMs-from-scratch
(github.com)
2024-02-29
Meet RAGxplorer: An interactive AI Tool to Support the Bu...
(www.marktechpost.com)
2024-02-29
Meet Google Lumiere AI, Bard’s video maker cousin
(dataconomy.com)
2024-02-29
I Spent a Week With Gemini Pro 1.5—It’s Fantastic
(every.to)
2024-02-29
Title:The Era of 1-bit LLMs: All Large Language Models ar...
(arxiv.org)
2024-02-29
Sora early access: Your guide to securing a spot
(dataconomy.com)
2024-02-29
Au Large | Mistral AI | Frontier AI in your hands
(mistral.ai)
2024-02-22
Claude
(claude.ai)
2024-02-22
How do transformers work?+Design a Multi-class Sentiment ...
(open.substack.com)
2024-02-22
1708022141659 (JPEG Image, 1280 × 1600 pixels) ...
(media.licdn.com)
2024-02-20
How Well Can LLMs Negotiate? Stanford Researchers Develop...
(www.marktechpost.com)
2024-02-17
Sora
(openai.com)
2024-02-15
Code LoRA from Scratch - a Lightning Studio by sebastian
(lightning.ai)
2024-02-15
Bard is now Gemini and Gemini Advanced is amazing
(dataconomy.com)
2024-02-11
Ask HN: What have you built with LLMs?
(news.ycombinator.com)
2024-02-04
Title:BloombergGPT: A Large Language Model for Finance
(arxiv.org)
2024-01-24
Exploring the Zephyr 7B: A Comprehensive Guide to the Lat...
(www.kdnuggets.com)
2024-01-17
Mastering PDFs: Extracting Sections, Headings, Paragraphs...
(blog.llamaindex.ai)
2024-01-16
Understanding and Coding Self-Attention, Multi-Head Atten...
(magazine.sebastianraschka.com)
2024-01-07
Meet Waymo’s MotionLM: The State-of-the-Art Multi-Agent M...
(www.marktechpost.com)
2024-01-07
How much detail is too much? Midjourney v6 attempts to fi...
(arstechnica.com)
2024-01-07
10 Noteworthy AI Research Papers of 2023
(magazine.sebastianraschka.com)
2023-10-20
7 Steps to Mastering Large Language Models (LLMs)
(www.kdnuggets.com)
2023-10-20
Meta AI Researchers Propose Advanced Long-Context LLMs: A...
(www.marktechpost.com)
2023-10-20
This AI Paper from NVIDIA Explores the Power of Retrieval...
(www.marktechpost.com)
2023-10-20
Finetuning LLMs with LoRA and QLoRA: Insights from Hundre...
(lightning.ai)
2023-10-20
Getting Started with Large Language Models: Key Things to...
(flyte.org)
2023-10-20
Unlocking GPT-4 Summarization with Chain of Density Promp...
(www.kdnuggets.com)
2023-10-20
Building RAG-based LLM Applications for Production (Part 1)
(www.anyscale.com)
2023-10-07
Parallel Processing in Prompt Engineering: The Skeleton-o...
(www.kdnuggets.com)
2023-10-05
[2302.07730] Transformer models: an introduction and catalog
(arxiv.org)
2023-10-03
ChatGPT, Bard, or Bing Chat? Differences Among 3 Generati...
(www.nngroup.com)
2023-10-03
Bard
(bard.google.com)
2023-09-25
Large Language Model Prompt Engineering for Complex Summa...
(devblogs.microsoft.com)
2023-09-25
Open LLM Leaderboard : a Hugging Face Space by HuggingFaceH4
(huggingface.co)
2023-09-25
Llama from scratch
(blog.briankitano.com)
2023-09-25
Asking 60+ LLMs a set of 20 questions
(benchmarks.llmonitor.com)
2023-09-24
OpenAI Unveils DALL·E 3: A Revolutionary Leap in Text-to-...
(www.marktechpost.com)
2023-09-24
Comparison: DALL-E 3 vs Midjourney
(dataconomy.com)
2023-09-12
A Beginner’s Guide to Building LLM-Powered Applications w...
(dev.to)
2023-08-31
iryna-kondr/scikit-llm: Seamlessly integrate LLMs into sc...
(github.com)
2023-08-27
Together AI Unveils Llama-2-7B-32K-Instruct: A Breakthrou...
(www.marktechpost.com)
2023-08-20
Meet Chroma: An AI-Native Open-Source Vector Database For...
(www.marktechpost.com)
2023-08-07
Introducing OpenLLM: Open Source Library for LLMs
(www.kdnuggets.com)
2023-08-07
Abacus AI Introduces A New Open Long-Context Large Langua...
(www.marktechpost.com)
2023-08-06
How to use LLMs for PDF parsing
(nanonets.com)
2023-08-02
LangChain 101: Build Your Own GPT-Powered Applications
(www.kdnuggets.com)
2023-07-28
MPT-30B: Raising the bar for open-source foundation models
(www.mosaicml.com)
2023-07-28
Midjourney pricing plans and free alternatives to try
(dataconomy.com)
2023-07-28
Is Anthropic's Claude 2 model ready to take down GPT-4? W...
(dev.to)
2023-07-24
Emerging Architectures for LLM Applications
(a16z.com)
2023-07-24
ELI5: FlashAttention
(gordicaleksa.medium.com)
2023-07-24
Free Full Stack LLM Bootcamp
(www.kdnuggets.com)
2023-07-23
The Secret Sauce behind 100K context window in LLMs: all ...
(blog.gopenai.com)
2023-07-23
Training LLMs with AMD MI250 GPUs and MosaicML
(www.mosaicml.com)
2023-07-23
Optimizing Memory Usage for Training LLMs and Vision Tran...
(lightning.ai)
2023-07-23
Ecosystem Graphs for Foundation Models
(crfm.stanford.edu)
2023-07-23
Leandro von Werra’s Post
(www.linkedin.com)
2023-07-12
Unraveling the Power of Chain-of-Thought Prompting in Lar...
(www.kdnuggets.com)
2023-07-12
GitHub - Mooler0410/LLMsPracticalGuide: A curated list of...
(github.com)
2023-06-19
Falcon LLM: The New King of Open-Source LLMs
(www.kdnuggets.com)
2023-06-09
LMM Garden | Discover, search, and compare LLMs
(llm.garden)
2023-06-08
iryna-kondr/scikit-llm
(github.com)
2023-05-21
Sonali Pattnaik on LinkedIn: #generativeai #ai | 45 comments
(www.linkedin.com)
2023-05-19
The Non-Silence of the LLMs
(informationisbeautiful.net)
2023-05-19
Super Bard: The AI That Can Do It All and Better
(www.kdnuggets.com)
2023-05-05
Distilling Step-by-Step! Outperforming Larger Language Mo...
(arxiv.org)
2023-05-05
SparseGPT: Massive Language Models Can Be Accurately Prun...
(arxiv.org)
2023-05-05
openlm-research/open_llama: OpenLLaMA, a permissively lic...
(github.com)
2023-05-03
guidance-ai/guidance: A guidance language for controlling...
(github.com)
2023-04-29
Blog | Anyscale
(www.anyscale.com)
2023-04-25
Data Machina #198
(datamachina.substack.com)
2023-04-25
Finetuning Large Language Models
(magazine.sebastianraschka.com)
2023-04-21
Stanford CRFM
(crfm.stanford.edu)
2023-04-21
Eight Things to Know about Large Language Models
(arxiv.org)
2023-04-19
Baby AGI: The Birth of a Fully Autonomous AI
(www.kdnuggets.com)
2023-04-19
Hacker News
(magazine.sebastianraschka.com)
2023-04-14
Prompt Engineering
(lilianweng.github.io)
2023-04-14
A Survey of Large Language Models
(arxiv.org)
2023-04-14
New Ebook: A Beginner’s Guide to Large Language Models
(www.nvidia.com)
2023-02-10
OpenAI Platform
(platform.openai.com)
2014-09-24
Top 9 Different Types of Retrieval-Augmented Generation (...
(www.marktechpost.com)
2014-09-24
FlashSigmoid: A Hardware-Aware and Memory-Efficient Imple...
(www.marktechpost.com)
2014-09-24
Graphiti: A Python Library for Building Temporal Knowledg...
(www.marktechpost.com)
2014-08-24
Building a Simple RAG Application Using LlamaIndex - Mach...
(machinelearningmastery.com)
2009-09-24
LlamaIndex : LlamaIndex
(docs.llamaindex.ai)
2003-09-24
Why GPU Utilization Falls Short: Understanding Streaming ...
(www.marktechpost.com)
2002-10-24
LightLLM: A Lightweight Scalable and High-Speed Python Fr...
(www.marktechpost.com)
2001-10-24
Ten Effective Strategies to Lower Large Language Model (L...
(www.marktechpost.com)