Perfectly Awesome
Posts by Date
Posts by Topic
Posts by Title
About
large language models (LLMs) (all)
Also see: streaming, youtube, video
2025-03-23
Quickstart | Mistral AI Large Language Models
(docs.mistral.ai)
2025-03-22
Improving Recommender Systems & Search in the Age of LLMs
(eugeneyan.com)
2025-03-20
Anthropic just gave Claude a superpower: real-time web se...
(venturebeat.com)
2025-03-18
Mistral Small 3.1 runs on a MacBook and beats giants - Da...
(dataconomy.com)
2025-03-17
Mistral Small 3.1
(simonwillison.net)
2025-03-16
https://www.r-bloggers.com/2025/03/the-ellmer-package-for...
(www.r-bloggers.com)
2025-03-13
What is catastrophic forgetting? - Dataconomy
(dataconomy.com)
2025-03-13
Top 7 Open-Source LLMs in 2025 - KDnuggets
(www.kdnuggets.com)
2025-03-12
What are model cards? - Dataconomy
(dataconomy.com)
2025-03-11
How I use LLMs to help me write code
(open.substack.com)
2025-03-08
On GPT-4.5
(thezvi.substack.com)
2025-03-08
The State of LLM Reasoning Models
(open.substack.com)
2025-03-07
Mistral OCR
(simonwillison.net)
2025-03-06
Mistral OCR | Mistral AI
(mistral.ai)
2025-03-04
llm-ollama 0.9.0
(simonwillison.net)
2025-02-26
Claude 3.7 Sonnet and Claude Code
(www.anthropic.com)
2025-02-26
The Deep Research problem — Benedict Evans
(www.ben-evans.com)
2025-02-24
5 Principles for Writing Effective Prompts (2025 Update)
(blog.tobiaszwingmann.com)
2025-02-24
Greg Brockman shared this template for prompting
(www.linkedin.com)
2025-02-21
LLM Leaderboard
(artificialanalysis.ai)
2025-02-17
Here Are My Go-To AI Tools
(open.substack.com)
2025-02-17
A Step-by-Step Guide to Setting Up a Custom BPE Tokenizer...
(www.marktechpost.com)
2025-02-15
We Were Wrong About GPUs
(fly.io)
2025-02-07
Using pip to install a Large Language Model that’s under ...
(simonwillison.net)
2025-02-05
Understanding Reasoning LLMs
(sebastianraschka.com)
2025-02-03
5 AI Agent Frameworks Compared - KDnuggets
(www.kdnuggets.com)
2025-02-02
(WIP) A Little Bit of Reinforcement Learning from Human F...
(rlhfbook.com)
2025-02-02
Creating an AI Agent-Based System with LangGraph: Adding ...
(www.marktechpost.com)
2025-02-01
aidanmclaughlin/AidanBench: Aidan Bench attempts to measu...
(github.com)
2025-01-31
OpenAI o3-mini, now available in LLM
(simonwillison.net)
2025-01-29
Multi-Head Latent Attention and Other KV Cache Tricks
(www.pyspur.dev)
2025-01-29
Qwen AI Introduces Qwen2.5-Max: A large MoE LLM Pretraine...
(www.marktechpost.com)
2025-01-29
Alibaba releases AI model it says surpasses DeepSeek
(www.reuters.com)
2025-01-28
On MLA
(planetbanatt.net)
2025-01-27
The Illustrated DeepSeek-R1
(newsletter.languagemodels.co)
2025-01-26
DeepSeek-R1 vs. OpenAI’s o1: A New Step in Open Source an...
(www.marktechpost.com)
2025-01-25
AI hallucinations can’t be stopped — but these techniques...
(www.nature.com)
2025-01-23
Noteworthy LLM Research Papers of 2024
(sebastianraschka.com)
2025-01-23
LLM 0.20
(simonwillison.net)
2025-01-23
How Chinese A.I. Start-Up DeepSeek Is Competing With Open...
(www.nytimes.com)
2025-01-20
DeepSeek-R1 and exploring DeepSeek-R1-Distill-Llama-8B
(simonwillison.net)
2025-01-18
Microsoft Presents a Comprehensive Framework for Securing...
(www.marktechpost.com)
2025-01-18
Lessons From Red Teaming 100 Generative AI Products
(simonwillison.net)
2025-01-18
Implementing A Byte Pair Encoding (BPE) Tokenizer From Sc...
(sebastianraschka.com)
2025-01-17
This Rumor About GPT-5 Changes Everything
(open.substack.com)
2025-01-14
The 2025 AI Engineering Reading List
(www.latent.space)
2025-01-12
Agents
(huyenchip.com)
2025-01-12
100 Must-Read Generative AI Papers from 2024
(open.substack.com)
2025-01-09
7 Next-Generation Prompt Engineering Techniques - Machine...
(machinelearningmastery.com)
2025-01-08
How to use NotebookLM for personalized knowledge synthesis
(open.substack.com)
2025-01-07
An Opinionated Evals Reading List — Apollo Research
(www.apolloresearch.ai)
2025-01-01
LLMS 2023-2024 (Williston) – Dropbox Paper
(www.dropbox.com)
2024-12-31
Things we learned out about LLMs in 2024
(simonwillison.net)
2024-12-30
How to Build a Graph RAG App
(towardsdatascience.com)
2024-12-24
Gemini 2.0 Flash "Thinking Mode"
(open.substack.com)
2024-12-22
LLM Research Papers: The 2024 List
(magazine.sebastianraschka.com)
2024-12-22
Why AI language models choke on too much text
(arstechnica.com)
2024-12-21
rasbt/LLMs-from-scratch: Implement a ChatGPT-like LLM in ...
(github.com)
2024-12-21
Slim-Llama: An Energy-Efficient LLM ASIC Processor Suppor...
(www.marktechpost.com)
2024-12-21
OpenAI Unveils o3 System That Reasons Through Math, Scien...
(www.nytimes.com)
2024-12-19
Building effective agents \ Anthropic
(www.anthropic.com)
2024-12-18
Blt patches scale better than tokens
(dl.fbaipublicfiles.com)
2024-12-16
Meta AI Proposes Large Concept Models (LCMs): A Semantic ...
(www.marktechpost.com)
2024-12-15
How LLMs Store and Use Knowledge? This AI Paper Introduce...
(www.marktechpost.com)
2024-12-13
LangChain vs OpenAI API: When Simplicity Meets Scalabilit...
(blogs.adityabh.is-a.dev)
2024-12-12
Transformers Key-Value (KV) Caching Explained
(towardsdatascience.com)
2024-12-12
Scaling Laws – O1 Pro Architecture, Reasoning Training In...
(semianalysis.com)
2024-12-11
The AI Researchers Pushing Computers to Launch Nightmare ...
(www.wsj.com)
2024-12-09
What are Hallucinations in LLMs and 6 Effective Strategie...
(www.marktechpost.com)
2024-12-07
Countless.dev | AI Model Comparison
(countless.dev)
2024-12-07
CPU-GPU I/O-Aware LLM Inference Reduces Latency in GPUs b...
(www.marktechpost.com)
2024-12-05
How to Build a General-Purpose LLM Agent
(towardsdatascience.com)
2024-12-05
Treemap
(aiworld.eu)
2024-12-05
AI Hallucinations: Why Large Language Models Make Things ...
(www.kapa.ai)
2024-11-29
llama.cpp guide - Running LLMs locally, on any hardware, ...
(steelph0enix.github.io)
2024-11-28
Four Cutting-Edge Methods for Evaluating AI Agents and En...
(www.marktechpost.com)
2024-11-26
eugeneyan/llm-paper-notes: Notes from the Latent Space pa...
(github.com)
2024-11-21
Understanding Multimodal LLMs
(magazine.sebastianraschka.com)
2024-11-17
Something weird is happening with LLMs and chess
(open.substack.com)
2024-11-11
Analyzing the homerun year for LLMs: the top-100 most cit...
(www.zeta-alpha.com)
2024-10-31
LLM Chunking, Indexing, Scoring and Agents, in a Nutshell...
(www.datasciencecentral.com)
2024-10-28
Developing a computer use model
(www.anthropic.com)
2024-10-19
5 LLM Tools I Can’t Live Without
(www.kdnuggets.com)
2024-10-19
Claude: Everything you need to know about Anthropic's AI ...
(techcrunch.com)
2024-10-17
Nvidia just dropped a new AI model that crushes OpenAI’s ...
(venturebeat.com)
2024-08-04
dpo-from-scratch.ipynb
(github.com)
2024-08-04
What We Learned from a Year of Building with LLMs (Part I)
(www.oreilly.com)
2024-08-01
Towards Monosemanticity: A step towards understanding lar...
(towardsdatascience.com)
2024-07-24
Meta unleashes its most powerful AI model, Llama 3.1, wit...
(venturebeat.com)
2024-07-24
Customize Generative AI Models for Enterprise Application...
(developer.nvidia.com)
2024-07-24
Llama 3.1 Released: Meta’s New Open-Source AI Model that ...
(www.marktechpost.com)
2024-07-24
Meta Llama 3.1 405b is outperforming private models with ...
(dataconomy.com)
2024-07-20
Understanding Positional Embeddings in Transformers: From...
(towardsdatascience.com)
2024-07-15
Claude 3.5 Sonnet
(www.anthropic.com)
2024-07-13
Do large language models understand the world?
(www.amazon.science)
2024-07-04
Building an LLM Router for High-Quality and Cost-Effectiv...
(www.anyscale.com)
2024-07-03
From bare metal to a 70B model: infrastructure set-up and...
(imbue.com)
2024-07-02
StarCoder2-15B: A Powerful LLM for Code Generation, Summa...
(nvda.ws)
2024-06-27
How Gradient created an open LLM with a million-token con...
(venturebeat.com)
2024-06-22
Some Commonly Used Advanced Prompt Engineering Techniques...
(www.marktechpost.com)
2024-06-20
Key Metrics for Evaluating Large Language Models (LLMs)
(www.marktechpost.com)
2024-06-20
Firecrawl: A Powerful Web Scraping Tool for Turning Websi...
(www.marktechpost.com)
2024-06-19
Let's reproduce GPT-2 (124M)
(m.youtube.com)
2024-06-19
How to use an open source LLM model locally and remotely
(thoughtbot.com)
2024-06-12
“The” Midjourney model personalization guide
(dataconomy.com)
2024-06-12
How to use Perplexity in your PM work
(www.lennysnewsletter.com)
2024-06-11
[2406.01506] The Geometry of Categorical and Hierarchical...
(arxiv.org)
2024-06-11
What We Learned from a Year of Building with LLMs (Part II)
(www.oreilly.com)
2024-06-11
Sharpening LLMs: The Sharpest Tools and Essential Techniq...
(www.marktechpost.com)
2024-06-11
List of Activities and Their Corresponding Suitable LLMs ...
(www.marktechpost.com)
2024-06-11
Three Things to Know About Prompting LLMs
(sloanreview.mit.edu)
2024-05-31
Perplexity goes beyond AI search, launches publishing pla...
(venturebeat.com)
2024-05-28
The Great AI Chatbot Challenge: ChatGPT vs. Gemini vs. Co...
(www.wsj.com)
2024-05-26
The future of foundation models is closed-source
(www.thediff.co)
2024-05-24
Demystifying Vision-Language Models: An In-Depth Exploration
(www.marktechpost.com)
2024-05-22
AI Is a Black Box. Anthropic Figured Out a Way to Look In...
(www.wired.com)
2024-05-21
naklecha/llama3-from-scratch
(github.com)
2024-05-21
Abacus AI Releases Smaug-Llama-3-70B-Instruct: The New Be...
(www.marktechpost.com)
2024-05-13
Do Enormous LLM Context Windows Spell the End of RAG?
(thenewstack.io)
2024-05-13
How Good Are the Latest Open LLMs? And Is DPO Better Than...
(sebastianraschka.com)
2024-05-12
ChuXin: A Fully Open-Sourced Language Model with a Size o...
(www.marktechpost.com)
2024-05-11
Title:You Only Cache Once: Decoder-Decoder Architectures ...
(arxiv.org)
2024-05-11
Anthropic AI Launches a Prompt Engineering Tool that Gene...
(www.marktechpost.com)
2024-05-11
Cleaning
(docs.unstructured.io)
2024-05-08
[2404.19737] Better & Faster Large Language Models via Mu...
(arxiv.org)
2024-05-07
Researchers at NVIDIA AI Introduce ‘VILA’: A Vision Langu...
(www.marktechpost.com)
2024-05-05
Hugging Face - Documentation
(huggingface.co)
2024-04-25
Understanding Key Terminologies in Large Language Model (...
(www.marktechpost.com)
2024-04-25
Top 15 AI Libraries/Frameworks for Automatically Red-Team...
(www.marktechpost.com)
2024-04-19
Meta says Llama 3 beats most other models, including Gemi...
(www.theverge.com)
2024-04-17
anthropics/anthropic-cookbook: A collection of notebooks/...
(github.com)
2024-04-15
Deep Learning Architectures From CNN, RNN, GAN, and Trans...
(www.marktechpost.com)
2024-04-15
Tips for LLM Pretraining and Evaluating Reward Models
(magazine.sebastianraschka.com)
2024-04-14
Lessons after a half-billion GPT tokens - Ken Kantzer's Blog
(kenkantzer.com)
2024-04-13
5 Ways To Use LLMs On Your Laptop
(www.kdnuggets.com)
2024-04-13
Words are flowing out like endless rain: Recapping a busy...
(arstechnica.com)
2024-04-12
Gemini: A Family of Highly Capable Multimodal Models
(dev.to)
2024-04-10
Peter Gostev’s Post
(www.linkedin.com)
2024-04-05
Detecting Hallucinations in Large Language Models with Te...
(dev.to)
2024-04-05
Top Open Source Large Language Models (LLMs) Available Fo...
(www.marktechpost.com)
2024-04-02
LLaMA Now Goes Faster on CPUs
(justine.lol)
2024-04-02
Large language models use a surprisingly simple mechanism...
(news.mit.edu)
2024-04-02
Introducing DBRX: A New State-of-the-Art Open LLM
(www.databricks.com)
2024-04-01
ChatGPT vs Perplexity AI: AI App Comparison
(www.marktechpost.com)
2024-03-30
Mamba Explained
(thegradient.pub)
2024-03-29
How Nvidia Blackwell Systems Attack 1 Trillion Parameter ...
(www.nextplatform.com)
2024-03-29
How Chain-of-Thought Reasoning Helps Neural Networks Compute
(www.quantamagazine.org)
2024-03-11
Why and How to Achieve Longer Context Windows for LLMs
(towardsdatascience.com)
2024-03-11
Generative AI Design Patterns: A Comprehensive Guide | by...
(towardsdatascience.com)
2024-03-11
You can now train a 70b language model at home
(www.answer.ai)
2024-03-11
Easily Train a Specialized LLM: PEFT, LoRA, QLoRA, LLaMA-...
(towardsdatascience.com)
2024-03-07
Google Bard is called Gemini now and expands to mobile, p...
(www.axios.com)
2024-03-05
Anthropic’s Post
(www.linkedin.com)
2024-03-05
OpenAI's ChatGPT may have its first true rival in Anthrop...
(qz.com)
2024-02-29
rasbt/LLMs-from-scratch
(github.com)
2024-02-29
Meet RAGxplorer: An interactive AI Tool to Support the Bu...
(www.marktechpost.com)
2024-02-29
Meet Google Lumiere AI, Bard’s video maker cousin
(dataconomy.com)
2024-02-29
How To Build an LLM-Powered App To Chat with PapersWithCode
(towardsdatascience.com)
2024-02-29
The killer app of Gemini Pro 1.5 is video
(simonwillison.net)
2024-02-29
Understanding Direct Preference Optimization
(towardsdatascience.com)
2024-02-29
I Spent a Week With Gemini Pro 1.5—It’s Fantastic
(every.to)
2024-02-29
Title:The Era of 1-bit LLMs: All Large Language Models ar...
(arxiv.org)
2024-02-29
Sora early access: Your guide to securing a spot
(dataconomy.com)
2024-02-29
Au Large | Mistral AI | Frontier AI in your hands
(mistral.ai)
2024-02-22
Claude
(claude.ai)
2024-02-22
Beyond Self-Attention: How a Small Language Model Predict...
(shyam.blog)
2024-02-22
How do transformers work?+Design a Multi-class Sentiment ...
(open.substack.com)
2024-02-22
1708022141659 (JPEG Image, 1280 × 1600 pixels) ...
(media.licdn.com)
2024-02-22
Groq Inference Tokenomics: Speed, But At What Cost?
(www.semianalysis.com)
2024-02-20
How Well Can LLMs Negotiate? Stanford Researchers Develop...
(www.marktechpost.com)
2024-02-17
Sora
(openai.com)
2024-02-15
Code LoRA from Scratch - a Lightning Studio by sebastian
(lightning.ai)
2024-02-15
Bard is now Gemini and Gemini Advanced is amazing
(dataconomy.com)
2024-02-11
Ask HN: What have you built with LLMs?
(news.ycombinator.com)
2024-02-04
Title:BloombergGPT: A Large Language Model for Finance
(arxiv.org)
2024-01-24
Exploring the Zephyr 7B: A Comprehensive Guide to the Lat...
(www.kdnuggets.com)
2024-01-17
Mastering PDFs: Extracting Sections, Headings, Paragraphs...
(blog.llamaindex.ai)
2024-01-16
Understanding and Coding Self-Attention, Multi-Head Atten...
(magazine.sebastianraschka.com)
2024-01-16
Dashboard - SciSummary
(scisummary.com)
2024-01-07
Meet Waymo’s MotionLM: The State-of-the-Art Multi-Agent M...
(www.marktechpost.com)
2024-01-07
How much detail is too much? Midjourney v6 attempts to fi...
(arstechnica.com)
2024-01-07
10 Noteworthy AI Research Papers of 2023
(magazine.sebastianraschka.com)
2023-10-20
7 Steps to Mastering Large Language Models (LLMs)
(www.kdnuggets.com)
2023-10-20
Meta AI Researchers Propose Advanced Long-Context LLMs: A...
(www.marktechpost.com)
2023-10-20
This AI Paper from NVIDIA Explores the Power of Retrieval...
(www.marktechpost.com)
2023-10-20
Finetuning LLMs with LoRA and QLoRA: Insights from Hundre...
(lightning.ai)
2023-10-20
Getting Started with Large Language Models: Key Things to...
(flyte.org)
2023-10-20
Unlocking GPT-4 Summarization with Chain of Density Promp...
(www.kdnuggets.com)
2023-10-20
The Ins and Outs of Retrieval-Augmented Generation (RAG)
(towardsdatascience.com)
2023-10-20
Building RAG-based LLM Applications for Production (Part 1)
(www.anyscale.com)
2023-10-20
RAG vs Finetuning: Which Is the Best Tool to Boost Your L...
(towardsdatascience.com)
2023-10-20
A High-Level Overview Of Large Language Model Concepts, U...
(smashingmagazine.com)
2023-10-20
Augmenting LLMs with RAG
(towardsdatascience.com)
2023-10-07
Parallel Processing in Prompt Engineering: The Skeleton-o...
(www.kdnuggets.com)
2023-10-05
[2302.07730] Transformer models: an introduction and catalog
(arxiv.org)
2023-10-04
Hey, Computer, Make Me a Font
(serce.me)
2023-10-04
SaaS Competitive Advantage Through Elegant LLM Feedback M...
(www.tomtunguz.com)
2023-10-03
2302.11382.pdf
(arxiv.org)
2023-10-03
ChatGPT, Bard, or Bing Chat? Differences Among 3 Generati...
(www.nngroup.com)
2023-10-03
Bard
(bard.google.com)
2023-10-03
The State of Large Language Models
(www.scientificamerican.com)
2023-09-25
10 Ways to Improve the Performance of Retrieval Augmented...
(towardsdatascience.com)
2023-09-25
How to Build an LLM from Scratch
(towardsdatascience.com)
2023-09-25
Large Language Model Prompt Engineering for Complex Summa...
(devblogs.microsoft.com)
2023-09-25
Open LLM Leaderboard : a Hugging Face Space by HuggingFaceH4
(huggingface.co)
2023-09-25
Llama from scratch
(blog.briankitano.com)
2023-09-25
Cracking Open the OpenAI (Python) API
(towardsdatascience.com)
2023-09-25
Cracking Open the Hugging Face Transformers Library
(towardsdatascience.com)
2023-09-25
Asking 60+ LLMs a set of 20 questions
(benchmarks.llmonitor.com)
2023-09-24
OpenAI Unveils DALL·E 3: A Revolutionary Leap in Text-to-...
(www.marktechpost.com)
2023-09-24
Comparison: DALL-E 3 vs Midjourney
(dataconomy.com)
2023-09-17
What OpenAI Really Wants
(www.wired.com)
2023-09-12
A Beginner’s Guide to Building LLM-Powered Applications w...
(dev.to)
2023-08-31
iryna-kondr/scikit-llm: Seamlessly integrate LLMs into sc...
(github.com)
2023-08-31
Prompt Engineering — How to trick AI into solving your pr...
(towardsdatascience.com)
2023-08-30
A Beginner’s Guide to LLM Fine-Tuning
(towardsdatascience.com)
2023-08-27
Together AI Unveils Llama-2-7B-32K-Instruct: A Breakthrou...
(www.marktechpost.com)
2023-08-25
A Practical Introduction to LLMs
(towardsdatascience.com)
2023-08-20
Meet Chroma: An AI-Native Open-Source Vector Database For...
(www.marktechpost.com)
2023-08-07
How to Extract Text from Any PDF and Image for Large Lang...
(towardsdatascience.com)
2023-08-07
Introducing OpenLLM: Open Source Library for LLMs
(www.kdnuggets.com)
2023-08-07
Abacus AI Introduces A New Open Long-Context Large Langua...
(www.marktechpost.com)
2023-08-06
How to use LLMs for PDF parsing
(nanonets.com)
2023-08-06
How to Chat With Any File from PDFs to Images Using Large...
(towardsdatascience.com)
2023-08-06
How to Leverage Open-Source LLMs in Your Project
(www.turingpost.com)
2023-08-02
LangChain 101: Build Your Own GPT-Powered Applications
(www.kdnuggets.com)
2023-07-28
MPT-30B: Raising the bar for open-source foundation models
(www.mosaicml.com)
2023-07-28
Midjourney pricing plans and free alternatives to try
(dataconomy.com)
2023-07-28
A Deep Dive Into LLaMA, Falcon, Llama 2 and Their Remarka...
(www.turingpost.com)
2023-07-28
Chain of Thought Prompting for LLMs
(towardsdatascience.com)
2023-07-28
Is Anthropic's Claude 2 model ready to take down GPT-4? W...
(dev.to)
2023-07-24
Emerging Architectures for LLM Applications
(a16z.com)
2023-07-24
ELI5: FlashAttention
(gordicaleksa.medium.com)
2023-07-24
Build Industry-Specific LLMs Using Retrieval Augmented Ge...
(towardsdatascience.com)
2023-07-24
Free Full Stack LLM Bootcamp
(www.kdnuggets.com)
2023-07-24
Edge 300: Meet Falcon LLM: The Most Powerful Open Source ...
(thesequence.substack.com)
2023-07-23
The Secret Sauce behind 100K context window in LLMs: all ...
(blog.gopenai.com)
2023-07-23
Observe.ai unveils 30-billion-parameter contact center LL...
(venturebeat.com)
2023-07-23
All You Need to Know to Build Your First LLM App
(towardsdatascience.com)
2023-07-23
Training LLMs with AMD MI250 GPUs and MosaicML
(www.mosaicml.com)
2023-07-23
Optimizing Memory Usage for Training LLMs and Vision Tran...
(lightning.ai)
2023-07-23
Deploying Falcon-7B Into Production
(towardsdatascience.com)
2023-07-23
Anthropic releases Claude 2, its second-gen AI chatbot
(techcrunch.com)
2023-07-23
Google Launches AI-Powered Notes App Called NotebookLM
(tech.slashdot.org)
2023-07-23
Ecosystem Graphs for Foundation Models
(crfm.stanford.edu)
2023-07-23
Meet LMQL: An Open Source Query Language for LLMs
(thesequence.substack.com)
2023-07-23
Leandro von Werra’s Post
(www.linkedin.com)
2023-07-23
LLaMA 2: How to access and use Meta’s versatile open-sour...
(venturebeat.com)
2023-07-22
Beyond LLaMA: The Power of Open LLMs
(towardsdatascience.com)
2023-07-22
Facebook parent Meta unveils LLaMA 2 open-source AI model...
(venturebeat.com)
2023-07-22
MosaicML launches MPT-7B-8K, a 7B-parameter open-source L...
(venturebeat.com)
2023-07-22
The $1 billion gamble to ensure AI doesn’t destroy humanity
(www.thediff.co)
2023-07-12
Unraveling the Power of Chain-of-Thought Prompting in Lar...
(www.kdnuggets.com)
2023-07-12
GitHub - Mooler0410/LLMsPracticalGuide: A curated list of...
(github.com)
2023-06-19
Introduction to the Open LLM Falcon-40B: Performance, Tra...
(towardsdatascience.com)
2023-06-19
Falcon LLM: The New King of Open-Source LLMs
(www.kdnuggets.com)
2023-06-18
Meet FinGPT: An Open-Source Financial Large Language Mode...
(www-marktechpost-com.cdn.ampproject.org)
2023-06-09
LMM Garden | Discover, search, and compare LLMs
(llm.garden)
2023-06-08
iryna-kondr/scikit-llm
(github.com)
2023-06-02
The Case for Running AI on CPUs Isn’t Dead Yet
(spectrum.ieee.org)
2023-05-28
The Art of Prompt Design: Prompt Boundaries and Token Hea...
(towardsdatascience.com)
2023-05-21
Sonali Pattnaik on LinkedIn: #generativeai #ai | 45 comments
(www.linkedin.com)
2023-05-19
The Non-Silence of the LLMs
(informationisbeautiful.net)
2023-05-19
Super Bard: The AI That Can Do It All and Better
(www.kdnuggets.com)
2023-05-18
Edge 291: Reinforcement Learning with Human Feedback
(thesequence.substack.com)
2023-05-12
Google dives into the ‘supercomputer’ game by knitting to...
(venturebeat.com)
2023-05-05
Distilling Step-by-Step! Outperforming Larger Language Mo...
(arxiv.org)
2023-05-05
SparseGPT: Massive Language Models Can Be Accurately Prun...
(arxiv.org)
2023-05-05
openlm-research/open_llama: OpenLLaMA, a permissively lic...
(github.com)
2023-05-03
guidance-ai/guidance: A guidance language for controlling...
(github.com)
2023-04-29
Blog | Anyscale
(www.anyscale.com)
2023-04-29
Parameter-Efficient LLM Finetuning With Low-Rank Adaptati...
(sebastianraschka.com)
2023-04-29
Edge 286: Vicuna, the LLaMA-Based Model that Matches Chat...
(thesequence.substack.com)
2023-04-26
Grounding Large Language Models in a Cognitive Foundation...
(thegradient.pub)
2023-04-25
Data Machina #198
(datamachina.substack.com)
2023-04-25
Finetuning Large Language Models
(magazine.sebastianraschka.com)
2023-04-21
The LLama Effect: How an Accidental Leak Sparked a Series...
(thesequence.substack.com)
2023-04-21
Stanford CRFM
(crfm.stanford.edu)
2023-04-21
Meta has built a massive new language AI—and it’s giving ...
(www.technologyreview.com)
2023-04-21
Eight Things to Know about Large Language Models
(arxiv.org)
2023-04-19
Baby AGI: The Birth of a Fully Autonomous AI
(www.kdnuggets.com)
2023-04-19
Hacker News
(magazine.sebastianraschka.com)
2023-04-17
📝 Guest Post: How to Enhance the Usefulness of Large Lang...
(thesequence.substack.com)
2023-04-14
Prompt Engineering
(lilianweng.github.io)
2023-04-14
A Survey of Large Language Models
(arxiv.org)
2023-04-14
New Ebook: A Beginner’s Guide to Large Language Models
(www.nvidia.com)
2023-04-13
Maximizing the Potential of LLMs: A Guide to Prompt Engin...
(www.ruxu.dev)
2023-04-13
The Magic of LLMs — Prompt Engineering
(towardsdatascience.com)
2023-04-12
📝 Guest Post: Caching LLM Queries for Improved Performanc...
(thesequence.substack.com)
2023-02-10
OpenAI Platform
(platform.openai.com)
2014-09-24
Graphiti: A Python Library for Building Temporal Knowledg...
(www.marktechpost.com)
2014-09-24
Top 9 Different Types of Retrieval-Augmented Generation (...
(www.marktechpost.com)
2014-09-24
FlashSigmoid: A Hardware-Aware and Memory-Efficient Imple...
(www.marktechpost.com)
2014-08-24
Building a Simple RAG Application Using LlamaIndex - Mach...
(machinelearningmastery.com)
2009-09-24
LlamaIndex : LlamaIndex
(docs.llamaindex.ai)
2003-09-24
Why GPU Utilization Falls Short: Understanding Streaming ...
(www.marktechpost.com)
2002-10-24
Nvidia just dropped a bombshell: Its new AI model is open...
(venturebeat.com)
2002-10-24
LightLLM: A Lightweight Scalable and High-Speed Python Fr...
(www.marktechpost.com)
2001-10-24
Ten Effective Strategies to Lower Large Language Model (L...
(www.marktechpost.com)