gpt
gpt — my Raindrop.io articles
OpenAI has indicated that a new version of its large language model, GPT-5.4, is in development following a post on
Musings of a Computer Scientist.
One of my favourite features of ChatGPT is its ability to write and execute code in a container. This feature launched as ChatGPT Code Interpreter nearly three years ago, was …
A blog about making culture. Since 1999.
OpenAI half-relased this model earlier this month, adding it to their Codex CLI tool but not their API. Today they've fixed that - the new model can now be accessed …
I’ve had preview access to the new GPT-5 model family for the past two weeks, and have been using GPT-5 as my daily-driver. It’s my new favorite model. It’s still …
We tested OpenAI’s ChatGPT Agent, currently only available via its $200-per-month Pro subscription.
ChatGPT agent System Card: OpenAI’s agentic model unites research, browser automation, and code tools with safeguards under the Preparedness Framework.
Can't find a human to practice a language with? I've been there. Here's why most of my interaction with ChatGPT is in other languages.
Johann Rehberger snagged a copy of the [ChatGPT Operator](https://simonwillison.net/2025/Jan/23/introducing-operator/) system prompt. As usual, the system prompt doubles as better written documentation than any of the official sources. It asks users …
A detailed analysis of ChatGPT search and Google's performance across 62 queries, with scoring metrics and practical examples.
A.I. insiders are falling for Claude, a chatbot from Anthropic. Is it a passing fad, or a preview of artificial relationships to come?
The power of a robust honor code—and abundant institutional resources
The popular online chatbot can now access and deliver information from across the internet in real time, including news, stock prices and sports scores.
Discover and create custom versions of ChatGPT that combine instructions, extra knowledge, and any combination of skills.
Introduction In this post, I want to introduce Mark, a simple CLI tool that uses Markdown and its syntax to interact naturally with the GPT4-vision/GPT4o models.
Prompt engineering is crucial to leveraging ChatGPT's capabilities, enabling users to elicit relevant, accurate, high-quality responses from the model. As language models like ChatGPT become more sophisticated, mastering the art of crafting effective prompts has become essential. This comprehensive overview delves into prompt engineering principles, techniques, and best practices, providing a detailed understanding drawn from multiple authoritative sources. Understanding Prompt Engineering Prompt engineering involves the deliberate design and refinement of input prompts to influence the output of a language model like ChatGPT. The efficacy of a prompt directly impacts the relevance and coherence of the AI's responses. Effective prompt engineering
Amazon trained me to write evidence-based narratives. I love the format. It’s a clear and compelling way to present information to drive…
We tested OpenAI’s ChatGPT against Microsoft’s Copilot and Google’s Gemini, along with Perplexity and Anthropic’s Claude. Here’s how they ranked.
The search engine war is heating up. ChatGPT may introduce its search engine, which will rival Google, on Monday. Although
Making your custom GPTs is just one of the ways to leverage your content strategy and use ChatGPT...
Claude and ChatGPT are two compelling options in AI chatbots, each with unique features and capabilities. To discern their strengths and suitability for various applications, let's compare these two AI chatbots comprehensively. What is Claude? Claude is an AI chatbot developed by an Anthropic AI renowned for its ability to simulate human-like conversations. Built on sophisticated NLP algorithms, Claude excels in engaging users in meaningful dialogues across a spectrum of topics. What sets Claude apart is its emphasis on understanding the user's persona and tailoring responses to match individual preferences and communication styles. This personalised touch enhances user experience, fostering
We build a Generatively Pretrained Transformer (GPT), following the paper "Attention is All You Need" and OpenAI's GPT-2 / GPT-3. We talk about connections to ChatGPT, which has taken the world by storm. We watch GitHub Copilot, itself a GPT, help us write a GPT (meta :D!) . I recommend people watch the earlier makemore videos to get comfortable with the autoregressive language modeling framework and basics of tensors and PyTorch nn, which we take for granted in this video. Links: - Google colab for the video: https://colab.research.google.com/drive/1JMLa53HDuA-i7ZBmqV7ZnA3c_fvtXnx-?usp=sharing - GitHub repo for the video: https://github.com/karpathy/ng-video-lecture - Playlist of the whole Zero to Hero series so far: https://www.youtube.com/watch?v=VMj-3S1tku0&list=PLAqhIrjkxbuWI23v9cThsA9GvCAUhRvKZ - nanoGPT repo: https://github.com/karpathy/nanoGPT - my website: https://karpathy.ai - my twitter: https://twitter.com/karpathy - our Discord channel: https://discord.gg/3zy8kqD9Cp Supplementary links: - Attention is All You Need paper: https://arxiv.org/abs/1706.03762 - OpenAI GPT-3 paper: https://arxiv.org/abs/2005.14165 - OpenAI ChatGPT blog post: https://openai.com/blog/chatgpt/ - The GPU I'm training the model on is from Lambda GPU Cloud, I think the best and easiest way to spin up an on-demand GPU instance in the cloud that you can ssh to: https://lambdalabs.com . If you prefer to work in notebooks, I think the easiest path today is Google Colab. Suggested exercises: - EX1: The n-dimensional tensor mastery challenge: Combine the `Head` and `MultiHeadAttention` into one class that processes all the heads in parallel, treating the heads as another batch dimension (answer is in nanoGPT). - EX2: Train the GPT on your own dataset of choice! What other data could be fun to blabber on about? (A fun advanced suggestion if you like: train a GPT to do addition of two numbers, i.e. a+b=c. You may find it helpful to predict the digits of c in reverse order, as the typical addition algorithm (that you're hoping it learns) would proceed right to left too. You may want to modify the data loader to simply serve random problems and skip the generation of train.bin, val.bin. You may want to mask out the loss at the input positions of a+b that just specify the problem using y=-1 in the targets (see CrossEntropyLoss ignore_index). Does your Transformer learn to add? Once you have this, swole doge project: build a calculator clone in GPT, for all of +-*/. Not an easy problem. You may need Chain of Thought traces.) - EX3: Find a dataset that is very large, so large that you can't see a gap between train and val loss. Pretrain the transformer on this data, then initialize with that model and finetune it on tiny shakespeare with a smaller number of steps and lower learning rate. Can you obtain a lower validation loss by the use of pretraining? - EX4: Read some transformer papers and implement one additional feature or change that people seem to use. Does it improve the performance of your GPT? Chapters: 00:00:00 intro: ChatGPT, Transformers, nanoGPT, Shakespeare baseline language modeling, code setup 00:07:52 reading and exploring the data 00:09:28 tokenization, train/val split 00:14:27 data loader: batches of chunks of data 00:22:11 simplest baseline: bigram language model, loss, generation 00:34:53 training the bigram model 00:38:00 port our code to a script Building the "self-attention" 00:42:13 version 1: averaging past context with for loops, the weakest form of aggregation 00:47:11 the trick in self-attention: matrix multiply as weighted aggregation 00:51:54 version 2: using matrix multiply 00:54:42 version 3: adding softmax 00:58:26 minor code cleanup 01:00:18 positional encoding 01:02:00 THE CRUX OF THE VIDEO: version 4: self-attention 01:11:38 note 1: attention as communication 01:12:46 note 2: attention has no notion of space, operates over sets 01:13:40 note 3: there is no communication across batch dimension 01:14:14 note 4: encoder blocks vs. decoder blocks 01:15:39 note 5: attention vs. self-attention vs. cross-attention 01:16:56 note 6: "scaled" self-attention. why divide by sqrt(head_size) Building the Transformer 01:19:11 inserting a single self-attention block to our network 01:21:59 multi-headed self-attention 01:24:25 feedforward layers of transformer block 01:26:48 residual connections 01:32:51 layernorm (and its relationship to our previous batchnorm) 01:37:49 scaling up the model! creating a few variables. adding dropout Notes on Transformer 01:42:39 encoder vs. decoder vs. both (?) Transformers 01:46:22 super quick walkthrough of nanoGPT, batched multi-headed self-attention 01:48:53 back to ChatGPT, GPT-3, pretraining vs. finetuning, RLHF 01:54:32 conclusions Corrections: 00:57:00 Oops "tokens from the _future_ cannot communicate", not "past". Sorry! :) 01:20:05 Oops I should be using the head_size for the normalization, not C
More AI image generation tools at your fingertips.
What is ChatGPT? ChatGPT, developed by OpenAI, is an AI platform renowned for its conversational AI capabilities. Leveraging the power of the Generative Pre-trained Transformer models, ChatGPT generates human-like text responses across various topics, from casual conversations to complex, technical discussions. Its ability to engage users with coherent, contextually relevant dialogues stands out, making it highly versatile for various applications, including content creation, education, customer service, and more. Its integration with tools like DALL-E for image generation from textual descriptions and its continual updates for enhanced performance showcase its commitment to providing an engaging and innovative user experience. ChatGPT Key
A complete GPT2 implementation as a single SQL query in PostgreSQL.
The Amazon-backed AI startup said its "most intelligent model" outperformed OpenAI's powerful GPT-4
OpenAI's custom GPTs and GPT Store could proffer a potential new platform for publishers to drive referral traffic.
Sure, anyone can use OpenAI’s chatbot. But with smart engineering, you can get way more interesting results.
New experiments show that very young children are better at solving creative puzzles than ChatGPT and other AI models
The use of NLP in the realm of financial technology is broad and complex, with applications ranging from sentiment analysis and named entity recognition to question answering. Large Language...
Identify and target personas of keywords, competitors, Reddit discussions, and more.
Best AI Chatbots for Customer Support
Implementing a GPT model from scratch in NumPy.
OpenAI's ChatGPT Vision is making waves in the world of artificial intelligence, but what exactly is it, and how can
Bard is now Gemini. Get help with writing, planning, learning, and more from Google AI.
KDnuggets' latest cheat sheet covers 10 curated hands-on projects to boost data science workflows with ChatGPT across ML, NLP, and full stack dev, including links to full project details.
Do you use Python, Pandas, and Seaborn to collect, analyze, and plot data? Then you'll be amazed by what ChatGPT can do, when using ChatGPT+, GPT-4 model, and the plugin for Noteable's version of Jupyter notebooks. [UPDATE/NOTE: This was my first summary of Noteable and ChatGPT. I have done more experiments, which you can see here: https://www.youtube.com/watch?v=2WUZ0b-hUDU] In this video, I show you how I got things set up for using Noteable (and world headlines), then put together a query, You'll see how it goes well, where it goes wrong, and what sort of code I can create using just English-language descriptions of my plans. And I show you what's happening behind the scenes, as we see the JSON being written. This is all brand new and exciting, and I hope that you'll post suggestions and ideas in the comments for how we can take this even further! And if you're interested in analyzing data with Pandas, check out Bamboo Weekly at https://www.BambooWeekly.com/, where I look at current events through the eyes of data analysis.
Hey guys, welcome back to my R-tips newsletter. In today’s R-tip, I’m sharing a super common data science task (one that saved me 20 hours per week)… You’re getting the cheat code to automating Google Sheets. Plus, I’m sharing exactly how I made this a...
How you can fine-tune OpenAI’s GPT-3.5 Turbo model to perform new tasks using your custom data
Using ChatGPT & OpenAI's GPT API, this code tutorial teaches how to chat with PDFs, automate PDF tasks, and build PDF chatbots.
Anthropic released Claude 2, a new iteration of its AI model, to take on ChatGPT and Google Bard...
An Introduction to Auto-GPT
Explained with an example use case.
Large language models such as GPT-3/4, LLaMA and PaLM work in terms of tokens. They take text, convert it into tokens (integers), then predict which tokens should come next. Playing …
GPT, explained simply, in a metaphor of potion.
10 ChatGPT Plugins for Data Science Cheat Sheet • Noteable Plugin: The ChatGPT Plugin That Automates Data Analysis • 3 Ways to Access Claude AI for Free • What are Vector Databases and Why Are They Important for LLMs? • A Data Scientist’s Essential Guide to Exploratory Data Analysis
Transform your life with these ChatGPT’s hidden gems.
An effective prompt is the first step in benefitting from ChatGPT. That's the challenge — an effective prompt.
In this article, we will demonstrate how to use different prompts to ask ChatGPT for help and make...
ChatGPT can generate usable content. But it can also analyze existing content — articles, descriptions — and suggest improvements for SEO and social media.
Learn how standard greedy tokenization introduces a subtle and powerful bias that can have all kinds of unintended consequences.
The next generation of AI is leaving behind the viral chatbot.
On Tuesday, the result arrived via email: “NOT GUILTY.”
1) Reinforcement Learning with Human Feedback(RLHF) 2) The RLHF paper, 3) The transformer reinforcement learning framework.
More languages, image inputs, and extension support among Bard features at I/O ’23.
Use the power of ChatGPT from within your own apps using OpenAI’s API and this guide.
LangChain + OpenAI + Panel + HuggingFace
It's like learning a new language - kind of.
Sundays, The Sequence Scope brings a summary of the most important research papers, technology releases and VC funding deals in the artificial intelligence space.
Exploring why AI won’t replace designers, but rather enhance their work.
The simplest, fastest repository for training/finetuning medium-sized GPTs. - karpathy/nanoGPT
Dolly 2.0 could spark a new wave of fully open source LLMs similar to ChatGPT.
Artificial intelligence models have found their way into many people’s lives, for work and for fun.
Chain-of-Thought (CoT) prompting can effectively elicit complex multi-step reasoning from Large Language Models~(LLMs). For example, by simply adding CoT instruction ``Let's think step-by-step''...
#langchain #chatgpt #gpt4 #artificialintelligence #automation #python #notion #productivity #datascience #pdf #machinelearning In this tutorial, learn how to easily extract information from a PDF document using LangChain and ChatGPT. I'll walk you through installing dependencies, loading and processing a PDF file, creating embeddings, and querying the PDF with natural language questions. 00:00 - Introduction 00:21 - Downloading a sample PDF 00:49 - Importing required modules 01:21 - Setting up the PDF path and loading the PDF 01:38 - Printing the first page of the PDF 01:53 - Creating embeddings and setting up the Vector database 02:24 - Creating a chat database chain 02:49 - Querying the PDF with a question 03:27 - Understanding the query results 04:00 - Conclusion Remember to like and subscribe for more tutorials on learning, research and AI! - Source code: https://github.com/EnkrateiaLucca/talk_pdf - Link to the medium article: https://medium.com/p/e723337f26a6 - Subscribe!: https://www.youtube.com/channel/UCu8WF59Scx9f3H1N_FgZUwQ - Join Medium: https://lucas-soares.medium.com/membership - Tiktok: https://www.tiktok.com/@enkrateialucca?lang=en - Twitter: https://twitter.com/LucasEnkrateia - LinkedIn: https://www.linkedin.com/in/lucas-soares-969044167/ Music from [www.epidemicsound.com](http://www.epidemicsound.com/)
ChatGPT is a deep-learning model created by OpenAI whose ability to generate human-like prose has made AI a topic of conversation. Learn more
AI multiplies your efforts. I found out by how much...
OpenAI today announced its support of new third-party plugins for ChatGPT, and it already has Twitter buzzing about the company's potential platform play.
Quote "ChatGPT is like a genie in a bottle, but instead of granting you three wishes, it gives you endless responses until you realize you've been chatting with a machine for hours." 😂
Who Is publishing the most Impactful AI research right now? With the breakneck pace of innovation in AI, it is crucial to pick up some signal as soon as possible. No one has the time to read everything, but these 100 papers are sure to bend the road as to where our AI technology is going. The real test of impact of R&D teams is of course how the technology appears in products, and OpenAI shook the world by releasing ChatGPT at the end of November 2022, following fast on their March 2022 paper “T
ChatPDF is the fast and easy way to chat with any PDF, free and without sign-in. Talk to books, research papers, manuals, essays, legal contracts, whatever you have! The intelligence revolution is here, ChatGPT was just the beginning!
ChatGPT recently passed the U.S. Medical Licensing Exam, but using it for a real-world medical diagnosis would quickly turn deadly.
ChatGPT invented a hit puzzle game called Sumplete that could rival Wordle. There's just one problem: It already exists.
It’s March 2023 and right now ChatGPT, the amazing AI chatbot tool from OpenAI, is all the rage. But when OpenAI released their public web API for ChatGPT on the 1st of March you might have been a bit disappointed. If you’re an R user, that is. Because, when scrolling through the release announcement you find that there is a python package to use this new API, but no R package. I’m here to say: Don’t be disappointed! As long as there is a web API for a service then it’s going to be easy to use this service from R, no specialized package needed. So here’s an example of how to use the new (as of March 2023) ChatGPT API from R. But know that when the next AI API hotness comes out (likely April 2023, or so) then it’s going to be easy to interface with that from R, as well. Calling the ChatGPT web API from R To use the ChatGPT API in any way you first need to sign up and get an API key: The “password” you need to access the web API. It could look something like "sk-5xWWxmbnJvbWU4-M212Z2g5dzlu-MzhucmI5Yj-l4c2RkdmZ26". Of course, that’s not my real API key because that’s something you should keep secret! With an API key at hand you now look up the documentation and learn that this is how you would send a request to the API from the terminal: curl https://api.openai.com/v1/chat/completions \ -H "Authorization: Bearer $OPENAI_API_KEY" \ -H "Content-Type: application/json" \ -d '{ "model": "gpt-3.5-turbo", "messages": [{"role": "user", "content": "What is a banana?"}] }' But how do we send a request to the API using R? What we can do is to “replicate” this call using httr: a popular R package to send HTTP requests. Here’s how this request would be made using httr (with the curl lines as comments above the corresponding httr code) library(httr) api_key
LLaMA-13B reportedly outperforms ChatGPT-like tech despite being 10x smaller.
OpenAI’s chatbot offers paraphrases, whereas Google offers quotes. Which do we prefer?
As a Ruby developer, you can immediately incorporate AI into your applications through the use of the OpenAI API. Our beginners guide will take you through step by step.
Startup Empathy has launched a new tool that uses AI to generate obituaries. It might sound callous, but the results aren't terrible, surprisingly.
During the last two years there has been a plethora of large generative models such as ChatGPT or Stable Diffusion that have been published. Concretely, these models are able to perform tasks such...
A website aggregator is a website that collects data from other websites across the internet and puts...
Summarize web articles.
I was playing around with OpenAI (GPT-3) today, building a reasonably complicated email parser for a...
A new wave of chat bots like ChatGPT use artificial intelligence that could reinvent or even replace the traditional internet search engine.
The wave of enthusiasm around generative networks feels like another Imagenet moment - a step change in what ‘AI’ can do that could generalise far beyond the cool demos. What can it create, and where are the humans in the loop?
Everything I understand about chatgpt · GitHub
A conversational AI system that listens, learns, and challenges
The OpenAI ChatGPT chatbot was just released and is already quite popular. Say hello to the newest chatbot with one
The first obvious casualty of large language models is homework: the real training for everyone, though, and the best way to leverage AI, will be in verifying and editing information.
The way we search online hasn’t changed in decades. A new idea from Google researchers could make it more like talking to a human expert
Unpack the key features and marketing insights of SearchGPT, OpenAI’s innovative search tool and its potential to rival Google’s dominance.
A comparison between a report written by a human and one composed by AI reveals the weaknesses of the latter when it comes to journalism.
The Artifex blog covers the latest news and updates regarding Ghostscript, MuPDF, and SmartOffice. Subjects cover PDF and Postscript, open source, office productivity, new releases, and upcoming events.