<?xml version="1.0" encoding="utf-8"?>
<feed xml:lang="en-us" xmlns="http://www.w3.org/2005/Atom"><title>Simon Willison's Weblog: kakapo</title><link href="http://simonwillison.net/" rel="alternate"/><link href="http://simonwillison.net/tags/kakapo.atom" rel="self"/><id>http://simonwillison.net/</id><updated>2026-04-10T19:07:02+00:00</updated><author><name>Simon Willison</name></author><entry><title>Kākāpō parrots</title><link href="https://simonwillison.net/2026/Apr/10/kakapo/#atom-tag" rel="alternate"/><published>2026-04-10T19:07:02+00:00</published><updated>2026-04-10T19:07:02+00:00</updated><id>https://simonwillison.net/2026/Apr/10/kakapo/#atom-tag</id><summary type="html">
    &lt;p&gt;Lenny &lt;a href="https://twitter.com/lennysan/status/2042615413494939943"&gt;posted&lt;/a&gt; another snippet from &lt;a href="https://simonwillison.net/2026/Apr/2/lennys-podcast/"&gt;our 1 hour 40 minute podcast recording&lt;/a&gt; and it's about kākāpō parrots!&lt;/p&gt;
&lt;p&gt;&lt;video
  src="https://static.simonwillison.net/static/2026/kakapo-lenny.mp4"
  poster="https://static.simonwillison.net/static/2026/kakapo-lenny.jpg"
  controls
  preload="none"
  playsinline
  style="display:block; max-width:400px; width:100%; height:auto; margin:0 auto"
&gt;&lt;track src="https://static.simonwillison.net/static/cors-allow/2026/kakapo-lenny.vtt" kind="captions" srclang="en" label="English"&gt;&lt;/video&gt;
&lt;/p&gt;

    &lt;p&gt;Tags: &lt;a href="https://simonwillison.net/tags/kakapo"&gt;kakapo&lt;/a&gt;&lt;/p&gt;



</summary><category term="kakapo"/></entry><entry><title>Highlights from my conversation about agentic engineering on Lenny's Podcast</title><link href="https://simonwillison.net/2026/Apr/2/lennys-podcast/#atom-tag" rel="alternate"/><published>2026-04-02T20:40:47+00:00</published><updated>2026-04-02T20:40:47+00:00</updated><id>https://simonwillison.net/2026/Apr/2/lennys-podcast/#atom-tag</id><summary type="html">
    &lt;p&gt;I was a guest on Lenny Rachitsky's podcast, in a new episode titled &lt;a href="https://www.lennysnewsletter.com/p/an-ai-state-of-the-union"&gt;An AI state of the union: We've passed the inflection point, dark factories are coming, and automation timelines&lt;/a&gt;. It's available on &lt;a href="https://youtu.be/wc8FBhQtdsA"&gt;YouTube&lt;/a&gt;, &lt;a href="https://open.spotify.com/episode/0DVjwLT6wgtscdB78Qf1BQ"&gt;Spotify&lt;/a&gt;, and &lt;a href="https://podcasts.apple.com/us/podcast/an-ai-state-of-the-union-weve-passed-the/id1627920305?i=1000758850377"&gt;Apple Podcasts&lt;/a&gt;. Here are my highlights from our conversation, with relevant links.&lt;/p&gt;

&lt;iframe style="margin-top: 1.5em; margin-bottom: 1.5em;" width="560" height="315" src="https://www.youtube-nocookie.com/embed/wc8FBhQtdsA" title="Why we’ve passed the AI inflection point and automation has already started | Simon Willison" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen="allowfullscreen"&gt; &lt;/iframe&gt;

&lt;ul&gt;
  &lt;li&gt;&lt;a href="https://simonwillison.net/2026/Apr/2/lennys-podcast/#the-november-inflection-point"&gt;The November inflection point&lt;/a&gt;&lt;/li&gt;
  &lt;li&gt;&lt;a href="https://simonwillison.net/2026/Apr/2/lennys-podcast/#software-engineers-as-bellwethers-for-other-information-workers"&gt;Software engineers as bellwethers for other information workers&lt;/a&gt;&lt;/li&gt;
  &lt;li&gt;&lt;a href="https://simonwillison.net/2026/Apr/2/lennys-podcast/#writing-code-on-my-phone"&gt;Writing code on my phone&lt;/a&gt;&lt;/li&gt;
  &lt;li&gt;&lt;a href="https://simonwillison.net/2026/Apr/2/lennys-podcast/#responsible-vibe-coding"&gt;Responsible vibe coding&lt;/a&gt;&lt;/li&gt;
  &lt;li&gt;&lt;a href="https://simonwillison.net/2026/Apr/2/lennys-podcast/#dark-factories-and-strongdm"&gt;Dark Factories and StrongDM&lt;/a&gt;&lt;/li&gt;
  &lt;li&gt;&lt;a href="https://simonwillison.net/2026/Apr/2/lennys-podcast/#the-bottleneck-has-moved-to-testing"&gt;The bottleneck has moved to testing&lt;/a&gt;&lt;/li&gt;
  &lt;li&gt;&lt;a href="https://simonwillison.net/2026/Apr/2/lennys-podcast/#this-stuff-is-exhausting"&gt;This stuff is exhausting&lt;/a&gt;&lt;/li&gt;
  &lt;li&gt;&lt;a href="https://simonwillison.net/2026/Apr/2/lennys-podcast/#interruptions-cost-a-lot-less-now"&gt;Interruptions cost a lot less now&lt;/a&gt;&lt;/li&gt;
  &lt;li&gt;&lt;a href="https://simonwillison.net/2026/Apr/2/lennys-podcast/#my-ability-to-estimate-software-is-broken"&gt;My ability to estimate software is broken&lt;/a&gt;&lt;/li&gt;
  &lt;li&gt;&lt;a href="https://simonwillison.net/2026/Apr/2/lennys-podcast/#it-s-tough-for-people-in-the-middle"&gt;It's tough for people in the middle&lt;/a&gt;&lt;/li&gt;
  &lt;li&gt;&lt;a href="https://simonwillison.net/2026/Apr/2/lennys-podcast/#it-s-harder-to-evaluate-software"&gt;It's harder to evaluate software&lt;/a&gt;&lt;/li&gt;
  &lt;li&gt;&lt;a href="https://simonwillison.net/2026/Apr/2/lennys-podcast/#the-misconception-that-ai-tools-are-easy"&gt;The misconception that AI tools are easy&lt;/a&gt;&lt;/li&gt;
  &lt;li&gt;&lt;a href="https://simonwillison.net/2026/Apr/2/lennys-podcast/#coding-agents-are-useful-for-security-research-now"&gt;Coding agents are useful for security research now&lt;/a&gt;&lt;/li&gt;
  &lt;li&gt;&lt;a href="https://simonwillison.net/2026/Apr/2/lennys-podcast/#openclaw"&gt;OpenClaw&lt;/a&gt;&lt;/li&gt;
  &lt;li&gt;&lt;a href="https://simonwillison.net/2026/Apr/2/lennys-podcast/#journalists-are-good-at-dealing-with-unreliable-sources"&gt;Journalists are good at dealing with unreliable sources&lt;/a&gt;&lt;/li&gt;
  &lt;li&gt;&lt;a href="https://simonwillison.net/2026/Apr/2/lennys-podcast/#the-pelican-benchmark"&gt;The pelican benchmark&lt;/a&gt;&lt;/li&gt;
  &lt;li&gt;&lt;a href="https://simonwillison.net/2026/Apr/2/lennys-podcast/#and-finally-some-good-news-about-parrots"&gt;And finally, some good news about parrots&lt;/a&gt;&lt;/li&gt;
  &lt;li&gt;&lt;a href="https://simonwillison.net/2026/Apr/2/lennys-podcast/#youtube-chapters"&gt;YouTube chapters&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;h2 id="the-november-inflection-point"&gt;The November inflection point&lt;/h2&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;a href="https://youtu.be/wc8FBhQtdsA?t=269"&gt;4:19&lt;/a&gt; - The end result of these two labs throwing everything they had at making their models better at code is that in November we had what I call the &lt;a href="https://simonwillison.net/tags/november-2025-inflection/"&gt;inflection point&lt;/a&gt; where GPT 5.1 and Claude Opus 4.5 came along.&lt;/p&gt;
&lt;p&gt;They were both incrementally better than the previous models, but in a way that crossed a threshold where previously the code would mostly work, but you had to pay very close attention to it. And suddenly we went from that to... almost all of the time it does what you told it to do, which makes all of the difference in the world.&lt;/p&gt;
&lt;p&gt;Now you can spin up a coding agent and say, &lt;a href="https://simonwillison.net/2026/Feb/25/present/"&gt;build me a Mac application that does this thing&lt;/a&gt;, and you'll get something back which won't just be a buggy pile of rubbish that doesn't do anything.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h2 id="software-engineers-as-bellwethers-for-other-information-workers"&gt;Software engineers as bellwethers for other information workers&lt;/h2&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;a href="https://youtu.be/wc8FBhQtdsA?t=349"&gt;5:49&lt;/a&gt; - I can churn out 10,000 lines of code in a day. And most of it works. Is that good? Like, how do we get from most of it works to all of it works? There are so many new questions that we're facing, which I think makes us a bellwether for other information workers.&lt;/p&gt;
&lt;p&gt;Code is easier than almost every other problem that you pose these agents because code is obviously right or wrong - either it works or it doesn't work. There might be a few subtle hidden bugs, but generally you can tell if the thing actually works.&lt;/p&gt;
&lt;p&gt;If it writes you an essay, if it prepares a lawsuit for you, it's so much harder to derive if it's actually done a good job, and to figure out if it got things right or wrong. But it's happening to us as software engineers. It came for us first.&lt;/p&gt;
&lt;p&gt;And we're figuring out, OK, what do our careers look like? How do we work as teams when part of what we did that used to take most of the time doesn't take most of the time anymore? What does that look like? And it's going to be very interesting seeing how this rolls out to other information work in the future.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Lawyers are falling for this really badly. The &lt;a href="https://www.damiencharlotin.com/hallucinations/"&gt;AI hallucination cases database&lt;/a&gt; is up to 1,228 cases now!&lt;/p&gt;
&lt;p&gt;Plus this bit from the cold open at &lt;a href="https://www.youtube.com/watch?v=wc8FBhQtdsA&amp;amp;t=0s"&gt;the start&lt;/a&gt;:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;It used to be you'd ask ChatGPT for some code, and it would spit out some code, and you'd have to run it and test it. The coding agents take that step for you now. And an open question for me is how many other knowledge work fields are actually prone to these agent loops?&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h2 id="writing-code-on-my-phone"&gt;Writing code on my phone&lt;/h2&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;a href="https://youtu.be/wc8FBhQtdsA?t=499"&gt;8:19&lt;/a&gt; - I write so much of my code on my phone. It's wild. I can get good work done walking the dog along the beach, which is delightful.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;I mainly use the Claude iPhone app for this, both with a regular Claude chat session (which &lt;a href="https://simonwillison.net/2025/Sep/9/claude-code-interpreter/"&gt;can execute code now&lt;/a&gt;) or using it to control &lt;a href="https://code.claude.com/docs/en/claude-code-on-the-web"&gt;Claude Code for web&lt;/a&gt;.&lt;/p&gt;
&lt;h2 id="responsible-vibe-coding"&gt;Responsible vibe coding&lt;/h2&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;a href="https://youtu.be/wc8FBhQtdsA?t=595"&gt;9:55&lt;/a&gt; If you're vibe coding something for yourself, where the only person who gets hurt if it has bugs is you, go wild. That's completely fine. The moment you ship your vibe coding code for other people to use, where your bugs might actually harm somebody else, that's when you need to take a step back.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;See also &lt;a href="https://simonwillison.net/2025/Mar/19/vibe-coding/#when-is-it-ok-to-vibe-code-"&gt;When is it OK to vibe code?&lt;/a&gt;&lt;/p&gt;
&lt;h2 id="dark-factories-and-strongdm"&gt;Dark Factories and StrongDM&lt;/h2&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;a href="https://youtu.be/wc8FBhQtdsA?t=769"&gt;12:49&lt;/a&gt; The reason it's called the dark factory is there's this idea in factory automation that if your factory is so automated that you don't need any people there, you can turn the lights off. Like the machines can operate in complete darkness if you don't need people on the factory floor. What does that look like for software? [...]&lt;/p&gt;
&lt;p&gt;So there's this policy that nobody writes any code: you cannot type code into a computer. And honestly, six months ago, I thought that was crazy. And today, probably 95% of the code that I produce, I didn't type myself. That world is practical already because the latest models are good enough that you can tell them to rename that variable and refactor and add this line there... and they'll just do it - it's faster than you typing on the keyboard yourself.&lt;/p&gt;
&lt;p&gt;The next rule though, is nobody &lt;em&gt;reads&lt;/em&gt; the code. And this is the thing which StrongDM started doing last year.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;I wrote a lot more about &lt;a href="https://simonwillison.net/2026/Feb/7/software-factory/"&gt;StrongDM's dark factory explorations&lt;/a&gt; back in February.&lt;/p&gt;
&lt;h2 id="the-bottleneck-has-moved-to-testing"&gt;The bottleneck has moved to testing&lt;/h2&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;a href="https://youtu.be/wc8FBhQtdsA?t=1287"&gt;21:27&lt;/a&gt; - It used to be, you'd come up with a spec and you hand it to your engineering team. And three weeks later, if you're lucky, they'd come back with an implementation. And now that maybe takes three hours, depending on how well the coding agents are established for that kind of thing. So now what, right? Now, where else are the bottlenecks?&lt;/p&gt;
&lt;p&gt;Anyone who's done any product work knows that your initial ideas are always wrong. What matters is proving them, and testing them.&lt;/p&gt;
&lt;p&gt;We can test things so much faster now because we can build workable prototypes so much quicker. So there's an interesting thing I've been doing in my own work where any feature that I want to design, I'll often prototype three different ways it could work because that takes very little time.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;I've always loved prototyping things, and prototyping is even more valuable now.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;a href="https://youtu.be/wc8FBhQtdsA?t=1360"&gt;22:40&lt;/a&gt; - A UI prototype is free now. ChatGPT and Claude will just build you a very convincing UI for anything that you describe. And that's how you should be working. I think anyone who's doing product design and isn't vibe coding little prototypes is missing out on the most powerful boost that we get in that step.&lt;/p&gt;
&lt;p&gt;But then what do you do? Given your three options that you have instead of one option, how do you prove to yourself which one of those is the best? I don't have a confident answer to that. I expect this is where the good old fashioned usability testing comes in.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;More on prototyping later on:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;a href="https://youtu.be/wc8FBhQtdsA?t=2795"&gt;46:35&lt;/a&gt; - Throughout my entire career, my superpower has been prototyping. I've been very quick at knocking out working prototypes of things. I'm the person who can show up at a meeting and say, look, here's how it could work. And that was kind of my unique selling point. And that's gone. Anyone can do what I could do.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h2 id="this-stuff-is-exhausting"&gt;This stuff is exhausting&lt;/h2&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;a href="https://youtu.be/wc8FBhQtdsA?t=1585"&gt;26:25&lt;/a&gt; - I'm finding that using coding agents well is taking every inch of my 25 years of experience as a software engineer, and it is mentally exhausting. I can fire up four agents in parallel and have them work on four different problems. And by like 11 AM, I am wiped out for the day. [...]&lt;/p&gt;
&lt;p&gt;There's a personal skill we have to learn in finding our new limits - what's a responsible way for us not to burn out.&lt;/p&gt;
&lt;p&gt;I've talked to a lot of people who are losing sleep because they're like, my coding agents could be doing work for me. I'm just going to stay up an extra half hour and set off a bunch of extra things... and then waking up at four in the morning. That's obviously unsustainable. [...]&lt;/p&gt;
&lt;p&gt;There's an element of sort of gambling and addiction to how we're using some of these tools.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h2 id="interruptions-cost-a-lot-less-now"&gt;Interruptions cost a lot less now&lt;/h2&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;a href="https://youtu.be/wc8FBhQtdsA?t=2716"&gt;45:16&lt;/a&gt; - People talk about how important it is not to interrupt your coders. Your coders need to have solid two to four hour blocks of uninterrupted work so they can spin up their mental model and churn out the code. That's changed completely. My programming work, I need two minutes every now and then to prompt my agent about what to do next. And then I can do the other stuff and I can go back. I'm much more interruptible than I used to be.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h2 id="my-ability-to-estimate-software-is-broken"&gt;My ability to estimate software is broken&lt;/h2&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;a href="https://youtu.be/wc8FBhQtdsA?t=1699"&gt;28:19&lt;/a&gt; - I've got 25 years of experience in how long it takes to build something. And that's all completely gone - it doesn't work anymore because I can look at a problem and say that this is going to take two weeks, so it's not worth it. And now it's like... maybe it's going to take 20 minutes because the reason it would have taken two weeks was all of the sort of crufty coding things that the AI is now covering for us.&lt;/p&gt;
&lt;p&gt;I constantly throw tasks at AI that I don't think it'll be able to do because every now and then it does it. And when it doesn't do it, you learn, right? But when it &lt;em&gt;does&lt;/em&gt; do something, especially something that the previous models couldn't do, that's actually cutting edge AI research.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;And a related anecdote:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;a href="https://youtu.be/wc8FBhQtdsA?t=2216"&gt;36:56&lt;/a&gt; - A lot of my friends have been talking about how they have this backlog of side projects, right? For the last 10, 15 years, they've got projects they never quite finished. And some of them are like, well, I've done them all now. Last couple of months, I just went through and every evening I'm like, let's take that project and finish it. And they almost feel a sort of sense of loss at the end where they're like, well, okay, my backlog's gone. Now what am I going to build?&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h2 id="it-s-tough-for-people-in-the-middle"&gt;It's tough for people in the middle&lt;/h2&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;a href="https://youtu.be/wc8FBhQtdsA?t=1769"&gt;29:29&lt;/a&gt; - So ThoughtWorks, the big IT consultancy, &lt;a href="https://www.thoughtworks.com/insights/articles/reflections-future-software-engineering-retreat"&gt;did an offsite about a month ago&lt;/a&gt;, and they got a whole bunch of engineering VPs in from different companies to talk about this stuff. And one of the interesting theories they came up with is they think this stuff is really good for experienced engineers, like it amplifies their skills. It's really good for new engineers because it solves so many of those onboarding problems. The problem is the people in the middle. If you're mid-career, if you haven't made it to sort of super senior engineer yet, but you're not sort of new either, that's the group which is probably in the most trouble right now.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;I mentioned &lt;a href="https://blog.cloudflare.com/cloudflare-1111-intern-program/"&gt;Cloudflare hiring 1,000 interns&lt;/a&gt;, and Shopify too.&lt;/p&gt;
&lt;p&gt;Lenny asked for my advice for people stuck in that middle:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;a href="https://youtu.be/wc8FBhQtdsA?t=1881"&gt;31:21&lt;/a&gt; - That's a big responsibility you're putting on me there! I think the way forward is to lean into this stuff and figure out how do I help this make me better?&lt;/p&gt;
&lt;p&gt;A lot of people worry about skill atrophy: if the AI is doing it for you, you're not learning anything. I think if you're worried about that, you push back at it. You have to be mindful about how you're applying the technology and think, okay, I've been given this thing that can answer any question and &lt;em&gt;often&lt;/em&gt; gets it right. How can I use this to amplify my own skills, to learn new things, to take on much more ambitious projects? [...]&lt;/p&gt;
&lt;p&gt;&lt;a href="https://youtu.be/wc8FBhQtdsA?t=1985"&gt;33:05&lt;/a&gt; - Everything is changing so fast right now. The only universal skill is being able to roll with the changes. That's the thing that we all need.&lt;/p&gt;
&lt;p&gt;The term that comes up most in these conversations about how you can be great with AI is &lt;em&gt;agency&lt;/em&gt;. I think agents have no agency at all. I would argue that the one thing AI can never have is agency because it doesn't have human motivations.&lt;/p&gt;
&lt;p&gt;So I'd say that's the thing is to invest in your own agency and invest in how to use this technology to get better at what you do and to do new things.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h2 id="it-s-harder-to-evaluate-software"&gt;It's harder to evaluate software&lt;/h2&gt;
&lt;p&gt;The fact that it's so easy to create software with detailed documentation and robust tests means it's harder to figure out what's a credible project.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;a href="https://youtu.be/wc8FBhQtdsA?t=2267"&gt;37:47&lt;/a&gt; Sometimes I'll have an idea for a piece of software, Python library or whatever, and I can knock it out in like an hour and get to a point where it's got documentation and tests and all of those things, and it looks like the kind of software that previously I'd have spent several weeks on - and I can stick it up on GitHub&lt;/p&gt;
&lt;p&gt;And yet... I don't believe in it. And the reason I don't believe in it is that I got to rush through all of those things... I think the quality is probably good, but I haven't spent enough time with it to feel confident in that quality. Most importantly, I &lt;em&gt;haven't used it yet&lt;/em&gt;.&lt;/p&gt;
&lt;p&gt;It turns out when I'm using somebody else's software, the thing I care most about is I want them to have used it for months.&lt;/p&gt;
&lt;p&gt;I've got some very cool software that I built that I've &lt;em&gt;never used&lt;/em&gt;. It was quicker to build it than to actually try and use it!&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h2 id="the-misconception-that-ai-tools-are-easy"&gt;The misconception that AI tools are easy&lt;/h2&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;a href="https://youtu.be/wc8FBhQtdsA?t=2491"&gt;41:31&lt;/a&gt; - Everyone's like, oh, it must be easy. It's just a chat bot. It's not easy. That's one of the great misconceptions in AI is that using these tools effectively is easy. It takes a lot of practice and it takes a lot of trying things that didn't work and trying things that did work.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h2 id="coding-agents-are-useful-for-security-research-now"&gt;Coding agents are useful for security research now&lt;/h2&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;a href="https://youtu.be/wc8FBhQtdsA?t=1144"&gt;19:04&lt;/a&gt; - In the past sort of three to six months, they've started being credible as security researchers, which is sending shockwaves through the security research industry.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;See Thomas Ptacek: &lt;a href="https://sockpuppet.org/blog/2026/03/30/vulnerability-research-is-cooked/"&gt;Vulnerability Research Is Cooked&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;At the same time, open source projects are being bombarded with junk security reports:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;a href="https://youtu.be/wc8FBhQtdsA?t=1205"&gt;20:05&lt;/a&gt; - There are these people who don't know what they're doing, who are asking ChatGPT to find a security hole and then reporting it to the maintainer. And the report looks good. ChatGPT can produce a very well formatted report of a vulnerability. It's a total waste of time. It's not actually verified as being a real problem.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;A good example of the right way to do this is &lt;a href="https://blog.mozilla.org/en/firefox/hardening-firefox-anthropic-red-team/"&gt;Anthropic's collaboration with Firefox&lt;/a&gt;, where Anthropic's security team &lt;em&gt;verified&lt;/em&gt; every security problem before passing them to Mozilla.&lt;/p&gt;
&lt;h2 id="openclaw"&gt;OpenClaw&lt;/h2&gt;
&lt;p&gt;Of course we had to talk about OpenClaw! Lenny had his running on a Mac Mini.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;a href="https://youtu.be/wc8FBhQtdsA?t=5363"&gt;1:29:23&lt;/a&gt; - OpenClaw demonstrates that people want a personal digital assistant so much that they are willing to not just overlook the security side of things, but also getting the thing running is not easy. You've got to create API keys and tokens and install stuff. It's not trivial to get set up and hundreds of thousands of people got it set up. [...]&lt;/p&gt;
&lt;p&gt;The first line of code for OpenClaw was written on November the 25th. And then in the Super Bowl, there was an ad for AI.com, which was effectively a vaporware white labeled OpenClaw hosting provider. So we went from first line of code in November to Super Bowl ad in what? Three and a half months.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;I continue to love Drew Breunig's description of OpenClaw as a digital pet:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;A friend of mine said that OpenClaw is basically a Tamagotchi. It's a digital pet and you buy the Mac Mini as an aquarium.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h2 id="journalists-are-good-at-dealing-with-unreliable-sources"&gt;Journalists are good at dealing with unreliable sources&lt;/h2&gt;
&lt;p&gt;In talking about my explorations of AI for data journalism through &lt;a href="https://datasette.io/"&gt;Datasette&lt;/a&gt;:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;a href="https://youtu.be/wc8FBhQtdsA?t=5698"&gt;1:34:58&lt;/a&gt; - You would have thought that AI is a very bad fit for journalism where the whole idea is to find the truth. But the flip side is journalists deal with untrustworthy sources all the time. The art of journalism is you talk to a bunch of people and some of them lie to you and you figure out what's true. So as long as the journalist treats the AI as yet another unreliable source, they're actually better equipped to work with AI than most other professions are.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h2 id="the-pelican-benchmark"&gt;The pelican benchmark&lt;/h2&gt;
&lt;p&gt;Obviously we talked about &lt;a href="https://simonwillison.net/tags/pelican-riding-a-bicycle/"&gt;pelicans riding bicycles&lt;/a&gt;:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;a href="https://youtu.be/wc8FBhQtdsA?t=3370"&gt;56:10&lt;/a&gt; - There appears to be a very strong correlation between how good their drawing of a pelican riding a bicycle is and how good they are at everything else. And nobody can explain to me why that is. [...]&lt;/p&gt;
&lt;p&gt;People kept on asking me, what if labs cheat on the benchmark? And my answer has always been, really, &lt;a href="https://simonwillison.net/2025/Nov/13/training-for-pelicans-riding-bicycles/"&gt;all I want from life is a really good picture of a pelican riding a bicycle&lt;/a&gt;. And if I can trick every AI lab in the world into cheating on benchmarks to get it, then that just achieves my goal.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;a href="https://youtu.be/wc8FBhQtdsA?t=3596"&gt;59:56&lt;/a&gt; - I think something people often miss is that this space is inherently funny. The fact that we have these incredibly expensive, power hungry, supposedly the most advanced computers of all time. And if you ask them to draw a pelican on a bicycle, it looks like a five-year-old drew it. That's really funny to me.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h2 id="and-finally-some-good-news-about-parrots"&gt;And finally, some good news about parrots&lt;/h2&gt;
&lt;p&gt;Lenny asked if I had anything else I wanted to leave listeners with to wrap up the show, so I went with the best piece of news in the world right now.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;a href="https://youtu.be/wc8FBhQtdsA?t=5890"&gt;1:38:10&lt;/a&gt; - There is a rare parrot in New Zealand called the Kākāpō. There are only 250 of these parrots left in the world. They are flightless nocturnal parrots - beautiful green dumpy looking things. And the good news is they're having a fantastic breeding season in 2026,&lt;/p&gt;
&lt;p&gt;They only breed when the Rimu trees in New Zealand have a mass fruiting season, and the Rimu trees haven't done that since 2022 - so there has not been a single baby kākāpō born in four years.&lt;/p&gt;
&lt;p&gt;This year, the Rimu trees are in fruit. The kākāpō are breeding. There have been dozens of new chicks born. It's a really, really good time. It's great news for rare New Zealand parrots and you should look them up because they're delightful.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Everyone should &lt;a href="https://www.youtube.com/live/LDSWtyU6-Lg"&gt;watch the live stream of Rakiura on her nest with two chicks&lt;/a&gt;!&lt;/p&gt;
&lt;h2 id="youtube-chapters"&gt;YouTube chapters&lt;/h2&gt;
&lt;p&gt;Here's the full list of chapters Lenny's team defined for the YouTube video:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;a href="https://www.youtube.com/watch?v=wc8FBhQtdsA"&gt;00:00&lt;/a&gt;: Introduction to Simon Willison&lt;/li&gt;
&lt;li&gt;
&lt;a href="https://www.youtube.com/watch?v=wc8FBhQtdsA&amp;amp;t=160s"&gt;02:40&lt;/a&gt;: The November 2025 inflection point&lt;/li&gt;
&lt;li&gt;
&lt;a href="https://www.youtube.com/watch?v=wc8FBhQtdsA&amp;amp;t=481s"&gt;08:01&lt;/a&gt;: What's possible now with AI coding&lt;/li&gt;
&lt;li&gt;
&lt;a href="https://www.youtube.com/watch?v=wc8FBhQtdsA&amp;amp;t=642s"&gt;10:42&lt;/a&gt;: Vibe coding vs. agentic engineering&lt;/li&gt;
&lt;li&gt;
&lt;a href="https://www.youtube.com/watch?v=wc8FBhQtdsA&amp;amp;t=837s"&gt;13:57&lt;/a&gt;: The dark-factory pattern&lt;/li&gt;
&lt;li&gt;
&lt;a href="https://www.youtube.com/watch?v=wc8FBhQtdsA&amp;amp;t=1241s"&gt;20:41&lt;/a&gt;: Where bottlenecks have shifted&lt;/li&gt;
&lt;li&gt;
&lt;a href="https://www.youtube.com/watch?v=wc8FBhQtdsA&amp;amp;t=1416s"&gt;23:36&lt;/a&gt;: Where human brains will continue to be valuable&lt;/li&gt;
&lt;li&gt;
&lt;a href="https://www.youtube.com/watch?v=wc8FBhQtdsA&amp;amp;t=1532s"&gt;25:32&lt;/a&gt;: Defending of software engineers&lt;/li&gt;
&lt;li&gt;
&lt;a href="https://www.youtube.com/watch?v=wc8FBhQtdsA&amp;amp;t=1752s"&gt;29:12&lt;/a&gt;: Why experienced engineers get better results&lt;/li&gt;
&lt;li&gt;
&lt;a href="https://www.youtube.com/watch?v=wc8FBhQtdsA&amp;amp;t=1848s"&gt;30:48&lt;/a&gt;: Advice for avoiding the permanent underclass&lt;/li&gt;
&lt;li&gt;
&lt;a href="https://www.youtube.com/watch?v=wc8FBhQtdsA&amp;amp;t=2032s"&gt;33:52&lt;/a&gt;: Leaning into AI to amplify your skills&lt;/li&gt;
&lt;li&gt;
&lt;a href="https://www.youtube.com/watch?v=wc8FBhQtdsA&amp;amp;t=2112s"&gt;35:12&lt;/a&gt;: Why Simon says he's working harder than ever&lt;/li&gt;
&lt;li&gt;
&lt;a href="https://www.youtube.com/watch?v=wc8FBhQtdsA&amp;amp;t=2243s"&gt;37:23&lt;/a&gt;: The market for pre-2022 human-written code&lt;/li&gt;
&lt;li&gt;
&lt;a href="https://www.youtube.com/watch?v=wc8FBhQtdsA&amp;amp;t=2401s"&gt;40:01&lt;/a&gt;: Prediction: 50% of engineers writing 95% AI code by the end of 2026&lt;/li&gt;
&lt;li&gt;
&lt;a href="https://www.youtube.com/watch?v=wc8FBhQtdsA&amp;amp;t=2674s"&gt;44:34&lt;/a&gt;: The impact of cheap code&lt;/li&gt;
&lt;li&gt;
&lt;a href="https://www.youtube.com/watch?v=wc8FBhQtdsA&amp;amp;t=2907s"&gt;48:27&lt;/a&gt;: Simon's AI stack&lt;/li&gt;
&lt;li&gt;
&lt;a href="https://www.youtube.com/watch?v=wc8FBhQtdsA&amp;amp;t=3248s"&gt;54:08&lt;/a&gt;: Using AI for research&lt;/li&gt;
&lt;li&gt;
&lt;a href="https://www.youtube.com/watch?v=wc8FBhQtdsA&amp;amp;t=3312s"&gt;55:12&lt;/a&gt;: The pelican-riding-a-bicycle benchmark&lt;/li&gt;
&lt;li&gt;
&lt;a href="https://www.youtube.com/watch?v=wc8FBhQtdsA&amp;amp;t=3541s"&gt;59:01&lt;/a&gt;: The inherent ridiculousness of AI&lt;/li&gt;
&lt;li&gt;
&lt;a href="https://www.youtube.com/watch?v=wc8FBhQtdsA&amp;amp;t=3652s"&gt;1:00:52&lt;/a&gt;: Hoarding things you know how to do&lt;/li&gt;
&lt;li&gt;
&lt;a href="https://www.youtube.com/watch?v=wc8FBhQtdsA&amp;amp;t=4101s"&gt;1:08:21&lt;/a&gt;: Red/green TDD pattern for better AI code&lt;/li&gt;
&lt;li&gt;
&lt;a href="https://www.youtube.com/watch?v=wc8FBhQtdsA&amp;amp;t=4483s"&gt;1:14:43&lt;/a&gt;: Starting projects with good templates&lt;/li&gt;
&lt;li&gt;
&lt;a href="https://www.youtube.com/watch?v=wc8FBhQtdsA&amp;amp;t=4591s"&gt;1:16:31&lt;/a&gt;: The lethal trifecta and prompt injection&lt;/li&gt;
&lt;li&gt;
&lt;a href="https://www.youtube.com/watch?v=wc8FBhQtdsA&amp;amp;t=4913s"&gt;1:21:53&lt;/a&gt;: Why 97% effectiveness is a failing grade&lt;/li&gt;
&lt;li&gt;
&lt;a href="https://www.youtube.com/watch?v=wc8FBhQtdsA&amp;amp;t=5119s"&gt;1:25:19&lt;/a&gt;: The normalization of deviance&lt;/li&gt;
&lt;li&gt;
&lt;a href="https://www.youtube.com/watch?v=wc8FBhQtdsA&amp;amp;t=5312s"&gt;1:28:32&lt;/a&gt;: OpenClaw: the security nightmare everyone is looking past&lt;/li&gt;
&lt;li&gt;
&lt;a href="https://www.youtube.com/watch?v=wc8FBhQtdsA&amp;amp;t=5662s"&gt;1:34:22&lt;/a&gt;: What's next for Simon&lt;/li&gt;
&lt;li&gt;
&lt;a href="https://www.youtube.com/watch?v=wc8FBhQtdsA&amp;amp;t=5807s"&gt;1:36:47&lt;/a&gt;: Zero-deliverable consulting&lt;/li&gt;
&lt;li&gt;
&lt;a href="https://www.youtube.com/watch?v=wc8FBhQtdsA&amp;amp;t=5885s"&gt;1:38:05&lt;/a&gt;: Good news about Kakapo parrots&lt;/li&gt;
&lt;/ul&gt;
    
        &lt;p&gt;Tags: &lt;a href="https://simonwillison.net/tags/ai"&gt;ai&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/kakapo"&gt;kakapo&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/generative-ai"&gt;generative-ai&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/llms"&gt;llms&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/podcast-appearances"&gt;podcast-appearances&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/coding-agents"&gt;coding-agents&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/agentic-engineering"&gt;agentic-engineering&lt;/a&gt;&lt;/p&gt;
    

</summary><category term="ai"/><category term="kakapo"/><category term="generative-ai"/><category term="llms"/><category term="podcast-appearances"/><category term="coding-agents"/><category term="agentic-engineering"/></entry><entry><title>February sponsors-only newsletter</title><link href="https://simonwillison.net/2026/Mar/2/february-newsletter/#atom-tag" rel="alternate"/><published>2026-03-02T14:53:15+00:00</published><updated>2026-03-02T14:53:15+00:00</updated><id>https://simonwillison.net/2026/Mar/2/february-newsletter/#atom-tag</id><summary type="html">
    &lt;p&gt;I just sent the February edition of my &lt;a href="https://github.com/sponsors/simonw/"&gt;sponsors-only monthly newsletter&lt;/a&gt;. If you are a sponsor (or if you start a sponsorship now) you can &lt;a href="https://github.com/simonw-private/monthly/blob/main/2026-02-february.md"&gt;access it here&lt;/a&gt;. In this month's newsletter:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;More OpenClaw, and Claws in general&lt;/li&gt;
&lt;li&gt;I started a not-quite-a-book about Agentic Engineering&lt;/li&gt;
&lt;li&gt;StrongDM, Showboat and Rodney&lt;/li&gt;
&lt;li&gt;Kākāpō breeding season&lt;/li&gt;
&lt;li&gt;Model releases&lt;/li&gt;
&lt;li&gt;What I'm using, February 2026 edition&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Here's &lt;a href="https://gist.github.com/simonw/36f567d1b3f8bb4ab4d872d477fbb295"&gt;a copy of the January newsletter&lt;/a&gt; as a preview of what you'll get. Pay $10/month to stay a month ahead of the free copy!&lt;/p&gt;
&lt;p&gt;I use Claude as a proofreader for spelling and grammar via &lt;a href="https://simonwillison.net/guides/agentic-engineering-patterns/prompts/#proofreader"&gt;this prompt&lt;/a&gt; which also asks it to "Spot any logical errors or factual mistakes". I'm delighted to report that Claude Opus 4.6 called me out on this one:&lt;/p&gt;
&lt;p&gt;&lt;img alt="5. &amp;quot;No new chicks for four years (due to a lack of fruiting rimu trees)&amp;quot;
The phrasing &amp;quot;lack of fruiting rimu trees&amp;quot; is slightly imprecise. The issue isn't that rimu trees failed to fruit at all, but that there was no mass fruiting (masting) event, which is the specific trigger for kākāpō breeding. Consider &amp;quot;due to a lack of rimu masting&amp;quot; or &amp;quot;due to a lack of mass rimu fruiting.&amp;quot;" src="https://static.simonwillison.net/static/2026/claude-fact-check.jpg" /&gt;&lt;/p&gt;

    &lt;p&gt;Tags: &lt;a href="https://simonwillison.net/tags/newsletter"&gt;newsletter&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/kakapo"&gt;kakapo&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/claude"&gt;claude&lt;/a&gt;&lt;/p&gt;



</summary><category term="newsletter"/><category term="kakapo"/><category term="claude"/></entry><entry><title>First kākāpō chick in four years hatches on Valentine's Day</title><link href="https://simonwillison.net/2026/Feb/17/first-kakapo-chick-in-four-years/#atom-tag" rel="alternate"/><published>2026-02-17T14:09:43+00:00</published><updated>2026-02-17T14:09:43+00:00</updated><id>https://simonwillison.net/2026/Feb/17/first-kakapo-chick-in-four-years/#atom-tag</id><summary type="html">
    
&lt;p&gt;&lt;strong&gt;&lt;a href="https://www.doc.govt.nz/news/media-releases/2026-media-releases/first-kakapo-chick-in-four-years-hatches-on-valentines-day/"&gt;First kākāpō chick in four years hatches on Valentine&amp;#x27;s Day&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;
First chick of &lt;a href="https://simonwillison.net/2026/Jan/8/llm-predictions-for-2026/#1-year-k-k-p-parrots-will-have-an-outstanding-breeding-season"&gt;the 2026 breeding season&lt;/a&gt;!&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Kākāpō Yasmine hatched an egg fostered from kākāpō Tīwhiri on Valentine's Day, bringing the total number of kākāpō to 237 – though it won’t be officially added to the population until it fledges.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Here's why the egg was fostered:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;"Kākāpō mums typically have the best outcomes when raising a maximum of two chicks. Biological mum Tīwhiri has four fertile eggs this season already, while Yasmine, an experienced foster mum, had no fertile eggs."&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;And an &lt;a href="https://bsky.app/profile/digs.bsky.social/post/3mf25glzt2c2b"&gt;update from conservation biologist Andrew Digby&lt;/a&gt; - a second chick hatched this morning!&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;The second #kakapo chick of the #kakapo2026 breeding season hatched this morning: Hine Taumai-A1-2026 on Ako's nest on Te Kākahu. We transferred the egg from Anchor two nights ago. This is Ako's first-ever chick, which is just a few hours old in this video.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;That post &lt;a href="https://bsky.app/profile/digs.bsky.social/post/3mf25glzt2c2b"&gt;has a video&lt;/a&gt; of mother and chick.&lt;/p&gt;
&lt;p&gt;&lt;img alt="A beautiful charismatic green Kākāp feeding a little grey chick" src="https://static.simonwillison.net/static/2026/kakapo-plus-chick.jpg" /&gt;

    &lt;p&gt;&lt;small&gt;&lt;/small&gt;Via &lt;a href="https://www.metafilter.com/212231/Happy-Valen-Kkp-Tines"&gt;MetaFilter&lt;/a&gt;&lt;/small&gt;&lt;/p&gt;


    &lt;p&gt;Tags: &lt;a href="https://simonwillison.net/tags/kakapo"&gt;kakapo&lt;/a&gt;&lt;/p&gt;



</summary><category term="kakapo"/></entry><entry><title>Kākāpō mug by Karen James</title><link href="https://simonwillison.net/2026/Feb/8/kakapo-mug/#atom-tag" rel="alternate"/><published>2026-02-08T17:25:07+00:00</published><updated>2026-02-08T17:25:07+00:00</updated><id>https://simonwillison.net/2026/Feb/8/kakapo-mug/#atom-tag</id><summary type="html">
    &lt;p&gt;Friend and neighbour &lt;a href="https://www.etsy.com/shop/KarenJamesMakes"&gt;Karen James&lt;/a&gt; made me a Kākāpō mug. It has a charismatic Kākāpō, four Kākāpō chicks (in celebration of the &lt;a href="https://simonwillison.net/2026/Jan/8/llm-predictions-for-2026/#1-year-k-k-p-parrots-will-have-an-outstanding-breeding-season"&gt;2026 breeding season&lt;/a&gt;) and even has some &lt;a href="https://www.theguardian.com/world/2026/jan/13/nz-kakapo-mating-season"&gt;rimu fruit&lt;/a&gt;!&lt;/p&gt;
&lt;p&gt;&lt;img src="https://static.simonwillison.net/static/2026/kakapo-mug-1.jpg" alt="A simply spectacular sgraffito ceramic mug with a bold, charismatic Kākāpō parrot taking up most of the visible space. It has a yellow beard and green feathers." style="max-width: 100%;" /&gt;&lt;/p&gt;
&lt;p&gt;&lt;img src="https://static.simonwillison.net/static/2026/kakapo-mug-2.jpg" alt="Another side of the mug, two cute grey Kākāpō chicks are visible and three red rimu fruit that look like berries, one on the floor and two hanging from wiry branches." style="max-width: 100%;" /&gt;&lt;/p&gt;
&lt;p&gt;I love it so much.&lt;/p&gt;

    &lt;p&gt;Tags: &lt;a href="https://simonwillison.net/tags/kakapo"&gt;kakapo&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/art"&gt;art&lt;/a&gt;&lt;/p&gt;



</summary><category term="kakapo"/><category term="art"/></entry><entry><title>Kākāpō Cam: Rakiura live stream</title><link href="https://simonwillison.net/2026/Jan/25/kakapo-cam/#atom-tag" rel="alternate"/><published>2026-01-25T04:53:01+00:00</published><updated>2026-01-25T04:53:01+00:00</updated><id>https://simonwillison.net/2026/Jan/25/kakapo-cam/#atom-tag</id><summary type="html">
    
&lt;p&gt;&lt;strong&gt;&lt;a href="https://www.doc.govt.nz/our-work/kakapo-recovery/what-we-do/kakapo-cam-rakiura-live-stream/"&gt;Kākāpō Cam: Rakiura live stream&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;
Critical update for this year's &lt;a href="https://simonwillison.net/2026/Jan/8/llm-predictions-for-2026/#1-year-k-k-p-parrots-will-have-an-outstanding-breeding-season"&gt;Kākāpō breeding season&lt;/a&gt;: the New Zealand Department of Conservation have a livestream running of Rakiura's nest!&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;You’re looking at the underground nest of 23-year-old Rakiura. She has chosen this same site to nest for all seven breeding seasons since 2008, a large cavity under a rātā tree. Because she returns to the site so reliably, we’ve been able to make modifications over the years to keep it safe and dry, including adding a well-placed hatch for monitoring eggs and chicks.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Rakiura is a legendary Kākāpō:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Rakiura hatched on 19 February 2002 on Whenua Hou/Codfish Island. She is the offspring of Flossie and Bill. Her name comes from the te reo Māori name for Stewart Island, the place where most of the founding kākāpō population originated.&lt;/p&gt;
&lt;p&gt;Rakiura has nine living descendants, three females and six males, across six breeding seasons. In 2008 came Tōitiiti, in 2009 Tamahou and Te Atapō, in 2011 Tia and Tūtoko, in 2014 Taeatanga and Te Awa, in 2019 Mati-mā and Tautahi. She also has many grandchicks.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;She laid her first egg of the season at 4:30pm NZ time on 22nd January. The livestream went live shortly afterwards, once she committed to this nest.&lt;/p&gt;
&lt;p&gt;The stream is &lt;a href="https://www.youtube.com/watch?v=BfGL7A2YgUY"&gt;on YouTube&lt;/a&gt;. I &lt;a href="https://gisthost.github.io/?dc78322de89a2191c593215f109c65d7/index.html"&gt;used Claude Code&lt;/a&gt; to write &lt;a href="https://tools.simonwillison.net/python/#livestream-gifpy"&gt;a livestream-gif.py script&lt;/a&gt; and used that to capture this sped-up video of the last few hours of footage, within which you can catch a glimpse of the egg!&lt;/p&gt;
&lt;video autoplay muted loop controls playsinline style="width: 100%;"&gt;
  &lt;source src="https://static.simonwillison.net/static/2026/kakapo-timelapse.mp4" type="video/mp4"&gt;
&lt;/video&gt;

    &lt;p&gt;&lt;small&gt;&lt;/small&gt;Via &lt;a href="https://www.metafilter.com/211927/The-only-parrot-to-have-a-polygynous-lek-breeding-system-sits-on-an-egg"&gt;MetaFilter&lt;/a&gt;&lt;/small&gt;&lt;/p&gt;


    &lt;p&gt;Tags: &lt;a href="https://simonwillison.net/tags/youtube"&gt;youtube&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/kakapo"&gt;kakapo&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/conservation"&gt;conservation&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/claude-code"&gt;claude-code&lt;/a&gt;&lt;/p&gt;



</summary><category term="youtube"/><category term="kakapo"/><category term="conservation"/><category term="claude-code"/></entry><entry><title>LLM predictions for 2026, shared with Oxide and Friends</title><link href="https://simonwillison.net/2026/Jan/8/llm-predictions-for-2026/#atom-tag" rel="alternate"/><published>2026-01-08T19:42:13+00:00</published><updated>2026-01-08T19:42:13+00:00</updated><id>https://simonwillison.net/2026/Jan/8/llm-predictions-for-2026/#atom-tag</id><summary type="html">
    &lt;p&gt;I joined a recording of the Oxide and Friends podcast on Tuesday to talk about 1, 3 and 6 year predictions for the tech industry. This is my second appearance on their annual predictions episode, you can see &lt;a href="https://simonwillison.net/2025/Jan/10/ai-predictions/"&gt;my predictions from January 2025 here&lt;/a&gt;. Here's &lt;a href="https://oxide-and-friends.transistor.fm/episodes/predictions-2026"&gt;the page for this year's episode&lt;/a&gt;, with options to listen in all of your favorite podcast apps or &lt;a href="https://www.youtube.com/watch?v=lVDhQMiAbR8"&gt;directly on YouTube&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;Bryan Cantrill started the episode by declaring that he's never been so unsure about what's coming in the next year. I share that uncertainty - the significant advances in coding agents just in the last two months have left me certain that things will change significantly, but unclear as to what those changes will be.&lt;/p&gt;
&lt;p&gt;Here are the predictions I shared in the episode.&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;&lt;a href="https://simonwillison.net/2026/Jan/8/llm-predictions-for-2026/#1-year-it-will-become-undeniable-that-llms-write-good-code"&gt;1 year: It will become undeniable that LLMs write good code&lt;/a&gt;&lt;/li&gt;
  &lt;li&gt;&lt;a href="https://simonwillison.net/2026/Jan/8/llm-predictions-for-2026/#1-year-we-re-finally-going-to-solve-sandboxing"&gt;1 year: We're finally going to solve sandboxing&lt;/a&gt;&lt;/li&gt;
  &lt;li&gt;&lt;a href="https://simonwillison.net/2026/Jan/8/llm-predictions-for-2026/#1-year-a-challenger-disaster-for-coding-agent-security"&gt;1 year: A "Challenger disaster" for coding agent security&lt;/a&gt;&lt;/li&gt;
  &lt;li&gt;&lt;a href="https://simonwillison.net/2026/Jan/8/llm-predictions-for-2026/#1-year-k-k-p-parrots-will-have-an-outstanding-breeding-season"&gt;1 year: Kākāpō parrots will have an outstanding breeding season&lt;/a&gt;&lt;/li&gt;
  &lt;li&gt;&lt;a href="https://simonwillison.net/2026/Jan/8/llm-predictions-for-2026/#3-years-the-coding-agents-jevons-paradox-for-software-engineering-will-resolve-one-way-or-the-other"&gt;3 years: the coding agents Jevons paradox for software engineering will resolve, one way or the other&lt;/a&gt;&lt;/li&gt;
  &lt;li&gt;&lt;a href="https://simonwillison.net/2026/Jan/8/llm-predictions-for-2026/#3-years-someone-will-build-a-new-browser-using-mainly-ai-assisted-coding-and-it-won-t-even-be-a-surprise"&gt;3 years: Someone will build a new browser using mainly AI-assisted coding and it won't even be a surprise&lt;/a&gt;&lt;/li&gt;
  &lt;li&gt;&lt;a href="https://simonwillison.net/2026/Jan/8/llm-predictions-for-2026/#6-years-typing-code-by-hand-will-go-the-way-of-punch-cards"&gt;6 years: Typing code by hand will go the way of punch cards&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;h4 id="1-year-it-will-become-undeniable-that-llms-write-good-code"&gt;1 year: It will become undeniable that LLMs write good code &lt;a href="https://www.youtube.com/watch?v=lVDhQMiAbR8&amp;amp;t=1167s" class="predictions-video-link"&gt;▶ 19:27&lt;/a&gt;&lt;/h4&gt;
&lt;blockquote&gt;
&lt;p&gt;I think that there are still people out there who are convinced that LLMs cannot write good code. Those people are in for a very nasty shock in 2026. I do not think it will be possible to get to the end of even the next three months while still holding on to that idea that the code they write is all junk and it's it's likely any decent human programmer will write better code than they will.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;In 2023, saying that LLMs write garbage code was entirely correct. For most of 2024 that stayed true. In 2025 that changed, but you could be forgiven for continuing to hold out. In 2026 the quality of LLM-generated code will become impossible to deny.&lt;/p&gt;
&lt;p&gt;I base this on my own experience - I've spent more time exploring &lt;a href="https://simonwillison.net/tags/ai-assisted-programming/"&gt;AI-assisted programming&lt;/a&gt; than most.&lt;/p&gt;
&lt;p&gt;The key change in 2025 (see &lt;a href="https://simonwillison.net/2025/Dec/31/the-year-in-llms/#the-year-of-reasoning-"&gt;my overview for the year&lt;/a&gt;) was the introduction of "reasoning models" trained specifically against code using Reinforcement Learning. The major labs spent a full year competing with each other on who could get the best code capabilities from their models, and that problem turns out to be perfectly attuned to RL since code challenges come with built-in verifiable success conditions.&lt;/p&gt;
&lt;p&gt;Since Claude Opus 4.5 and GPT-5.2 came out in November and December respectively the amount of code I've written by hand has dropped to a single digit percentage of my overall output. The same is true for many other expert programmers I know.&lt;/p&gt;
&lt;p&gt;At this point if you continue to argue that LLMs write useless code you're damaging your own credibility.&lt;/p&gt;
&lt;h4 id="1-year-we-re-finally-going-to-solve-sandboxing"&gt;1 year: We're finally going to solve sandboxing &lt;a href="https://www.youtube.com/watch?v=lVDhQMiAbR8&amp;amp;t=1205s" class="predictions-video-link"&gt;▶ 20:05&lt;/a&gt;&lt;/h4&gt;
&lt;blockquote&gt;
&lt;p&gt;I think this year is the year we're going to solve sandboxing. I want to run code other people have written on my computing devices without it destroying my computing devices if it's malicious or has bugs. [...] It's crazy that it's 2026 and I still &lt;code&gt;pip install&lt;/code&gt; random code and then execute it in a way that it can steal all of my data and delete all my files. [...] I don't want to run a piece of code on any of my devices that somebody else wrote outside of sandbox ever again.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;This isn't just about LLMs, but it becomes even more important now there are so many more people writing code often without knowing what they're doing. Sandboxing is also a key part of the battle against prompt injection.&lt;/p&gt;
&lt;p&gt;We have a &lt;em&gt;lot&lt;/em&gt; of promising technologies in play already for this - containers and WebAssembly being the two I'm most optimistic about. There's real commercial value involved in solving this problem. The pieces are there, what's needed is UX work to reduce the friction in using them productively and securely.&lt;/p&gt;
&lt;h4 id="1-year-a-challenger-disaster-for-coding-agent-security"&gt;1 year: A "Challenger disaster" for coding agent security &lt;a href="https://www.youtube.com/watch?v=lVDhQMiAbR8&amp;amp;t=1281s" class="predictions-video-link"&gt;▶ 21:21&lt;/a&gt;&lt;/h4&gt;
&lt;blockquote&gt;
&lt;p&gt;I think we're due a Challenger disaster with respect to coding agent security[...] I think so many people, myself included, are running these coding agents practically as root, right? We're letting them do all of this stuff. And every time I do it, my computer doesn't get wiped. I'm like, "oh, it's fine". [...] The worst version of this is the worm - a prompt injection worm which infects people's computers and adds itself to the Python or NPM packages that person has access to.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;I used this as an opportunity to promote my favourite recent essay about AI security, &lt;a href="https://embracethered.com/blog/posts/2025/the-normalization-of-deviance-in-ai/"&gt;the Normalization of Deviance in AI&lt;/a&gt; by Johann Rehberger.&lt;/p&gt;
&lt;p&gt;The Normalization of Deviance describes the phenomenon where people and organizations get used to operating in an unsafe manner because nothing bad has happened to them yet, which can result in enormous problems (like the 1986 Challenger disaster) when their luck runs out.&lt;/p&gt;
&lt;p&gt;Every six months I predict that a headline-grabbing prompt injection attack is coming soon, and every six months it doesn't happen. This is my most recent version of that prediction!&lt;/p&gt;
&lt;h4 id="1-year-k-k-p-parrots-will-have-an-outstanding-breeding-season"&gt;1 year: Kākāpō parrots will have an outstanding breeding season &lt;a href="https://www.youtube.com/watch?v=lVDhQMiAbR8&amp;amp;t=3006s" class="predictions-video-link"&gt;▶ 50:06&lt;/a&gt;&lt;/h4&gt;

&lt;p&gt;(I dropped this one to lighten the mood after a discussion of the deep sense of existential dread that many programmers are feeling right now!)&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;I think that Kākāpō parrots in New Zealand are going to have an outstanding breeding season. The reason I think this is that the Rimu trees are in fruit right now. There's only 250 of them,  and they only breed if the Rimu trees have a good fruiting. The Rimu trees have been terrible since 2019, but this year the Rimu trees were all blooming. There are researchers saying that all 87 females of breeding age might lay an egg. And for a species with only 250 remaining parrots that's great news.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;(I just &lt;a href="https://en.wikipedia.org/wiki/K%C4%81k%C4%81p%C5%8D#Population_timeline"&gt;checked Wikipedia&lt;/a&gt; and I was right with the parrot numbers but wrong about the last good breeding season, apparently 2022 was a good year too.)&lt;/p&gt;
&lt;p&gt;In a year with precious little in the form of good news I am utterly delighted to share this story. Here's more:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;a href="https://blog.doc.govt.nz/2025/06/27/kakapo-breeding-season-2026/"&gt;Kākāpō breeding season 2026&lt;/a&gt; introduction from the Department of Conservation from June 2025 .&lt;/li&gt;
&lt;li&gt;
&lt;a href="https://www.auckland.ac.nz/en/news/2025/12/03/bumper-breeding-season-for-kakapo-on-the-cards.html"&gt;Bumper breeding season for kākāpō on the cards&lt;/a&gt; - 3rd December 2025, University of Auckland.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;I don't often use AI-generated images on this blog, but the Kākāpō image the Oxide team created for this episode is just &lt;em&gt;perfect&lt;/em&gt;:&lt;/p&gt;
&lt;p&gt;&lt;img src="https://static.simonwillison.net/static/2026/oxide-kakapo.jpg" alt="A beautiful green Kākāpō surrounded by candles gazes into a crystal ball" style="max-width: 100%;" /&gt;&lt;/p&gt;

&lt;h4 id="3-years-the-coding-agents-jevons-paradox-for-software-engineering-will-resolve-one-way-or-the-other"&gt;3 years: the coding agents Jevons paradox for software engineering will resolve, one way or the other &lt;a href="https://www.youtube.com/watch?v=lVDhQMiAbR8&amp;amp;t=3277s" class="predictions-video-link"&gt;▶ 54:37&lt;/a&gt;&lt;/h4&gt;

&lt;blockquote&gt;
&lt;p&gt;We will find out if the &lt;a href="https://en.wikipedia.org/wiki/Jevons_paradox"&gt;Jevons paradox&lt;/a&gt; saves our careers or not. This is a big question that anyone who's a software engineer has right now: we are driving the cost of actually producing working code down to a fraction of what it used to cost. Does that mean that our careers are completely devalued and we all have to learn to live on a tenth of our incomes, or does it mean that the demand for software, for custom software goes up by a factor of 10 and now our skills are even &lt;em&gt;more&lt;/em&gt; valuable because you can hire me and I can build you 10 times the software I used to be able to? I think by three years we will know for sure which way that one went.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;The quote says it all. There are two ways this coding agents thing could go: it could turn out software engineering skills are devalued, or it could turn out we're more valuable and effective than ever before.&lt;/p&gt;
&lt;p&gt;I'm crossing my fingers for the latter! So far it feels to me like it's working out that way.&lt;/p&gt;

&lt;h4 id="3-years-someone-will-build-a-new-browser-using-mainly-ai-assisted-coding-and-it-won-t-even-be-a-surprise"&gt;3 years: Someone will build a new browser using mainly AI-assisted coding and it won't even be a surprise &lt;a href="https://www.youtube.com/watch?v=lVDhQMiAbR8&amp;amp;t=3913s" class="predictions-video-link"&gt;▶ 65:13&lt;/a&gt;&lt;/h4&gt;

&lt;blockquote&gt;
&lt;p&gt;I think somebody will have built a full web browser mostly using AI assistance, and it won't even be surprising. Rolling a new web browser is one of the most complicated software projects I can imagine[...] the cheat code is the conformance suites. If there are existing tests that it'll get so much easier.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;A common complaint today from AI coding skeptics is that LLMs are fine for toy projects but can't be used for anything large and serious.&lt;/p&gt;
&lt;p&gt;I think within 3 years that will be comprehensively proven incorrect, to the point that it won't even be controversial anymore.&lt;/p&gt;
&lt;p&gt;I picked a web browser here because so much of the work building a browser involves writing code that has to conform to an enormous and daunting selection of both formal tests and informal websites-in-the-wild.&lt;/p&gt;
&lt;p&gt;Coding agents are &lt;em&gt;really good&lt;/em&gt; at tasks where you can define a concrete goal and then set them to work iterating in that direction.&lt;/p&gt;
&lt;p&gt;A web browser is the most ambitious project I can think of that leans into those capabilities.&lt;/p&gt;

&lt;h4 id="6-years-typing-code-by-hand-will-go-the-way-of-punch-cards"&gt;6 years: Typing code by hand will go the way of punch cards &lt;a href="https://www.youtube.com/watch?v=lVDhQMiAbR8&amp;amp;t=4839s" class="predictions-video-link"&gt;▶ 80:39&lt;/a&gt;&lt;/h4&gt;

&lt;blockquote&gt;
&lt;p&gt;I think the job of being paid money to type code into a computer will go the same way as punching punch cards [...] in six years time, I do not think anyone will be paid to just to do the thing where you type the code. I think software engineering will still be an enormous career. I just think the software engineers won't be spending multiple hours of their day in a text editor typing out syntax.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;The more time I spend on AI-assisted programming the less afraid I am for my job, because it turns out building software - especially at the rate it's now possible to build - still requires enormous skill, experience and depth of understanding.&lt;/p&gt;
&lt;p&gt;The skills are changing though! Being able to read a detailed specification and transform it into lines of code is the thing that's being automated away. What's left is everything else, and the more time I spend working with coding agents the larger that "everything else" becomes.&lt;/p&gt;
    
        &lt;p&gt;Tags: &lt;a href="https://simonwillison.net/tags/predictions"&gt;predictions&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/sandboxing"&gt;sandboxing&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/ai"&gt;ai&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/kakapo"&gt;kakapo&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/generative-ai"&gt;generative-ai&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/llms"&gt;llms&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/ai-assisted-programming"&gt;ai-assisted-programming&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/oxide"&gt;oxide&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/bryan-cantrill"&gt;bryan-cantrill&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/coding-agents"&gt;coding-agents&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/conformance-suites"&gt;conformance-suites&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/browser-challenge"&gt;browser-challenge&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/deep-blue"&gt;deep-blue&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/november-2025-inflection"&gt;november-2025-inflection&lt;/a&gt;&lt;/p&gt;
    

</summary><category term="predictions"/><category term="sandboxing"/><category term="ai"/><category term="kakapo"/><category term="generative-ai"/><category term="llms"/><category term="ai-assisted-programming"/><category term="oxide"/><category term="bryan-cantrill"/><category term="coding-agents"/><category term="conformance-suites"/><category term="browser-challenge"/><category term="deep-blue"/><category term="november-2025-inflection"/></entry><entry><title>The new ChatGPT Images is here</title><link href="https://simonwillison.net/2025/Dec/16/new-chatgpt-images/#atom-tag" rel="alternate"/><published>2025-12-16T23:59:22+00:00</published><updated>2025-12-16T23:59:22+00:00</updated><id>https://simonwillison.net/2025/Dec/16/new-chatgpt-images/#atom-tag</id><summary type="html">
    
&lt;p&gt;&lt;strong&gt;&lt;a href="https://openai.com/index/new-chatgpt-images-is-here/"&gt;The new ChatGPT Images is here&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;
OpenAI shipped an update to their ChatGPT Images feature - the feature that &lt;a href="https://simonwillison.net/2025/May/13/launching-chatgpt-images/"&gt;gained them 100 million new users&lt;/a&gt; in a week when they first launched it back in March, but has since been eclipsed by Google's Nano Banana and then further by Nana Banana Pro &lt;a href="https://simonwillison.net/2025/Nov/20/nano-banana-pro/"&gt;in November&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;The focus for the new ChatGPT Images is speed and instruction following:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;It makes precise edits while keeping details intact, and generates images up to 4x faster&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;It's also a little cheaper: OpenAI say that the new &lt;a href="https://platform.openai.com/docs/models/gpt-image-1.5"&gt;gpt-image-1.5&lt;/a&gt; API model makes image input and output "20% cheaper in GPT Image 1.5 as compared to GPT Image 1". &lt;/p&gt;
&lt;p&gt;I tried a new test prompt against a photo I took of Natalie's ceramic stand at the farmers market a few weeks ago:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Add two kakapos inspecting the pots&lt;/p&gt;
&lt;p&gt;&lt;img alt="Outdoor craft market booth displaying handmade ceramics and jewelry on a navy tablecloth with &amp;quot;NATBAT CREATIONS CALIFORNIA USA&amp;quot; logo. Items include colorful glazed ceramic cups in blue, orange, and black; decorative bowls including a rainbow-striped piece; jewelry pendants and earrings on wooden display stands; ceramic plant markers in various colors labeled &amp;quot;Artichoke&amp;quot;, &amp;quot;Cilantro&amp;quot;, &amp;quot;Chili&amp;quot;, &amp;quot;Oregano&amp;quot;, &amp;quot;Potato&amp;quot;, &amp;quot;Pumpkin&amp;quot;, &amp;quot;Sage&amp;quot;." src="https://static.simonwillison.net/static/2025/pots-q80-half.jpg" /&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Here's the result from the new ChatGPT Images model:&lt;/p&gt;
&lt;p&gt;&lt;img alt="Same craft market booth as previous image, now with two large olive-green Kākāpō parrots perched on the table among the ceramics, one investigating the blue glazed cups and the other examining an orange cup." src="https://static.simonwillison.net/static/2025/pots-chatgpt-q80-half.jpg" /&gt;&lt;/p&gt;
&lt;p&gt;And here's what I got from Nano Banana Pro:&lt;/p&gt;
&lt;p&gt;&lt;img alt="Same craft market booth with two Kākāpō now in different positions: one remains center-table peering into the ceramic cups near the rainbow pot, while the second has moved to the right edge of the table near the plant markers, appearing to examine or possibly chew on items at the table's corner. They are both a little smaller than in the first image." src="https://static.simonwillison.net/static/2025/pots-nano-banana-q80-half.jpg" /&gt;&lt;/p&gt;
&lt;p&gt;The ChatGPT Kākāpō are a little chonkier, which I think counts as a win.&lt;/p&gt;
&lt;p&gt;I was a little less impressed by the result I got for an infographic from the prompt "Infographic explaining how the Datasette open source project works" followed by "Run some extensive searches and gather a bunch of relevant information and then try again" (&lt;a href="https://chatgpt.com/share/6941f249-cbd0-8006-b9ff-5a19167206bc"&gt;transcript&lt;/a&gt;):&lt;/p&gt;
&lt;p&gt;&lt;img alt="Infographic titled &amp;quot;HOW DATASETTE WORKS&amp;quot; with subtitle &amp;quot;THE OPEN SOURCE DATA PLATFORM&amp;quot; showing a four-step workflow. STEP 1 (orange): &amp;quot;LOAD YOUR DATA&amp;quot; - &amp;quot;CSV, JSON, XLSX, SQLite, PostgreSQL, etc.&amp;quot; with icons of file types flowing into a laptop. Below: &amp;quot;IMPORT DATASETS - Turn your structured data into SQLite databases and .db files.&amp;quot; with checkmarks for &amp;quot;Datasette Desktop App for local deployment&amp;quot;, &amp;quot;CLI tool for command-line imports&amp;quot;, &amp;quot;Automatic CSV import tool&amp;quot;. STEP 2 (green): &amp;quot;PUBLISH &amp;amp; DEPLOY&amp;quot; - &amp;quot;HOST DATASETS ONLINE&amp;quot; with cloud and server icons labeled &amp;quot;DEPLOY&amp;quot;. Below: &amp;quot;SHARE ONLINE - Deploy your Datasette instance to a public server.&amp;quot; with checkmarks for &amp;quot;Datasette Cloud - Free hosting service&amp;quot;, &amp;quot;Deploy anywhere via plugins&amp;quot;, &amp;quot;Configurable API tools&amp;quot;. STEP 3 (purple): &amp;quot;EXPLORE &amp;amp; QUERY&amp;quot; - &amp;quot;BROWSE, SEARCH &amp;amp; VISUALIZE&amp;quot; with database and browser window icons. Below: &amp;quot;SQL QUERIES &amp;amp; SEARCH - Browse, filter, search, and visualize your data with an interactive web interface.&amp;quot; with checkmarks for &amp;quot;Perform SQL queries directly from the browser&amp;quot;, &amp;quot;Filter, sort, and facet data&amp;quot;, &amp;quot;Generate custom visualizations and charts&amp;quot;. STEP 4 (red): &amp;quot;BUILD &amp;amp; EXTEND&amp;quot; - &amp;quot;PLUGINS, APIS &amp;amp; INTEGRATIONS&amp;quot; with gear and wrench icons labeled &amp;quot;API&amp;quot;. Below: &amp;quot;CUSTOMIZE &amp;amp; DEVELOP&amp;quot; with bullets &amp;quot;Develop custom plugins for added functionality&amp;quot;, &amp;quot;Access JSON API for programmatic queries&amp;quot;, &amp;quot;Embed and integrate Datasette into other applications&amp;quot;. Bottom banner shows four features: &amp;quot;OPEN DATA PLATFORM - Widely used for visualizing, sharing and building applications with SQLite backed data&amp;quot;, &amp;quot;EXTENSIBLE PLUGINS - 100+ plugins available, inc uding chaps, charts authentication, and more&amp;quot;, &amp;quot;ACCESS CONTROL - Granular permissions for controlling who s an access and interact with your data&amp;quot;, &amp;quot;OPEN SOURCE PROJECT - Actively developed open source project with a vibrant community of contributors&amp;quot;." src="https://static.simonwillison.net/static/2025/chatgpt-infographic.jpg" /&gt;&lt;/p&gt;
&lt;p&gt;See &lt;a href="https://simonwillison.net/2025/Nov/20/nano-banana-pro/#creating-an-infographic"&gt;my Nano Banana Pro post&lt;/a&gt; for comparison.&lt;/p&gt;
&lt;p&gt;Both models are clearly now usable for text-heavy graphics though, which makes them far more useful than previous generations of this technology.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Update 21st December 2025&lt;/strong&gt;: I realized I &lt;a href="https://tools.simonwillison.net/python/#openai_imagepy"&gt;already have a tool&lt;/a&gt; for accessing this new model via the API. Here's what I got from the following:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;OPENAI_API_KEY="$(llm keys get openai)" \
  uv run openai_image.py -m gpt-image-1.5\
  'a raccoon with a double bass in a jazz bar rocking out'
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;&lt;img alt="Digital artwork of a raccoon wearing a black fedora and vest, passionately playing an upright double bass on stage at a dimly lit jazz club. The raccoon's mouth is open as if singing. A vintage microphone stands to the right, another raccoon musician is visible in the background, and a neon sign reading &amp;quot;Jazz Club&amp;quot; glows in warm orange letters. The scene has a smoky, atmospheric quality with rich amber and brown tones." src="https://static.simonwillison.net/static/2025/raccoon-jazz-gpt-image-1.5.jpg" /&gt;&lt;/p&gt;
&lt;p&gt;Total cost: &lt;a href="https://chatgpt.com/share/694867b3-8a20-8006-981c-6514618ff5b5"&gt;$0.2041&lt;/a&gt;.


    &lt;p&gt;Tags: &lt;a href="https://simonwillison.net/tags/ai"&gt;ai&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/kakapo"&gt;kakapo&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/openai"&gt;openai&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/generative-ai"&gt;generative-ai&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/text-to-image"&gt;text-to-image&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/nano-banana"&gt;nano-banana&lt;/a&gt;&lt;/p&gt;



</summary><category term="ai"/><category term="kakapo"/><category term="openai"/><category term="generative-ai"/><category term="text-to-image"/><category term="nano-banana"/></entry><entry><title>OpenAI are quietly adopting skills, now available in ChatGPT and Codex CLI</title><link href="https://simonwillison.net/2025/Dec/12/openai-skills/#atom-tag" rel="alternate"/><published>2025-12-12T23:29:51+00:00</published><updated>2025-12-12T23:29:51+00:00</updated><id>https://simonwillison.net/2025/Dec/12/openai-skills/#atom-tag</id><summary type="html">
    &lt;p&gt;One of the things that most excited me about &lt;a href="https://simonwillison.net/2025/Oct/16/claude-skills/"&gt;Anthropic's new Skills mechanism&lt;/a&gt; back in October is how easy it looked for other platforms to implement. A skill is just a folder with a Markdown file and some optional extra resources and scripts, so any LLM tool with the ability to navigate and read from a filesystem should be capable of using them. It turns out OpenAI are doing exactly that, with skills support quietly showing up in both their Codex CLI tool and now also in ChatGPT itself.&lt;/p&gt;
&lt;h4 id="skills-in-chatgpt"&gt;Skills in ChatGPT&lt;/h4&gt;
&lt;p&gt;I learned about this &lt;a href="https://x.com/elias_judin/status/1999491647563006171"&gt;from Elias Judin&lt;/a&gt; this morning. It turns out the Code Interpreter feature of ChatGPT now has a new &lt;code&gt;/home/oai/skills&lt;/code&gt; folder which you can access simply by prompting:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;code&gt;Create a zip file of /home/oai/skills&lt;/code&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;I &lt;a href="https://chatgpt.com/share/693c9645-caa4-8006-9302-0a9226ea7599"&gt;tried that myself&lt;/a&gt; and got back &lt;a href="https://static.simonwillison.net/static/cors-allow/2025/skills.zip"&gt;this zip file&lt;/a&gt;. Here's &lt;a href="https://tools.simonwillison.net/zip-wheel-explorer?url=https%3A%2F%2Fstatic.simonwillison.net%2Fstatic%2Fcors-allow%2F2025%2Fskills.zip"&gt;a UI for exploring its content&lt;/a&gt; (&lt;a href="https://tools.simonwillison.net/colophon#zip-wheel-explorer.html"&gt;more about that tool&lt;/a&gt;).&lt;/p&gt;
&lt;p&gt;&lt;img src="https://static.simonwillison.net/static/2025/skills-explore.jpg" alt="Screenshot of file explorer. Files skills/docs/render_docsx.py and skills/docs/skill.md and skills/pdfs/ and skills/pdfs/skill.md - that last one is expanded and reads: # PDF reading, creation, and review guidance  ## Reading PDFs - Use pdftoppm -png $OUTDIR/$BASENAME.pdf $OUTDIR/$BASENAME to convert PDFs to PNGs. - Then open the PNGs and read the images. - pdfplumber is also installed and can be used to read PDFs. It can be used as a complementary tool to pdftoppm but not replacing it. - Only do python printing as a last resort because you will miss important details with text extraction (e.g. figures, tables, diagrams).  ## Primary tooling for creating PDFs - Generate PDFs programmatically with reportlab as the primary tool. In most cases, you should use reportlab to create PDFs. - If there are other packages you think are necessary for the task (eg. pypdf, pyMuPDF), you can use them but you may need topip install them first. - After each meaningful update—content additions, layout adjustments, or style changes—render the PDF to images to check layout fidelity:   - pdftoppm -png $INPUT_PDF $OUTPUT_PREFIX - Inspect every exported PNG before continuing work. If anything looks off, fix the source and re-run the render → inspect loop until the pages are clean.  ## Quality expectations - Maintain a polished, intentional visual design: consistent typography, spacing, margins, color palette, and clear section breaks across all pages. - Avoid major rendering issues—no clipped text, overlapping elements, black squares, broken tables, or unreadable glyphs. The rendered pages should look like a curated document, not raw template output. - Charts, tables, diagrams, and images must be sharp, well-aligned, and properly labeled in the PNGs. Legends and axes should be readable without excessive zoom. - Text must be readable at normal viewing size; avoid walls of filler text or dense, unstructured bullet lists. Use whitespace to separate ideas. - Never use the U+2011 non-breaking hyphen or other unicode dashes as they will not be" style="max-width: 100%;" /&gt;&lt;/p&gt;
&lt;p&gt;So far they cover spreadsheets, docx and PDFs. Interestingly their chosen approach for PDFs and documents is to convert them to rendered per-page PNGs and then pass those through their vision-enabled GPT models, presumably to maintain information from layout and graphics that would be lost if they just ran text extraction.&lt;/p&gt;
&lt;p&gt;Elias &lt;a href="https://github.com/eliasjudin/oai-skills"&gt;shared copies in a GitHub repo&lt;/a&gt;. They look very similar to Anthropic's implementation of the same kind of idea, currently published in their &lt;a href="https://github.com/anthropics/skills/tree/main/skills"&gt;anthropics/skills&lt;/a&gt; repository.&lt;/p&gt;
&lt;p&gt;I tried it out by prompting:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Create a PDF with a summary of the rimu tree situation right now and what it means for kakapo breeding season&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Sure enough, GPT-5.2 Thinking started with:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Reading skill.md for PDF creation guidelines&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Then:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Searching rimu mast and Kākāpō 2025 breeding status&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;It took &lt;a href="https://chatgpt.com/share/693ca54b-f770-8006-904b-9f31a585180a"&gt;just over eleven minutes&lt;/a&gt; to produce &lt;a href="https://static.simonwillison.net/static/cors-allow/2025/rimu_kakapo_breeding_brief.pdf"&gt;this PDF&lt;/a&gt;, which was long enough that I had Claude Code for web &lt;a href="https://github.com/simonw/tools/pull/155"&gt;build me a custom PDF viewing tool&lt;/a&gt; while I waited.&lt;/p&gt;
&lt;p&gt;&lt;a href="https://tools.simonwillison.net/view-pdf?url=https%3A%2F%2Fstatic.simonwillison.net%2Fstatic%2Fcors-allow%2F2025%2Frimu_kakapo_breeding_brief.pdf"&gt;Here's ChatGPT's PDF in that tool&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;&lt;img src="https://static.simonwillison.net/static/2025/rimu.jpg" alt="Screenshot of my tool. There is a URL at the top, a Load PDF button and pagination controls. Then the PDF itself is shown, which reads: Rimu mast status and what it means for the kākāpō breeding season Summary as of 12 December 2025 (Pacific/Auckland context) Kākāpō breeding is tightly linked to rimu (Dacrydium cupressinum) mast events: when rimu trees set and ripen large amounts of fruit, female kākāpō are much more likely to nest, and more chicks can be successfully raised. Current monitoring indicates an unusually strong rimu fruiting signal heading into the 2025/26 season, which sets the stage for a potentially large breeding year in 2026.^1,2 Key numbers at a glance Kākāpō population (official DOC count) 237 birds alive Breeding trigger (rimu fruiting)&amp;gt;10% of rimu branch tips bearing fruit Forecast rimu fruiting for 2026 (DOC monitoring) Around 50–60% fruiting across breeding islands¹Breeding-age females (DOC 2025 planning figure)About 87 females (potentially nearly all could nest)" style="max-width: 100%;" /&gt;&lt;/p&gt;
&lt;p&gt;(I am &lt;strong&gt;very excited&lt;/strong&gt; about &lt;a href="https://www.auckland.ac.nz/en/news/2025/12/03/bumper-breeding-season-for-kakapo-on-the-cards.html"&gt;Kākāpō breeding season this year&lt;/a&gt;.)&lt;/p&gt;
&lt;p&gt;The reason it took so long is that it was fastidious about looking at and tweaking its own work. I appreciated that at one point it tried rendering the PDF and noticed that the macrons in kākāpō were not supported by the chosen font, so it switched to something else:&lt;/p&gt;
&lt;p&gt;&lt;img src="https://static.simonwillison.net/static/2025/skills-macrons.jpg" alt="ChatGPT screenshot. Analyzed image. There's an image of a page of PDF with obvious black blocks on some of the letters in the heading. It then says: Fixing font issues with macrons. The page is showing black squares for words like &amp;quot;kākāpō,&amp;quot; probably because Helvetica can't handle macrons. I'll switch to a font that supports them, such as DejaVu Sans or Noto Sans. I'll register both regular and bold fonts, then apply them to the document. I'll update the footer to note the issue with Helvetica. Time to rebuild the PDF!" style="max-width: 100%;" /&gt;&lt;/p&gt;
&lt;h4 id="skills-in-codex-cli"&gt;Skills in Codex CLI&lt;/h4&gt;
&lt;p&gt;Meanwhile, two weeks ago OpenAI's open source Codex CLI tool landed a PR titled &lt;a href="https://github.com/openai/codex/pull/7412"&gt;feat: experimental support for skills.md&lt;/a&gt;. The most recent docs for that are in &lt;a href="https://github.com/openai/codex/blob/main/docs/skills.md"&gt;docs/skills.md&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;The documentation suggests that any folder in &lt;code&gt;~/.codex/skills&lt;/code&gt; will be treated as a skill.&lt;/p&gt;
&lt;p&gt;I dug around and found the code that generates the prompt that drives the skill system in &lt;a href="https://github.com/openai/codex/blob/ad7b9d63c326d5c92049abd16f9f5fb64a573a69/codex-rs/core/src/skills/render.rs#L20-L38"&gt;codex-rs/core/src/skills/render.rs&lt;/a&gt; - here's a Gist with &lt;a href="https://gist.github.com/simonw/25f2c3a9e350274bc2b76a79bc8ae8b2"&gt;a more readable version of that prompt&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;I &lt;a href="https://claude.ai/share/0a9b369b-f868-4065-91d1-fd646c5db3f4"&gt;used Claude Opus 4.5's skill authoring skill&lt;/a&gt; to create &lt;a href="https://github.com/datasette/skill"&gt;this skill for creating Datasette plugins&lt;/a&gt;, then installed it into my Codex CLI skills folder like this:&lt;/p&gt;
&lt;div class="highlight highlight-source-shell"&gt;&lt;pre&gt;git clone https://github.com/datasette/skill \
  &lt;span class="pl-k"&gt;~&lt;/span&gt;/.codex/skills/datasette-plugin&lt;/pre&gt;&lt;/div&gt;
&lt;p&gt;You have to run Codex with the &lt;code&gt;--enable skills&lt;/code&gt; option. I ran this:&lt;/p&gt;
&lt;div class="highlight highlight-source-shell"&gt;&lt;pre&gt;&lt;span class="pl-c1"&gt;cd&lt;/span&gt; /tmp
mkdir datasette-cowsay
&lt;span class="pl-c1"&gt;cd&lt;/span&gt; datasette-cowsay
codex --enable skills -m gpt-5.2&lt;/pre&gt;&lt;/div&gt;
&lt;p&gt;Then prompted:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;code&gt;list skills&lt;/code&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;And Codex replied:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;code&gt;- datasette-plugins — Writing Datasette plugins using Python + pluggy (file: /Users/simon/.codex/skills/datasette-plugin/SKILL.md)&lt;/code&gt;&lt;br /&gt;
&lt;code&gt;- Discovery — How to find/identify available skills (no SKILL.md path provided in the list)&lt;/code&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Then I said:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;code&gt;Write a Datasette plugin in this folder adding a /-/cowsay?text=hello page that displays a pre with cowsay from PyPI saying that text&lt;/code&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;It worked perfectly! Here's &lt;a href="https://github.com/simonw/datasette-cowsay"&gt;the plugin code it wrote&lt;/a&gt; and here's &lt;a href="http://gistpreview.github.io/?96ee928370b18eabc2e0fad9aaa46d4b"&gt;a copy of the full Codex CLI transcript&lt;/a&gt;, generated with my &lt;a href="https://simonwillison.net/2025/Oct/23/claude-code-for-web-video/"&gt;terminal-to-html tool&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;You can try that out yourself if you have &lt;code&gt;uvx&lt;/code&gt; installed like this:&lt;/p&gt;
&lt;div class="highlight highlight-source-shell"&gt;&lt;pre&gt;uvx --with https://github.com/simonw/datasette-cowsay/archive/refs/heads/main.zip \
  datasette&lt;/pre&gt;&lt;/div&gt;
&lt;p&gt;Then visit:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;http://127.0.0.1:8001/-/cowsay?text=This+is+pretty+fun
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;&lt;img src="https://static.simonwillison.net/static/2025/cowsay-datasette.jpg" alt="Screenshot of that URL in Firefox, an ASCII art cow says This is pretty fun." style="max-width: 100%;" /&gt;&lt;/p&gt;
&lt;h4 id="skills-are-a-keeper"&gt;Skills are a keeper&lt;/h4&gt;
&lt;p&gt;When I first wrote about skills in October I said &lt;a href="https://simonwillison.net/2025/Oct/16/claude-skills/"&gt;Claude Skills are awesome, maybe a bigger deal than MCP&lt;/a&gt;. The fact that it's just turned December and OpenAI have already leaned into them in a big way reinforces to me that I called that one correctly.&lt;/p&gt;
&lt;p&gt;Skills are based on a &lt;em&gt;very&lt;/em&gt; light specification, if you could even call it that, but I still think it would be good for these to be formally documented somewhere. This could be a good initiative for the new &lt;a href="https://aaif.io/"&gt;Agentic AI Foundation&lt;/a&gt; (&lt;a href="https://simonwillison.net/2025/Dec/9/agentic-ai-foundation/"&gt;previously&lt;/a&gt;) to take on.&lt;/p&gt;
    
        &lt;p&gt;Tags: &lt;a href="https://simonwillison.net/tags/pdf"&gt;pdf&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/ai"&gt;ai&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/kakapo"&gt;kakapo&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/openai"&gt;openai&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/prompt-engineering"&gt;prompt-engineering&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/generative-ai"&gt;generative-ai&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/chatgpt"&gt;chatgpt&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/llms"&gt;llms&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/ai-assisted-programming"&gt;ai-assisted-programming&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/anthropic"&gt;anthropic&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/coding-agents"&gt;coding-agents&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/gpt-5"&gt;gpt-5&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/codex-cli"&gt;codex-cli&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/skills"&gt;skills&lt;/a&gt;&lt;/p&gt;
    

</summary><category term="pdf"/><category term="ai"/><category term="kakapo"/><category term="openai"/><category term="prompt-engineering"/><category term="generative-ai"/><category term="chatgpt"/><category term="llms"/><category term="ai-assisted-programming"/><category term="anthropic"/><category term="coding-agents"/><category term="gpt-5"/><category term="codex-cli"/><category term="skills"/></entry><entry><title>In Kākāpō breeding season news…</title><link href="https://simonwillison.net/2019/Apr/19/kakapo-breeding-season/#atom-tag" rel="alternate"/><published>2019-04-19T03:11:34+00:00</published><updated>2019-04-19T03:11:34+00:00</updated><id>https://simonwillison.net/2019/Apr/19/kakapo-breeding-season/#atom-tag</id><summary type="html">
    
&lt;p&gt;&lt;strong&gt;&lt;a href="https://www.metafilter.com/180491/In-Kkp-breeding-season-news"&gt;In Kākāpō breeding season news…&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;
I posted on MetaFilter about this year’s record-breaking Kākāpō breeding season.


    &lt;p&gt;Tags: &lt;a href="https://simonwillison.net/tags/metafilter"&gt;metafilter&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/kakapo"&gt;kakapo&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/conservation"&gt;conservation&lt;/a&gt;&lt;/p&gt;



</summary><category term="metafilter"/><category term="kakapo"/><category term="conservation"/></entry></feed>