Simon Willison on datasette

1,476 posts tagged “datasette”

Datasette is an open source tool for exploring and publishing data.

2026

Tool datasette.io news preview

The datasette.io website has a news section built from this news.yaml file in the underlying GitHub repository. The YAML format looks like this:

- date: 2026-04-15
  body: |-
    [Datasette 1.0a27](https://docs.datasette.io/en/latest/changelog.html#a27-2026-04-15) changes how CSRF protection works in a way that simplifies form and API integration, and introduces a new `RenameTableEvent` for when a table is renamed by a SQL query.
- date: 2026-03-18
  body: |-
    ...

This format is a little hard to edit, so I finally had Claude build a custom preview UI to make checking for errors have slightly less friction.

I built it using standard claude.ai and Claude Artifacts, taking advantage of Claude's ability to clone GitHub repos and look at their content as part of a regular chat:

Clone https://github.com/simonw/datasette.io and look at the news.yaml file and how it is rendered on the homepage. Build an artifact I can paste that YAML into which previews what it will look like, and highlights any markdown errors or YAML errors

16th Apr 2026, 12:18 am · vibe-coding, claude, tools, datasette

Release datasette-export-database 0.3a1 — Export a copy of a mutable SQLite database on demand

This plugin was using the ds_csrftoken cookie as part of a custom signed URL, which needed upgrading now that Datasette 1.0a27 no longer sets that cookie.

15th Apr 2026, 11:52 pm · datasette

Release datasette 1.0a27 — An open source multi-tool for exploring and publishing data

Two major changes in this new Datasette alpha. I covered the first of those in detail yesterday - Datasette no longer uses Django-style CSRF form tokens, instead using modern browser headers as described by Filippo Valsorda.

The second big change is that Datasette now fires a new RenameTableEvent any time a table is renamed during a SQLite transaction. This is useful because some plugins (like datasette-comments) attach additional data to table records by name, so a renamed table requires them to react in appropriate ways.

Here are the rest of the changes in the alpha:

New actor= parameter for datasette.client methods, allowing internal requests to be made as a specific actor. This is particularly useful for writing automated tests. (#2688)

New Database(is_temp_disk=True) option, used internally for the internal database. This helps resolve intermittent database locked errors caused by the internal database being in-memory as opposed to on-disk. (#2683) (#2684)

The /<database>/<table>/-/upsert API (docs) now rejects rows with null primary key values. (#1936)

Improved example in the API explorer for the /-/upsert endpoint (docs). (#1936)

The /<database>.json endpoint now includes an "ok": true key, for consistency with other JSON API responses.

call_with_supported_arguments() is now documented as a supported public API. (#2678)

15th Apr 2026, 11:16 pm · annotated-release-notes, datasette, python

Release datasette-ports 0.3 — Find all currently running Datasette instances and list their ports

A small update for my tool for helping me figure out what all of the Datasette instances on my laptop are up to.

Show working directory derived from each PID

Show the full path to each database file

Output now looks like this:

http://127.0.0.1:8007/ - v1.0a26
  Directory: /Users/simon/dev/blog
  Databases:
    simonwillisonblog: /Users/simon/dev/blog/simonwillisonblog.db
  Plugins:
    datasette-llm
    datasette-secrets
http://127.0.0.1:8001/ - v1.0a26
  Directory: /Users/simon/dev/creatures
  Databases:
    creatures: /tmp/creatures.db

15th Apr 2026, 2:50 am · datasette

datasette PR #2689: Replace token-based CSRF with Sec-Fetch-Site header protection. Datasette has long protected against CSRF attacks using CSRF tokens, implemented using my asgi-csrf Python library. These are something of a pain to work with - you need to scatter forms in templates with <input type="hidden" name="csrftoken" value="{{ csrftoken() }}"> lines and then selectively disable CSRF protection for APIs that are intended to be called from outside the browser.

I've been following Filippo Valsorda's research here with interest, described in this detailed essay from August 2025 and shipped as part of Go 1.25 that same month.

I've now landed the same change in Datasette. Here's the PR description - Claude Code did much of the work (across 10 commits, closely guided by me and cross-reviewed by GPT-5.4) but I've decided to start writing these PR descriptions by hand, partly to make them more concise and also as an exercise in keeping myself honest.

New CSRF protection middleware inspired by Go 1.25 and this research by Filippo Valsorda. This replaces the old CSRF token based protection.

Removes all instances of <input type="hidden" name="csrftoken" value="{{ csrftoken() }}"> in the templates - they are no longer needed.

Removes the def skip_csrf(datasette, scope): plugin hook defined in datasette/hookspecs.py and its documentation and tests.

Updated CSRF protection documentation to describe the new approach.

Upgrade guide now describes the CSRF change.

# 14th April 2026, 11:58 pm / csrf, security, datasette, ai-assisted-programming

Release datasette-ports 0.2 — Find all currently running Datasette instances and list their ports

No longer requires Datasette - running uvx datasette-ports now works as well.

Installing it as a Datasette plugin continues to provide the datasette ports command.

6th Apr 2026, 3:25 am · datasette

Release datasette-ports 0.1 — Find all currently running Datasette instances and list their ports

Another example of README-driven development, this time solving a problem that might be unique to me.

I often find myself running a bunch of different Datasette instances with different databases and different in-development plugins, spreads across dozens of different terminal windows - enough that I frequently lose them!

Now I can run this:

datasette install datasette-ports
datasette ports

And get a list of every running instance that looks something like this:

http://127.0.0.1:8333/ - v1.0a26
  Databases: data
  Plugins: datasette-enrichments, datasette-enrichments-llm, datasette-llm, datasette-secrets
http://127.0.0.1:8001/ - v1.0a26
  Databases: creatures
  Plugins: datasette-extract, datasette-llm, datasette-secrets
http://127.0.0.1:8900/ - v0.65.2
  Databases: logs

6th Apr 2026, 12:23 am · datasette

Release datasette-llm 0.1a6 — LLM integration plugin for other plugins to depend on

The same model ID no longer needs to be repeated in both the default model and allowed models lists - setting it as a default model automatically adds it to the allowed models list. #6

Improved documentation for Python API usage.

1st Apr 2026, 11:01 pm · llm, datasette

Release datasette-enrichments-llm 0.2a1 — Enrich data by prompting LLMs

The actor who triggers an enrichment is now passed to the llm.mode(... actor=actor) method. #3

1st Apr 2026, 10 pm · enrichments, llm, datasette

Release datasette-extract 0.3a0 — Import unstructured data (text and images) into structured tables

Now uses datasette-llm to manage model configuration, which means you can control which models are available for extraction tasks using the extract purpose and LLM model configuration. #38

1st Apr 2026, 3:32 am · llm, datasette

Release datasette-enrichments-llm 0.2a0 — Enrich data by prompting LLMs

This plugin now uses datasette-llm to configure and manage models. This means it's possible to specify which models should be made available for enrichments, using the new enrichments purpose.

1st Apr 2026, 3:28 am · llm, datasette

Release datasette-llm-usage 0.2a0 — Track usage of LLM tokens in a SQLite table

Removed features relating to allowances and estimated pricing. These are now the domain of datasette-llm-accountant.

Now depends on datasette-llm for model configuration. #3

Full prompts and responses and tool calls can now be logged to the llm_usage_prompt_log table in the internal database if you set the new datasette-llm-usage.log_prompts plugin configuration setting.

Redesigned the /-/llm-usage-simple-prompt page, which now requires the llm-usage-simple-prompt permission.

1st Apr 2026, 3:24 am · llm, datasette

Release datasette-llm 0.1a5 — LLM integration plugin for other plugins to depend on

The llm_prompt_context() plugin hook wrapper mechanism now tracks prompts executed within a chain as well as one-off prompts, which means it can be used to track tool call loops. #5

1st Apr 2026, 3:11 am · llm, datasette

Release datasette-llm 0.1a4 — LLM integration plugin for other plugins to depend on

Ability to configure different API keys for models based on their purpose - for example, set it up so enrichments always use gpt-5.4-mini with an API key dedicated to that purpose. #4

I released llm-echo 0.3 to provide an API key testing utility I needed for the tests for this new feature.

31st Mar 2026, 9:17 pm · llm, datasette

Release datasette-files 0.1a3 — Upload files to Datasette

I'm working on integrating datasette-files into other plugins, such as datasette-extract. This necessitated a new release of the base plugin.

owners_can_edit and owners_can_delete configuration options, plus the files-edit and files-delete actions are now scoped to a new FileResource which is a child of FileSourceResource. #18

The file picker UI is now available as a <datasette-file-picker> Web Component. Thanks, Alex Garcia. #19

New from datasette_files import get_file Python API for other plugins that need to access file data. #20

30th Mar 2026, 11:58 pm · datasette

Release datasette-llm 0.1a3 — LLM integration plugin for other plugins to depend on

Adds the ability to configure which LLMs are available for which purpose, which means you can restrict the list of models that can be used with a specific plugin. #3

30th Mar 2026, 7:48 pm · llm, datasette

Release datasette-llm 0.1a2 — LLM integration plugin for other plugins to depend on

actor is now available to the llm_prompt_context plugin hook. #2

26th Mar 2026, 3:52 pm · llm, datasette

Release datasette-files-s3 0.1a1 — datasette-files S3 backend

A backend for datasette-files that adds the ability to store and retrieve files using an S3 bucket. This release added a mechanism for fetching S3 configuration periodically from a URL, which means we can use time limited IAM credentials that are restricted to a prefix within a bucket.

25th Mar 2026, 9:57 pm · s3, datasette

Release datasette-llm 0.1a1 — LLM integration plugin for other plugins to depend on

New release of the base plugin that makes models from LLM available for use by other Datasette plugins such as datasette-enrichments-llm.

New register_llm_purposes() plugin hook and get_purposes() function for retrieving registered purpose strings. #1

One of the responsibilities of this plugin is to configure which models are used for which purposes, so you can say in one place "data enrichment uses GPT-5.4-nano but SQL query assistance happens using Sonnet 4.6", for example.

Plugins that depend on this can use model = await llm.model(purpose="enrichment") to indicate the purpose of the prompts they wish to execute against the model. Those plugins can now also use the new register_llm_purposes() hook to register those purpose strings, which means future plugins can list those purposes in one place to power things like an admin UI for assigning models to purposes.

25th Mar 2026, 9:24 pm · annotated-release-notes, llm, datasette, plugins

Release datasette-files 0.1a2 — Upload files to Datasette

The most interesting alpha of datasette-files yet, a new plugin which adds the ability to upload files directly into a Datasette instance. Here are the release notes in full:

Columns are now configured using the new column_types system from Datasette 1.0a26. #8

New file_actions plugin hook, plus ability to import an uploaded CSV/TSV file to a table. #10

UI for uploading multiple files at once via the new documented JSON upload API. #11

Thumbnails are now generated for image files and stored in an internal datasette_files_thumbnails table. #13

23rd Mar 2026, 11:06 pm · annotated-release-notes, datasette

Coding agents for data analysis. Here's the handout I prepared for my NICAR 2026 workshop "Coding agents for data analysis" - a three hour session aimed at data journalists demonstrating ways that tools like Claude Code and OpenAI Codex can be used to explore, analyze and clean data.

Here's the table of contents:

Coding agents

Warmup: ChatGPT and Claude

Setup Claude Code and Codex

Asking questions against a database

Exploring data with agents

Cleaning data: decoding neighborhood codes

Creating visualizations with agents

Scraping data with agents

I ran the workshop using GitHub Codespaces and OpenAI Codex, since it was easy (and inexpensive) to distribute a budget-restricted API key for Codex that attendees could use during the class. Participants ended up burning $23 of Codex tokens.

The exercises all used Python and SQLite and some of them used Datasette.

One highlight of the workshop was when we started running Datasette such that it served static content from a viz/ folder, then had Claude Code start vibe coding new interactive visualizations directly in that folder. Here's a heat map it created for my trees database using Leaflet and Leaflet.heat, source code here.

I designed the handout to also be useful for people who weren't able to attend the session in person. As is usually the case, material aimed at data journalists is equally applicable to anyone else with data to explore.

# 16th March 2026, 8:12 pm / nicar, sqlite, ai, speaking, llms, coding-agents, generative-ai, data-journalism, github-codespaces, codex-cli, datasette, claude-code, python, leaflet, geospatial

Release datasette-files 0.1a1 — Upload files to Datasette

20th Feb 2026, 12:36 am · datasette

Release datasette-files-s3 0.1a0 — datasette-files S3 backend

20th Feb 2026, 12:32 am · datasette

Release datasette-files 0.1a0 — Upload files to Datasette

19th Feb 2026, 10:25 pm · datasette

Release datasette-showboat 0.1a1 — Datasette plugin for SHOWBOAT_REMOTE_URL

18th Feb 2026, 12:53 pm · showboat, datasette

Two new Showboat tools: Chartroom and datasette-showboat

I introduced Showboat a week ago—my CLI tool that helps coding agents create Markdown documents that demonstrate the code that they have created. I’ve been finding new ways to use it on a daily basis, and I’ve just released two new tools to help get the best out of the Showboat pattern. Chartroom is a CLI charting tool that works well with Showboat, and datasette-showboat lets Showboat’s new remote publishing feature incrementally push documents to a Datasette instance.

[... 1,756 words]

12:43 am / 17th February 2026 / ai, claude-code, llms, coding-agents, ai-assisted-programming, datasette, generative-ai, projects, showboat, charting

Release datasette-showboat 0.1a0 — Datasette plugin for SHOWBOAT_REMOTE_URL

16th Feb 2026, 7:43 pm · showboat, datasette

Release datasette-pins 0.1a7 — Pin databases, tables, and other items to the Datasette homepage

5th Feb 2026, 4:34 pm · datasette

Release datasette-youtube-embed 0.2 — Turn YouTube URLs into embedded players in Datasette

4th Feb 2026, 4:46 pm · datasette

Distributing Go binaries like sqlite-scanner through PyPI using go-to-wheel

I’ve been exploring Go for building small, fast and self-contained binary applications recently. I’m enjoying how there’s generally one obvious way to do things and the resulting code is boring and readable—and something that LLMs are very competent at writing. The one catch is distribution, but it turns out publishing Go binaries to PyPI means any Go binary can be just a uvx package-name call away.

[... 1,312 words]

2:59 pm / 4th February 2026 / uv, go, pypi, packaging, ai-assisted-programming, python, datasette, projects, sqlite

page 1 / 50 next » last »»

Simon Willison’s Weblog

1,476 posts tagged “datasette”

2026

Two new Showboat tools: Chartroom and datasette-showboat

Distributing Go binaries like sqlite-scanner through PyPI using go-to-wheel