<?xml version="1.0" encoding="utf-8"?>
<feed xml:lang="en-us" xmlns="http://www.w3.org/2005/Atom"><title>Simon Willison's Weblog: metadata</title><link href="http://simonwillison.net/" rel="alternate"/><link href="http://simonwillison.net/tags/metadata.atom" rel="self"/><id>http://simonwillison.net/</id><updated>2025-09-06T06:41:49+00:00</updated><author><name>Simon Willison</name></author><entry><title>Quoting IanCal</title><link href="https://simonwillison.net/2025/Sep/6/iancal/#atom-tag" rel="alternate"/><published>2025-09-06T06:41:49+00:00</published><updated>2025-09-06T06:41:49+00:00</updated><id>https://simonwillison.net/2025/Sep/6/iancal/#atom-tag</id><summary type="html">
    &lt;blockquote cite="https://news.ycombinator.com/item?id=45135302#45135852"&gt;&lt;p&gt;RDF has the same problems as the SQL schemas with information scattered. What fields mean requires documentation.&lt;/p&gt;
&lt;p&gt;There - they have a name on a person. What name? Given? Legal? Chosen? Preferred for this use case?&lt;/p&gt;
&lt;p&gt;You only have one ID for Apple eh? Companies are complex to model, do you mean Apple just as someone would talk about it? The legal structure of entities that underpins all major companies, what part of it is referred to?&lt;/p&gt;
&lt;p&gt;I spent a long time building identifiers for universities and companies (which was taken for &lt;a href="https://ror.org/"&gt;ROR&lt;/a&gt; later) and it was a nightmare to say what a university even was. What’s the name of Cambridge? It’s not “Cambridge University” or “The university of Cambridge” legally. But it also is the actual name as people use it. &lt;em&gt;[It's &lt;a href="https://www.cam.ac.uk/about-the-university/how-the-university-and-colleges-work/the-university-as-a-charity"&gt;The Chancellor, Masters, and Scholars of the University of Cambridge&lt;/a&gt;]&lt;/em&gt;&lt;/p&gt;
&lt;p&gt;The university of Paris went from something like 13 institutes to maybe one to then a bunch more. Are companies locations at their headquarters? Which headquarters?&lt;/p&gt;
&lt;p&gt;Someone will suggest modelling to solve this but here lies the biggest problem:&lt;/p&gt;
&lt;p&gt;The correct modelling depends on &lt;em&gt;the questions you want to answer&lt;/em&gt;.&lt;/p&gt;&lt;/blockquote&gt;
&lt;p class="cite"&gt;&amp;mdash; &lt;a href="https://news.ycombinator.com/item?id=45135302#45135852"&gt;IanCal&lt;/a&gt;, on Hacker News, discussing RDF&lt;/p&gt;

    &lt;p&gt;Tags: &lt;a href="https://simonwillison.net/tags/hacker-news"&gt;hacker-news&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/metadata"&gt;metadata&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/rdf"&gt;rdf&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/sql"&gt;sql&lt;/a&gt;&lt;/p&gt;



</summary><category term="hacker-news"/><category term="metadata"/><category term="rdf"/><category term="sql"/></entry><entry><title>Microbrowsers are Everywhere</title><link href="https://simonwillison.net/2019/Dec/18/microbrowsers-are-everywhere/#atom-tag" rel="alternate"/><published>2019-12-18T08:32:19+00:00</published><updated>2019-12-18T08:32:19+00:00</updated><id>https://simonwillison.net/2019/Dec/18/microbrowsers-are-everywhere/#atom-tag</id><summary type="html">
    
&lt;p&gt;&lt;strong&gt;&lt;a href="https://24ways.org/2019/microbrowsers-are-everywhere/"&gt;Microbrowsers are Everywhere&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;
Colin Bendell introduces a new-to-me term, “microbrowsers”, to describe the user-agents which hit websites to generate unfurled link previews in messenger apps. Twitter and Facebook first popularized them, but today you’re likely getting far more preview-generating traffic from chat clients such as iMessage, WhatsApp and Slack (which won’t execute script and ignore cookies, and hence won’t show up in Google Analytics). Lots of great tips here—one example: if you provide three og:image meta tags iMessage will render them as a collage.

    &lt;p&gt;&lt;small&gt;&lt;/small&gt;Via &lt;a href="https://news.ycombinator.com/item?id=21800346"&gt;Hacker News&lt;/a&gt;&lt;/small&gt;&lt;/p&gt;


    &lt;p&gt;Tags: &lt;a href="https://simonwillison.net/tags/24-ways"&gt;24-ways&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/metadata"&gt;metadata&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/urls"&gt;urls&lt;/a&gt;&lt;/p&gt;



</summary><category term="24-ways"/><category term="metadata"/><category term="urls"/></entry><entry><title>Facebook's Open Graph Protocol from a Web Developer's Perspective</title><link href="https://simonwillison.net/2010/Apr/26/graph/#atom-tag" rel="alternate"/><published>2010-04-26T13:21:32+00:00</published><updated>2010-04-26T13:21:32+00:00</updated><id>https://simonwillison.net/2010/Apr/26/graph/#atom-tag</id><summary type="html">
    
&lt;p&gt;&lt;strong&gt;&lt;a href="http://www.25hoursaday.com/weblog/2010/04/24/FacebooksOpenGraphProtocolFromAWebDevelopersPerspective.aspx"&gt;Facebook&amp;#x27;s Open Graph Protocol from a Web Developer&amp;#x27;s Perspective&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;
Best explanation I’ve seen yet of what the Open Graph protocol actually does. Add the RDFa-inspired metadata and a Like button to a standard web page representing a place, group, product, website or one of another limited set of object types and people can “Like” it just like they might join a fan page within Facebook itself. You can then send news feed updates to all of that page’s subscribers. The bootstrapped metadata can then benefit other services as well.


    &lt;p&gt;Tags: &lt;a href="https://simonwillison.net/tags/dare-obasanjo"&gt;dare-obasanjo&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/facebook"&gt;facebook&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/metadata"&gt;metadata&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/opengraph"&gt;opengraph&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/opengraphprotocol"&gt;opengraphprotocol&lt;/a&gt;&lt;/p&gt;



</summary><category term="dare-obasanjo"/><category term="facebook"/><category term="metadata"/><category term="opengraph"/><category term="opengraphprotocol"/></entry><entry><title>WildlifeNearYou can now tag your Flickr photos for you</title><link href="https://simonwillison.net/2010/Feb/4/wildlifenearyou/#atom-tag" rel="alternate"/><published>2010-02-04T17:01:09+00:00</published><updated>2010-02-04T17:01:09+00:00</updated><id>https://simonwillison.net/2010/Feb/4/wildlifenearyou/#atom-tag</id><summary type="html">
    
&lt;p&gt;&lt;strong&gt;&lt;a href="http://www.wildlifenearyou.com/blog/2010/feb/4/tag-flickr-photos/"&gt;WildlifeNearYou can now tag your Flickr photos for you&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;
I’m really excited about this feature: if you opt-in, WildlifeNearYou will now write name and latin name tags to your Flickr photos after you’ve marked the species in the photo. This is even more interesting when you combine it with our suggest-a-species feature (the photo won’t get tagged until you’ve approved the suggestion). We also set the location on photos which don’t yet have one, but the real fun is the machine tags we’ve added, which allow developers to use the Flickr API to find photos by their WildlifeNearYou metadata (trip, species and place IDs). As a neat extra touch, the identifiers we use in the machine tags are the same as the ones used by our custom wlny.eu URL shortener, so it’s trivial to turn a machine tag in to the URL for that page on the main site.


    &lt;p&gt;Tags: &lt;a href="https://simonwillison.net/tags/flickr"&gt;flickr&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/machinetags"&gt;machinetags&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/metadata"&gt;metadata&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/tagging"&gt;tagging&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/wildlifenearyou"&gt;wildlifenearyou&lt;/a&gt;&lt;/p&gt;



</summary><category term="flickr"/><category term="machinetags"/><category term="metadata"/><category term="tagging"/><category term="wildlifenearyou"/></entry><entry><title>Revving up</title><link href="https://simonwillison.net/2009/Apr/12/adactio/#atom-tag" rel="alternate"/><published>2009-04-12T12:29:25+00:00</published><updated>2009-04-12T12:29:25+00:00</updated><id>https://simonwillison.net/2009/Apr/12/adactio/#atom-tag</id><summary type="html">
    
&lt;p&gt;&lt;strong&gt;&lt;a href="http://adactio.com/journal/1568"&gt;Revving up&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;
Jeremy Keith advocates adding the revcanonical attribute to regular A elements as well as / instead of hiding it in the head of the document, following the microformats design principle that invisible metadata is less valuable than augmenting visible links. I’ve updated my shorten bookmarklet to handle this case.


    &lt;p&gt;Tags: &lt;a href="https://simonwillison.net/tags/jeremy-keith"&gt;jeremy-keith&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/metadata"&gt;metadata&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/microformats"&gt;microformats&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/revcanonical"&gt;revcanonical&lt;/a&gt;&lt;/p&gt;



</summary><category term="jeremy-keith"/><category term="metadata"/><category term="microformats"/><category term="revcanonical"/></entry><entry><title>Specify your canonical</title><link href="https://simonwillison.net/2009/Feb/14/canonical/#atom-tag" rel="alternate"/><published>2009-02-14T11:28:20+00:00</published><updated>2009-02-14T11:28:20+00:00</updated><id>https://simonwillison.net/2009/Feb/14/canonical/#atom-tag</id><summary type="html">
    
&lt;p&gt;&lt;strong&gt;&lt;a href="http://googlewebmastercentral.blogspot.com/2009/02/specify-your-canonical.html"&gt;Specify your canonical&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;
You can now use a link rel=“canonical” to tell Google that a page has a canonical URL elsewhere. I’ve run in to this problem a bunch of times—in some sites it really does make sense to have the same content shown in two different places—and this seems like a neat solution that could apply to much more than just metadata for external search engines.


    &lt;p&gt;Tags: &lt;a href="https://simonwillison.net/tags/canonical"&gt;canonical&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/google"&gt;google&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/metadata"&gt;metadata&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/relcanonical"&gt;relcanonical&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/search-engines"&gt;search-engines&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/seo"&gt;seo&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/urls"&gt;urls&lt;/a&gt;&lt;/p&gt;



</summary><category term="canonical"/><category term="google"/><category term="metadata"/><category term="relcanonical"/><category term="search-engines"/><category term="seo"/><category term="urls"/></entry><entry><title>freebase-suggest</title><link href="https://simonwillison.net/2008/Sep/24/freebasesuggest/#atom-tag" rel="alternate"/><published>2008-09-24T23:58:22+00:00</published><updated>2008-09-24T23:58:22+00:00</updated><id>https://simonwillison.net/2008/Sep/24/freebasesuggest/#atom-tag</id><summary type="html">
    
&lt;p&gt;&lt;strong&gt;&lt;a href="http://code.google.com/p/freebase-suggest/"&gt;freebase-suggest&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;
A jQuery plugin that performs auto-completion against the Freebase JSONP API, and allows the results to be limited to specific categories or subsets.

    &lt;p&gt;&lt;small&gt;&lt;/small&gt;Via &lt;a href="http://hublog.hubmed.org/archives/001752.html"&gt;Alf Eaton&lt;/a&gt;&lt;/small&gt;&lt;/p&gt;


    &lt;p&gt;Tags: &lt;a href="https://simonwillison.net/tags/autocomplete"&gt;autocomplete&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/freebase"&gt;freebase&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/freebasesuggest"&gt;freebasesuggest&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/javascript"&gt;javascript&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/jquery"&gt;jquery&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/jsonp"&gt;jsonp&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/metadata"&gt;metadata&lt;/a&gt;&lt;/p&gt;



</summary><category term="autocomplete"/><category term="freebase"/><category term="freebasesuggest"/><category term="javascript"/><category term="jquery"/><category term="jsonp"/><category term="metadata"/></entry><entry><title>Amazon SimpleDB overview</title><link href="https://simonwillison.net/2007/Dec/14/amazoncom/#atom-tag" rel="alternate"/><published>2007-12-14T11:39:55+00:00</published><updated>2007-12-14T11:39:55+00:00</updated><id>https://simonwillison.net/2007/Dec/14/amazoncom/#atom-tag</id><summary type="html">
    
&lt;p&gt;&lt;strong&gt;&lt;a href="http://www.amazon.com/gp/browse.html?node=342335011"&gt;Amazon SimpleDB overview&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;
Attribute values are limited to 1,024 bytes; Amazon suggest that you store larger fields in S3 and use SimpleDB to query metadata about those objects.


    &lt;p&gt;Tags: &lt;a href="https://simonwillison.net/tags/amazon"&gt;amazon&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/metadata"&gt;metadata&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/s3"&gt;s3&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/simpledb"&gt;simpledb&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/web-services"&gt;web-services&lt;/a&gt;&lt;/p&gt;



</summary><category term="amazon"/><category term="metadata"/><category term="s3"/><category term="simpledb"/><category term="web-services"/></entry><entry><title>Audio Fingerprinting for Clean Metadata</title><link href="https://simonwillison.net/2007/Sep/13/lastfm/#atom-tag" rel="alternate"/><published>2007-09-13T17:46:54+00:00</published><updated>2007-09-13T17:46:54+00:00</updated><id>https://simonwillison.net/2007/Sep/13/lastfm/#atom-tag</id><summary type="html">
    
&lt;p&gt;&lt;strong&gt;&lt;a href="http://blog.last.fm/2007/08/29/audio-fingerprinting-for-clean-metadata"&gt;Audio Fingerprinting for Clean Metadata&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;
Last.fm have started using audio fingerprints to help clean up misspelled artists and duplicate track information.


    &lt;p&gt;Tags: &lt;a href="https://simonwillison.net/tags/audio"&gt;audio&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/audiofingerprinting"&gt;audiofingerprinting&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/lastfm"&gt;lastfm&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/metadata"&gt;metadata&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/mp3"&gt;mp3&lt;/a&gt;&lt;/p&gt;



</summary><category term="audio"/><category term="audiofingerprinting"/><category term="lastfm"/><category term="metadata"/><category term="mp3"/></entry><entry><title>Harper's Magazine</title><link href="https://simonwillison.net/2007/Apr/14/harpers/#atom-tag" rel="alternate"/><published>2007-04-14T00:05:37+00:00</published><updated>2007-04-14T00:05:37+00:00</updated><id>https://simonwillison.net/2007/Apr/14/harpers/#atom-tag</id><summary type="html">
    
&lt;p&gt;&lt;strong&gt;&lt;a href="http://www.harpers.org/"&gt;Harper&amp;#x27;s Magazine&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;
The site with the best metadata on the Web just relaunched, with even MORE metadata.

    &lt;p&gt;&lt;small&gt;&lt;/small&gt;Via &lt;a href="http://ftrain.com/SiteLaunch.html"&gt;Launch (Ftrain.com)&lt;/a&gt;&lt;/small&gt;&lt;/p&gt;


    &lt;p&gt;Tags: &lt;a href="https://simonwillison.net/tags/harpers"&gt;harpers&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/metadata"&gt;metadata&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/paul-ford"&gt;paul-ford&lt;/a&gt;&lt;/p&gt;



</summary><category term="harpers"/><category term="metadata"/><category term="paul-ford"/></entry><entry><title>A better definition of Metadata</title><link href="https://simonwillison.net/2003/Oct/1/metadataDefinition/#atom-tag" rel="alternate"/><published>2003-10-01T11:17:24+00:00</published><updated>2003-10-01T11:17:24+00:00</updated><id>https://simonwillison.net/2003/Oct/1/metadataDefinition/#atom-tag</id><summary type="html">
    &lt;p&gt;Ned Batchelder: &lt;a href="http://www.nedbatchelder.com/text/metadata-is-nothing-new.html"&gt;Metadata is nothing new&lt;/a&gt;. Ned includes a far better definition of metadata than the standard "data about data" phrase:&lt;/p&gt;

&lt;blockquote cite="http://www.nedbatchelder.com/text/metadata-is-nothing-new.html"&gt;&lt;p&gt;
Metadata is information about a thing, apart from the thing itself.
&lt;/p&gt;&lt;/blockquote&gt;

&lt;p&gt;Ned also offers clear examples of metadata in the real world and shows how the concept goes back thousands of years. If you still don't quite understand what the term means you won't have any excuses once you've read his article.&lt;/p&gt;
    
        &lt;p&gt;Tags: &lt;a href="https://simonwillison.net/tags/metadata"&gt;metadata&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/ned-batchelder"&gt;ned-batchelder&lt;/a&gt;&lt;/p&gt;
    

</summary><category term="metadata"/><category term="ned-batchelder"/></entry></feed>