<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>inkdroid &#187; archives</title>
	<atom:link href="http://inkdroid.org/journal/tag/archives/feed/" rel="self" type="application/rss+xml" />
	<link>http://inkdroid.org/journal</link>
	<description>$pithy_personal_mission_statement</description>
	<lastBuildDate>Wed, 28 Jul 2010 13:48:50 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
		<item>
		<title>history and genealogy at semwebdc</title>
		<link>http://inkdroid.org/journal/2010/03/22/history-and-genealogy-at-semwebdc/</link>
		<comments>http://inkdroid.org/journal/2010/03/22/history-and-genealogy-at-semwebdc/#comments</comments>
		<pubDate>Mon, 22 Mar 2010 08:41:57 +0000</pubDate>
		<dc:creator>ed</dc:creator>
				<category><![CDATA[dc]]></category>
		<category><![CDATA[semweb]]></category>
		<category><![CDATA[archives]]></category>
		<category><![CDATA[django]]></category>
		<category><![CDATA[genealogy]]></category>
		<category><![CDATA[history]]></category>
		<category><![CDATA[libraries]]></category>
		<category><![CDATA[oai-ore]]></category>
		<category><![CDATA[pbs]]></category>

		<guid isPermaLink="false">http://inkdroid.org/journal/?p=1809</guid>
		<description><![CDATA[spine CC BY 2.0 Last week&#8217;s Washington DC Semantic Web Meetup focused on History and Genealogy Semantics. It was a pretty small, friendly crowd (about 15-20) that met for the first time at the Library of Congress. The group included folks from PBS, the National Archives, the Library of Congress, and the Center for History [...]]]></description>
			<content:encoded><![CDATA[<div xmlns:cc="http://creativecommons.org/ns#" about="http://www.flickr.com/photos/spine/2076729686/" style="float: left; margin-right: 10px; text-align: center;"><a href="http://www.flickr.com/photos/spine/2076729686"><img src="http://inkdroid.org/images/familytree.jpg" style="height: 250px" /></a><br /><a href="http://www.flickr.com/photos/spine/">spine</a> <a rel="license" href="http://creativecommons.org/licenses/by/2.0/">CC BY 2.0</a>
</div>
<p>Last week&#8217;s <a href="http://semweb.meetup.com/31/">Washington DC Semantic Web Meetup</a> focused on <a href="http://semweb.meetup.com/31/calendar/12368900/">History and Genealogy Semantics</a>. It was a pretty small, friendly crowd (about 15-20) that met for the first time at the Library of Congress. The group included folks from <a href="http://pbs.org">PBS</a>, the <a href="http://archives.gov">National Archives</a>, the <a href="http://loc.gov">Library of Congress</a>, and the <a href="http://chnm.gmu.edu/">Center for History and New Media</a>&#8211;as well as some regulars from the <a href="http://www.eccnet.com/xmlug/">Washington DC SGML/XML Users Group</a>.</p>
<p><a href="http://semweb.meetup.com/31/members/6323472/">Brian Eubanks</a> gave a <a href="http://files.meetup.com/987383/History%20and%20Genealogy%20Semantics.pdf">presentation</a> on what the Semantic Web, <a href="http://linkeddata.org">Linked Data</a> and specifically <a href="http://en.wikipedia.org/wiki/Resource_Description_Framework">RDF</a> and <a href="http://www.w3.org/2004/03/trix/">Named Graphs</a> have to offer genealogical research. He took us on a tour through a variety of websites, such as <a href="http://www.blm.gov/or/landrecords">Land Records Database</a> at the Bureau of Land Management, <a href="http://www.ancestry.com/">Ancestry.com</a>, <a href="http://www.footnote.com/">Footnote</a> and <a href="http://books.google.com">Google Books</a> and made a strong case for using RDF to link these sorts of documents with a family tree. </p>
<p>As more and more historic records make their way online as Web Documents with URIs, RDF becomes an increasingly useful data model for providing provenance and source information for a family tree. On sites like Ancestry.com it is important to understand the provenance of genealogical assertions, since Ancestry.com allows you to merge other people&#8217;s family trees into your own, based on likely common ancestors. In situations like this researchers need to be able to evaluate the credibility or truthfulness of other people&#8217;s trees&#8211;and being able to source the family tree links to the documents that support them is an essential part of the equation.</p>
<p>Along the way Brian let people know about a variety of vocabularies that are available for making assertions that are of value to genealogical research: </p>
<ul>
<li><a href="http://www.w3.org/TR/rdfcal/#Locations">rdfcal</a> : for Events</li>
<li><a href="http://vocab.org/bio/0.1/.html">BIO</a> : for biographical information</li>
<li><a href="http://vocab.org/relationship/.html">Relationship</a> : for describing the links between people</li>
<li><a href="http://xmlns.com/foaf/spec/">FOAF</a> : for describing people</li>
<li><a href="http://www4.wiwiss.fu-berlin.de/bizer/TriG/">TriG</a> : for identifying the assertions that a researcher makes and linking them to a given document</li>
</ul>
<p>The beautiful thing about RDF for me, is that it&#8217;s possible to find and use these vocabularies in concert, and I&#8217;m not tempted to create the-greatest-genealogy-vocabulary that does it all. In addition, Brian pointed out that sites like <a href="http://dbpedia.org">dbpedia</a> and <a href="http://geonames.org">geonames</a> are great sources of names (URIs) for people, places and events that can be used in building descriptions. Brian has started the <a href="http://groups.google.com/group/history-and-genealogy-semantics-wg/">History and Genealogy Semantics Working Group</a> which has an open membership, and encourages anyone with interest in this area to join. While writing this post I happened to run across a Wikipedia page about <a href="http://en.wikipedia.org/wiki/Family_tree_mapping">Family Tree Mapping</a>, which indicated that some genealogical software already supports geocoding family trees. As usual it seems like the geo community is leading the way in making semantics on the web down to earth and practical. </p>
<p>I followed Brian by giving a brief talk about the <a href="http://chroniclingamerica.loc.gov">Chronicling America</a>, which is the web front-end for data collected by <a href="http://en.wikipedia.org/wiki/National_Digital_Newspaper_Program">National Digital Newspaper Program</a>, which in turn is a joint project of the <a href="http://loc.gov">Library of Congress</a> and the <a href="http://neh.gov">National Endowment for the Humanities</a>. After giving a brief overview of the program, I described how we were naturally led to using Linked Data and embracing a generally <a href="http://en.wikipedia.org/wiki/Representational_State_Transfer">RESTful</a> approach by a few factors:</p>
<ul>
<li>a need to create persistent <a href="http://www.w3.org/Provider/Style/URI">Cool URIs</a> for newspaper titles, issues and pages so that people could reference them.</li>
<li>the desire to make views available for the <a href="http://chroniclingamerica.loc.gov/awardees/">institutions</a> around the United States that supply us with <a href="http://chroniclingamerica.loc.gov/batches/">data</a></li>
<li>the need to make our data available as a participant in the  <a href="http://www.diggingintodata.org/">Digging Into Data Challenge</a></li>
<li>a desire to kick the tires on the relatively new <a href="http://www.openarchives.org/ore/1.0/vocabulary">Open Archives Initiative Object Reuse and Exchange vocabulary</a> for describing aggregations of resources on the Web so that they can be meaningfully harvested
</li>
</ul>
<p>One thing that I learned during Brian&#8217;s presentation is that sites like <a href="http://footnote.com">Footnote</a> are not only <a href="http://fcw.com/articles/2009/09/29/online-collection-helps-people-remember-holocaust.aspx">going around</a> digitizing historic collections for inclusion in their service,  but they also give their subscribers a rich editing environment to search and <em>annotate</em> document text. These annotations are exactly the sort of stuff that would be perfect to represent as and RDF graph, if you wanted to serialize the data. In fact the NSF funded <a href="http://www.openannotation.org/">Open Annotation Collaboration</a> project is exploring patterns and emerging best practices in this area. I&#8217;ve had it in the back of my mind that allowing users to annotate page content in Chronicling America would be a really nice feature to have. If not at chroniclingamerica.loc.gov proper, then perhaps showing how it could be done by a 3rd party using the API. To some extent we&#8217;re already seeing annotation happening in Wikipedia, where people are creating links to newspaper pages and titles in their entries, which we can see in the <a href="http://en.wikipedia.org/wiki/Referrer">referrer</a> information in our web server logs. <em>Update: and I just learned that wikipedia themselves provide a <a href="http://en.wikipedia.org/w/index.php?title=Special:LinkSearch">service</a> that allows you to discover entries that have outbound links to a particular site, like <a href="http://en.wikipedia.org/w/index.php?title=Special:LinkSearch&#038;target=http://chroniclingamerica.loc.gov&#038;limit=500&#038;offset=0">chroniclingamerica.loc.gov</a>.</em></p>
<p>Speaking of the API (which really is just REST) if you are interested in learning more about it check out the  <a href="http://chroniclingamerica.loc.gov/about/api/">API Document</a> that <a href="http://onebiglibrary.net">Dan Chudnov</a> prepared. I also made my <a href="http://docs.google.com/present/view?id=dv89m3d_38246f986km">slides</a> available, hopefully the speaker notes provide a bit more context for what I talked about when showing images of various things.</p>
<p><iframe src="http://docs.google.com/present/embed?id=dv89m3d_38246f986km" frameborder="0" width="410" height="342" style="float: left; margin-right: 10px;"></iframe></p>
<p>Afterwards a bunch of us headed across the street to have a drink. I was really interested to hear from <a href="http://twitter.com/samdeng">Sam Deng</a> that (like the group I work in at LC) <a href="http://pbs.org">PBS</a> are big <a href="http://python.org">Python</a> and <a href="http://djangoproject.com">Django</a> shop. We&#8217;re going to try to get a little brown bag lunch going on between PBS and LC to talk about their use of Django on <a href="http://aws.amazon.com/ec2/">Amazon EC2</a>, as well as software like <a href="http://celeryproject.org/">Celery</a> for managing asynchronous task queues. </p>
<p>Also, after chatting with <a href="http://twitter.com/GlennClatworthy">Glenn Clatworthy</a> of PBS, I learned that he has been experimenting with making Linked Data views available for their programs. It was great to hear Glenn describe how assigning each program a URI, and leveraging the nature of the web would make a perfect fit for distributing data in the PBS enterprise. It makes me think that perhaps having a session on what the <a href="http://bbc.co.uk">BBC</a> are <a href="http://derivadow.com/2009/03/31/linking-bbccouk-to-the-linked-data-cloud/">doing</a> with Linked Data would be timely?</p>
]]></content:encoded>
			<wfw:commentRss>http://inkdroid.org/journal/2010/03/22/history-and-genealogy-at-semwebdc/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>provide and enable</title>
		<link>http://inkdroid.org/journal/2008/06/18/provide-and-enable/</link>
		<comments>http://inkdroid.org/journal/2008/06/18/provide-and-enable/#comments</comments>
		<pubDate>Wed, 18 Jun 2008 18:05:24 +0000</pubDate>
		<dc:creator>ed</dc:creator>
				<category><![CDATA[politics]]></category>
		<category><![CDATA[publishing]]></category>
		<category><![CDATA[semweb]]></category>
		<category><![CDATA[archives]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[government]]></category>
		<category><![CDATA[rdf]]></category>
		<category><![CDATA[rdfa]]></category>

		<guid isPermaLink="false">http://inkdroid.org/journal/?p=204</guid>
		<description><![CDATA[I got a chance to meet Jennifer Rigby of the National Archives UK at the LinkedDataPlanet Conference in New York City (thanks Ian). Jennifer is the Head of IT Strategy, and told me lots of interesting stuff related to a profound shift they&#8217;ve had in their online strategies to: Provide and Enable So rather than [...]]]></description>
			<content:encoded><![CDATA[<p><img src="http://inkdroid.org/images/natarch.gif" style="float: left; border: thin inset gray; margin-right: 15px;" />I got a chance to meet Jennifer Rigby of the <a href="http://www.nationalarchives.gov.uk/">National Archives UK</a> at the <a href="http://linkeddataplanet.com">LinkedDataPlanet</a> Conference in New York City (thanks <a href="http://iandavis.com">Ian</a>).  Jennifer is the Head of IT Strategy, and told me lots of interesting stuff related to a profound shift they&#8217;ve had in their online strategies to:</p>
<blockquote><p>
<a href="http://www.nationalarchives.gov.uk/documents/provide-enable-summary.pdf">Provide and Enable</a>
</p></blockquote>
<p>So rather than pouring all their energy into making applications to visualize archival resources, the National Archives have recognized that making machine readable resources available to the public (in formats like RDF and RDFa) is really important to their core mission. In addition to <em>providing</em> services and data, they are trying to <em>enable</em> an ecosystem of innovation around their assets&#8211;or in their words:</p>
<p>• We will allow others to harness the power of our information, leading to a far wider range of products and services than we could provide ourselves.<br />
• We will continue to work with commercial partners to provide online access to millions of records.</p>
<p>Jennifer said we can look forward to an announcement around <a href="http://www.ukuug.org/events/opentech2008/">OpenTech2008</a> (July 5th) about a set of important publications that are going to made available by the Archives as RDF and RDFa. In addition I heard about how they work with website data harvested by <a href="http://archive.org">Internet Archive</a> to create a resolver service for transient publications on the web.</p>
<p>Hearing how a big organization like the National Archives can come to this realization of &#8220;Provide and Enable&#8221;, and then start to execute on it was really encouraging&#8211;and inspiring.  It is also refreshing to see people recognize, in writing the importance of semantic web technologies:</p>
<blockquote><p>We have started exploring new ideas and technologies, including using RDFa for publishing the Gazettes. The way we now publish legislation has a key role to play in the further development of the semantic web.</p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://inkdroid.org/journal/2008/06/18/provide-and-enable/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>
