<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>
<channel>
	<title>Comments on: BagIt</title>
	<atom:link href="http://inkdroid.org/journal/2008/06/06/bagit/feed/" rel="self" type="application/rss+xml" />
	<link>http://inkdroid.org/journal/2008/06/06/bagit/</link>
	<description>$pithy_personal_mission_statement</description>
	<pubDate>Thu, 04 Dec 2008 00:35:40 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.6.5</generator>
		<item>
		<title>By: Jim</title>
		<link>http://inkdroid.org/journal/2008/06/06/bagit/#comment-65445</link>
		<dc:creator>Jim</dc:creator>
		<pubDate>Tue, 24 Jun 2008 17:41:29 +0000</pubDate>
		<guid isPermaLink="false">http://inkdroid.org/journal/?p=203#comment-65445</guid>
		<description>I was pretty pleased when LC suggested we use the spec to deliver NDIIPP content to them.  Super low cost and easy to validate.  

Addressing Peter's comment regarding 'best before' dates, it's my suspicion that LC isn't in a position to fast-track data acquisition presently.  That doesn't mean that the spec, as implemented by others, wouldn't benefit from a data associated with data.</description>
		<content:encoded><![CDATA[<p>I was pretty pleased when LC suggested we use the spec to deliver NDIIPP content to them.  Super low cost and easy to validate.  </p>
<p>Addressing Peter&#8217;s comment regarding &#8216;best before&#8217; dates, it&#8217;s my suspicion that LC isn&#8217;t in a position to fast-track data acquisition presently.  That doesn&#8217;t mean that the spec, as implemented by others, wouldn&#8217;t benefit from a data associated with data.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Peter Binkley</title>
		<link>http://inkdroid.org/journal/2008/06/06/bagit/#comment-64650</link>
		<dc:creator>Peter Binkley</dc:creator>
		<pubDate>Fri, 06 Jun 2008 21:32:25 +0000</pubDate>
		<guid isPermaLink="false">http://inkdroid.org/journal/?p=203#comment-64650</guid>
		<description>We heard about this at the PASIG meeting in San Francisco last week - it looks very handy. I didn't pick up on the possibility of holey bags there, though. Given that this is intended for delivery rather than persistence, it might be helpful to allow a best-before date in fetch.txt (maybe per line). Since this will often be used to share ingestion-friendly files, which I may not want to leave lying around in web-accessible space, the date would let me say "get it by Friday cuz it may not be there next week".</description>
		<content:encoded><![CDATA[<p>We heard about this at the PASIG meeting in San Francisco last week - it looks very handy. I didn&#8217;t pick up on the possibility of holey bags there, though. Given that this is intended for delivery rather than persistence, it might be helpful to allow a best-before date in fetch.txt (maybe per line). Since this will often be used to share ingestion-friendly files, which I may not want to leave lying around in web-accessible space, the date would let me say &#8220;get it by Friday cuz it may not be there next week&#8221;.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Michael J. Giarlo</title>
		<link>http://inkdroid.org/journal/2008/06/06/bagit/#comment-64645</link>
		<dc:creator>Michael J. Giarlo</dc:creator>
		<pubDate>Fri, 06 Jun 2008 16:34:06 +0000</pubDate>
		<guid isPermaLink="false">http://inkdroid.org/journal/?p=203#comment-64645</guid>
		<description>One quick response to gsf's query about fetch.txt, "one question: why aren’t URLs in the fetch.txt preceded by a checksum?"

The checksums for files listed in fetch.txt are already listed in the manifest.  I'm not sure how useful the length param is since the checksum is already recorded elsewhere, but I suppose it's a good way to -quickly- check if the content from a URL is what you're looking for.</description>
		<content:encoded><![CDATA[<p>One quick response to gsf&#8217;s query about fetch.txt, &#8220;one question: why aren’t URLs in the fetch.txt preceded by a checksum?&#8221;</p>
<p>The checksums for files listed in fetch.txt are already listed in the manifest.  I&#8217;m not sure how useful the length param is since the checksum is already recorded elsewhere, but I suppose it&#8217;s a good way to -quickly- check if the content from a URL is what you&#8217;re looking for.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Gabriel Sean Farrell</title>
		<link>http://inkdroid.org/journal/2008/06/06/bagit/#comment-64642</link>
		<dc:creator>Gabriel Sean Farrell</dc:creator>
		<pubDate>Fri, 06 Jun 2008 13:47:58 +0000</pubDate>
		<guid isPermaLink="false">http://inkdroid.org/journal/?p=203#comment-64642</guid>
		<description>One thing I'm struck by right away is the similarity of of BagIt to the DSpace Simple Archive Format, which I'll call SAF even though the DSpace docs fail to employ such a lovely acronym.  The unfortunate URL for that section of the docs is http://www.dspace.org/index.php?option=com_content&#38;task=view&#38;id=144#itemimporter.  Anyway, as you can see there, SAF consists of an archive directory with item subdirectories.  Each item directory contains a dublin_core.xml describing the item and a "contents" file similar to the BagIt "manifest", but without the checksums.  

BagIt is obviously more flexible, and the fetch.txt is a banging idea (one question: why aren't URLs in the fetch.txt preceded by a checksum?).  It occurs to me now that it would be simple to send an SAF item or bunch of items within a Bag, though I'm not sure what the gain there would be.  In any case, that part of SWORD (i.e. converting from BagIt to something DSpace/Fedora/etc. can ingest) should be pretty simple, so you're probably right that the retrieval of "holey bags" will be the kicker.</description>
		<content:encoded><![CDATA[<p>One thing I&#8217;m struck by right away is the similarity of of BagIt to the DSpace Simple Archive Format, which I&#8217;ll call SAF even though the DSpace docs fail to employ such a lovely acronym.  The unfortunate URL for that section of the docs is <a href="http://www.dspace.org/index.php?option=com_content&amp;task=view&amp;id=144#itemimporter" rel="nofollow">http://www.dspace.org/index.php?option=com_content&amp;task=view&amp;id=144#itemimporter</a>.  Anyway, as you can see there, SAF consists of an archive directory with item subdirectories.  Each item directory contains a dublin_core.xml describing the item and a &#8220;contents&#8221; file similar to the BagIt &#8220;manifest&#8221;, but without the checksums.  </p>
<p>BagIt is obviously more flexible, and the fetch.txt is a banging idea (one question: why aren&#8217;t URLs in the fetch.txt preceded by a checksum?).  It occurs to me now that it would be simple to send an SAF item or bunch of items within a Bag, though I&#8217;m not sure what the gain there would be.  In any case, that part of SWORD (i.e. converting from BagIt to something DSpace/Fedora/etc. can ingest) should be pretty simple, so you&#8217;re probably right that the retrieval of &#8220;holey bags&#8221; will be the kicker.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
