As a follow-up to my last post, I added a script to my fork of Aaron's py-flarchive that loads a Redis instance with the comments, notes, tags and sets for Flickr images uploaded by Brooklyn Museum. The script assumes you have a snapshot of the archived metadata, which I downloaded as a tarball. It took several hours to unpack the tarball on a medium EC2 instance, so if you want to play around and just want the Redis database, let me know and I'll get it to you.
Once I loaded up Redis I was able to generate some high-level stats:
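To give a rough idea of what loading this kind of metadata into Redis can look like, here is a minimal sketch. It is not the actual script: the record layout (`id`, `tags`, `comments`, `notes`, `sets`) and every key name are assumptions made up for illustration. The `client` argument is anything with redis-py style `sadd` and `hset` methods.

```python
def load_photo(client, photo):
    """Store one photo's tags, comments, notes and sets in Redis.

    The key names and the shape of `photo` are hypothetical; the real
    archive and the real loader may be organized quite differently.
    """
    photo_id = photo["id"]
    client.sadd("photos", photo_id)

    # Tags: one set of tag text per photo, plus a global set of authors.
    for tag in photo.get("tags", []):
        client.sadd(f"photo:{photo_id}:tags", tag["text"])
        client.sadd("authors", tag["author"])

    # Comments and notes: one hash per item, indexed from the photo.
    for kind in ("comments", "notes"):
        for item in photo.get(kind, []):
            client.hset(
                f"{kind}:{item['id']}",
                mapping={
                    "photo": photo_id,
                    "author": item["author"],
                    "text": item["text"],
                },
            )
            client.sadd(f"photo:{photo_id}:{kind}", item["id"])
            client.sadd("authors", item["author"])

    # Sets: track membership in both directions.
    for set_id in photo.get("sets", []):
        client.sadd("sets", set_id)
        client.sadd(f"set:{set_id}:photos", photo_id)
```

With the redis-py package installed and a server running locally, `client` could be `redis.Redis(decode_responses=True)`, and `load_photo` would be called once per photo record in the unpacked archive.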
- images: 5,697
- authors: 4,617
- tags: 6,132
- machine tags: 933
- comments: 7,353
- notes: 963
- sets: 141
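Counts like these boil down to a handful of sets and tallies. The sketch below shows one way to compute them from in-memory records, treating an author as anyone who added a tag, comment or note. The record layout and the `machine` flag are assumptions for illustration, and I'm assuming tags are counted as distinct tag text rather than as individual taggings.

```python
def summarize(photos):
    """Tally images, unique authors, tags, machine tags, comments, notes, sets."""
    authors, tags, machine_tags, set_ids = set(), set(), set(), set()
    comments = notes = 0

    for photo in photos:
        for tag in photo.get("tags", []):
            authors.add(tag["author"])
            # Hypothetical flag marking namespace:predicate=value tags.
            if tag.get("machine"):
                machine_tags.add(tag["text"])
            else:
                tags.add(tag["text"])
        for comment in photo.get("comments", []):
            authors.add(comment["author"])
            comments += 1
        for note in photo.get("notes", []):
            authors.add(note["author"])
            notes += 1
        set_ids.update(photo.get("sets", []))

    return {
        "images": len(photos),
        "authors": len(authors),
        "tags": len(tags),
        "machine tags": len(machine_tags),
        "comments": comments,
        "notes": notes,
        "sets": len(set_ids),
    }


# Made-up sample records, only to show the shape of the input.
sample = [
    {
        "id": "1",
        "tags": [
            {"author": "alice", "text": "egypt"},
            {"author": "bob", "text": "geo:lat=22.3", "machine": True},
        ],
        "comments": [{"author": "alice", "text": "very nostalgic..."}],
        "notes": [],
        "sets": ["s1"],
    },
    {
        "id": "2",
        "tags": [{"author": "carol", "text": "egypt"}],
        "comments": [],
        "notes": [{"author": "bob", "text": "Lake Michigan"}],
        "sets": ["s1", "s2"],
    },
]

for name, count in summarize(sample).items():
    print(f"- {name}: {count:,}")
```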
For a collection of 5,697 images, 4,617 is an astonishing number of authors: unique people who added tags, comments or notes. If you are curious, I generated a list of the tags and saved them as a Google Doc. The machine tags were particularly interesting to me. The majority (849) of them look like Brooklyn Museum IDs of some kind, for example:
But there were also 51 geotags, and what look like 23 links to items in Pleiades, for example:
If I had to guess, I'd say this particular machine tag indicates that the Brooklyn Museum image depicts Abu Simbel. There weren't tons of these machine tags, but it's important to remember that other people use Flickr as a scratch space for annotating images this way.
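Machine tags follow a `namespace:predicate=value` shape, which makes them easy to sort into buckets like the ones above. Here is a small sketch of that kind of grouping. The regular expression is a simplified guess at the syntax (Flickr's exact rules may differ), and the sample tags, including the Pleiades ID, are placeholders rather than values from the archive.

```python
import re
from collections import Counter

# Simplified pattern for namespace:predicate=value; an assumption,
# not Flickr's official grammar.
MACHINE_TAG = re.compile(
    r"^(?P<namespace>[A-Za-z][A-Za-z0-9_]*)"
    r":(?P<predicate>[A-Za-z][A-Za-z0-9_]*)"
    r"=(?P<value>.*)$"
)


def parse_machine_tag(raw):
    """Return (namespace, predicate, value), or None for an ordinary tag."""
    match = MACHINE_TAG.match(raw)
    if match is None:
        return None
    return match["namespace"], match["predicate"], match["value"]


def count_namespaces(tags):
    """Count how many machine tags fall under each namespace."""
    parsed = (parse_machine_tag(tag) for tag in tags)
    return Counter(p[0] for p in parsed if p is not None)


# Placeholder tags for illustration only.
sample_tags = [
    "geo:lat=22.3372",
    "geo:lon=31.6258",
    "pleiades:depicts=000000",
    "egypt",
]

for namespace, count in count_namespaces(sample_tags).most_common():
    print(namespace, count)
```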
If you aren't familiar with them, Flickr notes are annotations of an image, where the user has attached a textual note to a region in the image. Just eyeballing the list, it appears that there is quite a bit of diversity in them, ranging from the whimsical:
- cool! they look soo surreal
- teehee somebody wrote some graffiti in greek
- Lol are these painted?
- Steaks are ready!
to the seemingly useful:
- Hunter's Island
- Ramesses III Temple
- Lapland Village
- Lake Michigan
- Montuemhat Crypt
- Napoleon's troops are often accused of destroying the nose, but they are not the culprits. The nose was already gone during the 18th century.
Similarly, the general comments run the gamut:
- very nostalgic...
- always wanted to visit Egypt
- Just a few points. This is not 'East Jordan' it is in the Hauran region of southern Syria. Second it is not Qarawat (I guess you meant Qanawat) but Suweida. Third there is no mention that the house is enveloped by the colonnade of a Roman peripteral temple.
- The fire that destroyed the buildings was almost certainly arson. it occurred at the height of the Pullman strike and at the time, rightly or wrongly, the strikers were blamed.
- You can see in the background, the TROCADERO with two towers .. This "medieval city" was built on the right bank where are now buildings in modern art style erected for the exposition of 1937.
Brooklyn Museum carried 48 tags over from Flickr before they deleted the account. That's just 0.7% of the tags that were there. None of the comments or notes were moved over.
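As a quick check on that percentage, here is the arithmetic using the counts from the stats above. Which denominator applies is an assumption on my part, so the sketch computes both: regular tags only, and regular tags plus machine tags.

```python
carried_over = 48
tags = 6132
machine_tags = 933

# Share of regular tags only.
share_of_tags = 100 * carried_over / tags
# Share of regular tags and machine tags combined.
share_of_all = 100 * carried_over / (tags + machine_tags)

print(f"{share_of_tags:.1f}% of tags")
print(f"{share_of_all:.1f}% of tags plus machine tags")
# prints:
# 0.8% of tags
# 0.7% of tags plus machine tags
```

Either way, well under 1% of the tagging made it across.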
In the data that Aaron archived there was one indicator of user engagement: the datetime included with comments. Combined with the upload time for the images it was possible to create a spreadsheet that correlates the number of comments with the number of uploads per month:
I'm guessing the drop-off in December of 2013 is because that was the last time Aaron archived Brooklyn Museum's metadata. You can see that there was a decline in user engagement: the peak in late 2008 / early 2009 was never matched again. I was half expecting to see that user engagement fell off when Brooklyn Museum's interest in the platform (uploads) fell off. But you can see that they continued to push content to Flickr without seeing much of a reward, at least in the shape of comments. It's impossible now to tell if tagging, notes or sets trended differently.
Since Flickr includes the number of times each image was viewed, it's also possible to total the views across all the images. The answer?
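The monthly tallies behind a comparison like this are simple to produce. Below is a minimal sketch that buckets upload times and comment times by month and lines them up side by side. It assumes both are available as Unix timestamps; the function names and the sample values are made up for illustration.

```python
from collections import Counter
from datetime import datetime, timezone


def month_key(timestamp):
    """Turn a Unix timestamp into a sortable 'YYYY-MM' label (UTC)."""
    return datetime.fromtimestamp(timestamp, tz=timezone.utc).strftime("%Y-%m")


def monthly_counts(upload_times, comment_times):
    """Return (month, uploads, comments) rows, sorted by month."""
    uploads = Counter(month_key(t) for t in upload_times)
    comments = Counter(month_key(t) for t in comment_times)
    months = sorted(set(uploads) | set(comments))
    # A Counter returns 0 for months missing on one side.
    return [(month, uploads[month], comments[month]) for month in months]


def ts(year, month, day):
    """Helper for building made-up sample timestamps."""
    return datetime(year, month, day, tzinfo=timezone.utc).timestamp()


# Made-up sample data, not values from the archive.
sample_uploads = [ts(2008, 12, 5), ts(2008, 12, 20), ts(2009, 1, 3)]
sample_comments = [ts(2008, 12, 6), ts(2009, 1, 4), ts(2009, 1, 9), ts(2009, 2, 1)]

print("month,uploads,comments")
for month, n_uploads, n_comments in monthly_counts(sample_uploads, sample_comments):
    print(f"{month},{n_uploads},{n_comments}")
```

The comma-separated output can be pasted straight into a spreadsheet for charting.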
Not a bad run for 5,697 images. I don't know if Brooklyn Museum downloaded their metadata prior to removing their account. But luckily Aaron did.