If Twitter is sitting on a goldmine of data, someone needs to create a better method of digging it up. The Library of Congress was given access to the entire firehose of tweets in 2010, but researchers at the institution are still scratching their heads over how to organize and display the collection, which currently totals 170 billion tweets dating back to 2006.
Library of Congress
The Library of Congress has also come to an agreement with Amazon Europe to make thousands of books in the public domain available in France and Germany using print on-demand technology. The library will give CreateSpace and Amazon Europe’s print on-demand business digital copies of scanned public domain books in their collections to be manufactured on-demand.
Back in February, The Library of Congress partnered with The British Library to bring about 65,000 public domain titles into its collection.
In June, Google struck a 30 million euro deal with Austria’s national library Osterreichische Nationalbibliothek (ONB) to digitize 400,000 copyright-free books from its collection.
The U.S. Library of Congress announced earlier today that they would be acquiring the entire archive of tweets from Twitter. Not surprisingly, the announcement was accompanied by a tweet whose viral retweeting might explain why their blog is down at the time of this writing.