The National Archives
Search our website
  • Search our website
  • Search our records

Posts tagged 'digital preservation'

Trainee Tuesday: Tales from the Dark Archive

Checksums, dark archives, OAIS, trusted storage and ingest packages. No, these are not the vital components to some epic science fiction novel – although they are all terms that were completely alien to me before I started my Opening Up Archives Traineeship at Gloucestershire Archives. My Name is Tom Charnock and since April I have been working with all of these terms (and more!) on an almost daily basis as a large part of my traineeship is focused on Digital Preservation. Before I started at Gloucestershire, my knowledge of Digital Preservation was fairly minimal – I’d never even heard the phrase before. I did have a good idea what it the term meant when I was introduced to it, seeing as I have a fairly good grasp of computer technology, software and am a bit of a tech geek at heart…but as far as being actively involved with Digital Preservation? No.

Gloucestershire Archives' building

Gloucestershire Archives' building

That’s changed quite a bit in the five months that I’ve been at Gloucestershire. The first thing I had to learn to appreciate was what exactly the term ‘Digital Preservation’ actually means. At the most basic level, it clearly involves the preserving of digital objects, but there is so much more to it than that and, even though the learning curve has been a pretty steep one, I feel I’ve grasped both the concept and the actual practical implementation of the concept quite well.

Continue reading »

Writer’s Block…

Ask anyone in our department at The National Archives and they will say I’m never short of words… Okay, ask anyone out of half a dozen or more departments at The National Archives and they’ll pretty much agree too! Well, that was up to today I suppose. Perhaps it’s writer’s block, perhaps it’s just the natural wrapping up of my duties given that (note it down Wikipedia!) tomorrow, 7 September, is my last day at the organisation. It has been three years, three months and seven days since I started, a fresh-faced C++ developer from the Midlands. My humanities background was Digital Culture at Kings College London and, between you and me, I think I might have confused digitisation with digital preservation at my interview (they let me through the net though!)

In three years, I’ve seen quite a lot happen in the world of digital preservation. I thought my last blog post for The National Archives might be an opportunity to put a shout-out to some of the existing community projects and initiatives which have already done enormous amounts for the cause and look set to continue this trend for a long time.

Digital Preservation Coalition - Save the Bits

Digital Preservation Coalition

While I am sure I was introduced to the Digital Preservation Coalition long before this, in February 2010 Planets held one of its ‘The Planets Way’ training events in London. The first day of the event was in a conference format and, just after lunch, William Kilbride from the DPC took the opportunity to say a few words about the work they do. The statement he made to the room that resonated with me to this day, and a sentiment that can make us all smile in digital preservation, was (to paraphrase):

“Once you solve the problem of digital preservation, I can retire.”

Continue reading »

Be Not Afraid: help is at hand for archivists

As a sector, archives are well aware of the impact of the changing way we record and share information. The digital challenge is something that concerns us all. However, it may seem almost too daunting to start, particularly for those archivists working in smaller, less well-resourced archive services. Luckily, it has been a good summer for advice and guidance to help us all to tackle this challenge.

 

A screengrab of Manchester Archives+' flickr photostream showing six sets

A screengrab of Manchester Archives+ flickr photostream: digital engagement is becoming a vital part of archive work across the UK

 

The Heritage Lottery Fund has published Thinking About Good Digital Practice, a guide to support their new policy which for the first time opens the fund to primarily digital projects. The guidance is helpful well beyond those intending to bid for Lottery funds, as it encourages effective planning and advises on how to get the best out of a project involving digital content. It outlines different options for digital projects and reminds us all of some of the key points to planning successful projects, particularly: what do you want to achieve and who is your target audience? Answers to those questions are critical if your work is to have the impact you hope and to justify the investment you are planning. Some of this advice is important for any project, digital or otherwise, but the guidance also gives a host of digital-specific information such as example costs for aspects of a digital project, good practice for file names and metadata, and some really helpful guidance about rights and permissions, issues which can seem very intimidating to newcomers to the subject. The guidance also helps to identify the staff and skills needed, the options for digital outputs, and how to keep digital outputs accessible, safe and findable in future.

Continue reading »

When 900 years old, you reach… Look as good, you will not…

I think that was the quote we were looking for? Ok, maybe not but If I mention the word DROID you might figure the right one out!

Tenuous links over, in Digital Preservation today we’ve released a new version of the DROID (Digital Record and Object Identification) tool – version 6.1. We’ve spoken about the tool before when I blogged about the PRONOM and DROID user consultation we held at The National Archives last year. The day resulted in a consultation wiki where contribution is invited by all members of the public with an interest in a potential DROID 7. The wiki page lists requirements that users of the tool have for DROID 7 and all future versions.

DROID 6.1 User Interface

Hedgehog Street and Linked Open Data

The best way to explain the title of this blog is to begin by quoting directly from the Hedgehog Street website:

“Through Hedgehog Street, we are asking people to become Hedgehog Champions to rally support from their neighbours and work together to create ideal hedgehog habitat throughout their street, estate or communal grounds.”

I saw this initiative on BBC Springwatch a while back, specifically, one simple thing we can all do to become Hedgehog Champions – link your garden. Again to quote the Hedgehog Street website:

“Hedgehogs travel around one mile every night through our parks and gardens in their quest to find enough food and a mate. If you have an enclosed garden you might be getting in the way of their plans. Hedgehogs have enough barriers to contend with such as roads and rivers that we can’t do much about. However we can make their life a little easier by removing the barriers within our control – for example making holes in or under our garden fences and walls for them to pass through. The gap need only be around 15cm in diameter and so should not affect your pets’ safety.”

The idea of doing something so simple to protect our cute friends is a nice one. We’re converting one garden into hundreds, and combined with more naturally occurring wildlife corridors, potentially thousands. This is what we’re doing when we link data, the gardens represent our data and datasets and the link we’ve created gives users and machines unrestricted access to navigate from one dataset to another. It’s an almost perfect analogy – an analogy which I hope will help to open up the concept to all our readers, technical and non-technical alike.

Linky - The Linked Data Hedgehog

Continue reading »

An introduction to the PRONOM contribution model and the Signature Developer role

My name is David Clipsham and I have been employed as the File Format Signature Developer for a month, having previously worked as Customer Service Manager for the cross-government social collaboration tool, Civil Pages. My role is to improve the coverage of The National Archives’ PRONOM file format registry. The internal and external signature information contained in the PRONOM registry is utilised by our file format identification tool DROID, which is used to identify file formats so we can make informed decisions about the long term preservation of digital records.

The PRONOM contribution model

My day is typically spent researching obscure and not-so-obscure file formats, picking through the internal code of each format and identifying the key characteristics that make the file format what it is, as described in Ross Spencer’s recent blog post. I then recreate the key byte sequences, test them against sample files and upload them to PRONOM, ready for our bi-monthly signature release.

How do I focus my research?

Continue reading »

Ancient Deeds and the Semantic Cardigan

Last week, some of us from The National Archives were privileged to spend a day in Cambridge with over 50 people from the heritage spheres gathered for the Digital Preservation Coalition event, Links that Last.

(For those new to the concept of linked data, I have no hesitation in recommending this Wikipedia article: ‘linked data describes a method of publishing structured data so that it can be interlinked and become more useful. It builds upon standard Web technologies such as HTTP and URIs but rather than using them to serve web pages for human readers, it extends them to share information in a way that can be read automatically by computers.’)

As the Links that Last programme puts it:

‘The emerging ‘Linked Data’ approach … challenges us to think about preservation in new ways. Simultaneously, the digital preservation community has put considerable effort into the development of persistent identifiers, services that seek to ensure that essential links are not lost and … that the highly distributed contexts in which information is presented are protected against the vagaries of time and obsolescence’.

CAFED00Ds and CAFEBABEs

Wouldn’t it be cool if every digital file created could be identified with a signature or ‘magic number’ of some kind? This would make preservation, and the concept of knowing what you’ve got in order to be able to preserve it, that much easier.

By design or otherwise, for some file formats this is actually the case. The title of today’s blog post provides two such magic numbers used to identify Java class objects and Java pack200 files. These aren’t the first examples to use magic numbers but 0xCAFEBABE is the one I find the most striking as an introduction to the concept. You can read more on the origins of 0xCAFEBABE at Wikipedia. Continue reading »