All posts by jwbaker

James Baker is Director of Digital Humanities at the University of Southampton. James is a Software Sustainability Institute Fellow, a Fellow of the Royal Historical Society, and holds degrees from the University of Southampton and latterly the University of Kent, where in 2010 he completed his doctoral research on the late-Georgian artist-engraver Isaac Cruikshank. James works at the intersection of history, cultural heritage, and digital technologies. He is currently working on a history of knowledge organisation in twentieth century Britain. In 2021, I begin a major new Arts and Humanities Research Council funded project 'Beyond Notability: Re-evaluating Women’s Work in Archaeology, History and Heritage, 1870 – 1950'. Previous externally funded research projects have focused on legacy descriptions of art objects ('Legacies of Catalogue Descriptions and Curatorial Voice: Opportunities for Digital Scholarship', Arts and Humanities Research Council), the preservation of intangible cultural heritage ('Coptic Culture Conservation Collective', British Council, and 'Heritage Repertoires for inclusive and sustainable development', British Academy), the born digital archival record ('Digital Forensics in the Historical Humanities', European Commission), and decolonial futures for museum collections ('Making African Connections: Decolonial Futures for Colonial Collections', Arts and Humanities Research Council). Prior to joining Southampton, James held positions of Senior Lecturer in Digital History and Archives at the University of Sussex and Director of the Sussex Humanities Lab, Digital Curator at the British Library, and Postdoctoral Fellow with the Paul Mellon Centre for Studies in British Art. He is a member of the Arts and Humanities Research Council Peer Review College, a convenor of the Institute of Historical Research Digital History seminar, a member of The Programming Historian Editorial Board and a Director of ProgHist Ltd (Company Number 12192946), and an International Advisory Board Member of British Art Studies.

Archiving the age of personal computing

This week I finished a small pilot project ‘archiving’ some of the data storage devices held at University of Sussex Special Collections. My interest in this area is predicated on the premise that the paper archive has been replaced by the hard disk. In just 25 years most people in Britain and worldwide have come to create information in a new way. In turn, both archivists and historians will need figure out how to deal with data storage devices in archival contexts (for more on this see my recent talk ‘Hard disks as archives of everyday life‘. Thankfully, open source digital forensics tools that enable archivists to preserve and curate these born-digital archives have made huge strides in recent years thanks largely to the efforts of the BitCurator project led by University of North Carolina Chapel Hill.

Together with my wonderful colleagues in the Library and Special Collections, this pilot used BitCurator to capture a small number of data storage devices with a view to:

  1. Creating evidence around processes and outputs that can feed into future data management plans
  2. Preserving and stabilising these materials in line with best practice
  3. Establishing a set of shared principles in anticipation of future born digital deposits

To achieve this I spent a couple of days in The Keep (where the Special Collections are held) make captures of media devices including floppy disks, USB sticks, and CDs, most of which held institutional records. Working through these helped me playtest and iterate a step-by-step Processing Workflow for Digital Media using BitCurator that I’d developed in advance. This workflow is available on GitHub under a CC BY-SA license, so do share, use, and build on it. And if you are an archivist using BitCurator to capture comparable archives I’d be grateful for your input!

My digital forensics setup! (all media dummy media; mass of various cables not shown!).
My digital forensics setup! – laptop with BitCurator as native install, big external flash drive for storage, 3½-inch floppy drive, Wiebetech Forensic Combodock, mass of various cables not shown (NB: all media dummy media).

Having worked with born-digital collections whilst at the British Library, it was great to get stuck into them again. Whilst I was at The Keep I also took the opportunity to survey response to two Mass Observation Directives (Summer 2004 Letters and Emails, Summer 2015 You Online) for descriptions of how people remembered and experienced interactions with computers. As I plot research that will use both born-digital and physical archives as primary sources of historical change in the age of the personal computer, it is encouraging to see great work in the area bubbling up. The first workshop of AHRC-funded ‘Born digital big data and approaches for history and the humanities’ research network took place this month, showcase the growing maturity of humanities approaches to born-digital data. And last month Matthew G. Kirschenbaum’s (very excellent) Track Changes appeared, a book that traces literary encounters with word-processing. As one of the BitCurator project, Kirschenbaum offers some astute observation on using born-digital archives in humanities research. For example, during a discussion of the reliability – or otherwise – of created and modified date/time file metadata (in short, how do we know the clock of a computer was set correctly and in the case of a frequent flier to which time zone?), he writes:

A scholar working with such materials must be conversant in the antiquarian cants of vanished operating systems, file formats, and emulators, just as we expect an early modernist doing book history to know something of signatures and collation formulas (233)

Unsurpringly perhaps, I couldn’t agree more.