Document detail

Archives, access and artificial intelligence: working with born-digital and digitised archival collections

Bielefeld: Bielefeld University Press; transcript (2021), 330 pp., 20 illustr.
ISBN 978-3-8376-5584-1 (print); 978-3-8394-5584-5 (pdf) CC BY-NC
"Cultural heritage organizations face at least three main challenges. First, the volume of digital archives makes it extremely difficult for archivists to assess records. Applying Artificial Intelligence (AI) and machine learning (ML) to archives is still at an experimental stage, but AI/ ML could become an integral part of archival processes. To manage the sheer bulk and potential sensitivity of records, archivists will also rely on creators to help them make appraisal and selection decisions at the point of deposit. Second, most born-digital collections are currently closed due to a wide range of reasons (including technical issues, copyright, and data protection). Regardless of whether archives are digital or not, archivists need to balance individual rights and the public interest in the context of the General Data Protection Regulation (GDPR) in Europe. Nobody would reasonably claim that all born-digital data should be unlocked and openly accessible. Yet, it is important to recognize that “dark” archives contain vast amounts of data essential to scholars – including email correspondence, drafts of manuscripts, digital photos and videos. Within current legal frameworks, making born-digital archives more accessible is an urgent priority to fully make sense of our cultural heritage. Third, data science and AI are becoming essential tools, but very few scholars (particularly in the humanities) have been trained to master these research methods, a skills gap which in turn has an impact on the training we offer to students." (Introduction, p.7-8)
Contents
Introduction / Lise Jaillant, 7
1 Artificial Intelligence and Discovering the Digitized Photoarchive / X.Y. Han, Vardan Papyan, Ellen Prokop, David L. Donoho, C. Richard Johnson, 29
2 Web Archives and the Problem of Access: Prototyping a Researcher Dashboard for the UK Government Web Archive / Mark Bell, Tom Storrar, Jane Winters, 61
3 Design Thinking, UX and Born-digital Archives: Solving the Problem of Dark Archives Closed to Users / Lise Jaillant, 83
4 Towards Critically Addressable Data for Digital Library User Studies / Paul Gooding, 109
5 Reviewing the Reviewers: Training Neural Networks to Read Peer Review Reports / Martin Paul Eve, Birkbeck, Robert Gadie, Victoria Odeniyi, Shahina Parvin, 131
6 Supervised and Unsupervised: Approaches to Machine Learning for Textual Entities / Tobias Hodel, 157
7 Inviting AI into the Archives: The Reception of Handwritten Recognition Technology into Historical Manuscript Transcription / Melissa Terras, 179
AFTERWORD: Towards a new Discipline of Computational Archival Science (CAS) / Richard Marciano, 205