Home | Distant Listening, Using Python and Apps Scripts to Text Mine and Tag Oral History Collections

Animated gif of the title of this paper with colored lines connecting reel to reel audio spools.

Git Repository for Transcript Text Mining Tool

Abstract

This article will provide a case study of new processes for creating subject tags across multiple oral history recordings. It outlines a workflow that empowers student workers to run, modify, and expand these tags during the copyediting process. The goal is to produce richer, more accurate tagging, allowing future researchers to more easily identify connections across audio collections. The paper provides a detailed description of the workflow, explores the challenges it addresses, shares pedagogical experiences of transcribers, and examines the limitations of data-driven, human-edited automated tagging.

Abstract

Contents: