NLP and Machine Learning for Born-Digital Materials - Part 2 (Collections Containing Email)
Christopher Lee, Kam Woods
University of North Carolina, United States of America
This workshop is about the use of open-source natural language processing (NLP) and machine learning (ML) tools to process and provide access to born-digital materials. Part two will focus on applying topic modeling and named entity recognition to characterize and explore the contents of removable storage media (e.g., floppy disks and optical media).