Open Access Open Access  Restricted Access Subscription Access

Chapter 4. An Exploration of Machine Learning in Libraries

Craig Boman


Chapter 4 of Library Technology Reports (vol. 55, no. 1), "An Exploration of Machine Learning in Libraries"

In this chapter, contributing author Craig Boman explores the use of latent Dirichlet allocation (LDA), a type of machine learning model, in the generation of library subject headings.

Full Text:



Cornell University Research Data Management Service Group, “Metadata and Describing Data,” accessed October 10, 2018,

Roy Tennant, “MARC Must Die,” Digital Libraries, LJ Infotech, Library Journal 127, no. 17 (October 15, 2002): 26–27.

Jasmine Aguilera, “Another Word for ‘Illegal Alien’ at the Library of Congress: Contentious,” New York Times, July 22, 2016,

Derek Hawkins, “The Long Struggle over What to Call ‘Undocumented Immigrants’ or, as Trump Said in His Order, ‘Illegal Aliens,’” Washington Post, February 9, 2017,

Melissa A. Adler, “The ALA Task Force on Gay Liberation: Effecting Change in Naming and Classification of GLBTQ Subjects,” Advances in Classification Research Online 23, no. 1 (2013):

Thomas G. Padilla, “Collections as Data: Implications for Enclosure,” College and Research Libraries News 79, no. 6 (June 2018): 296–300,

Chris Bourg, “What Happens to Libraries and Librarians When Machines Can Read All the Books?” Feral Librarian (blog), March 16, 2017,

Safiya Umoja Noble, Algorithms of Oppression: How Search Engines Reinforce Racism (New York: New York University Press, 2018).

Bourg, “What Happens to Libraries?”

Rong Ge, “Lecture 1: Machine Learning Basics” (slide presentation, COMPSCI 590.7—Algorithmic Aspects of Machine Learning, Duke University Department of Computer Science, Fall 2015),

David M. Blei, Andrew Y. Ng, Michael I. Jordan, and John Lafferty, “Latent Dirichlet Allocation,” Journal of Machine Learning Research 3, no. 4/5 (2003): 993–1022.

Julia Silge and David Robinson, “Topic Modeling,” chapter 6 in Text Mining with R: A Tidy Approach (Sebastopol, CA: O’Reilly Media, 2017),

Théo Vanderheyden, “Pickle in Python: Object Serialization,” DataCamp, April 5, 2018,

Jason Brownlee, Machine Learning Mastery website, accessed October 10, 2018,

Michael Dudley, “Algorithms Don’t Think about Race. So Tech Giants Need To,” The Decolonized Librarian (blog), February 7, 2017,


  • There are currently no refbacks.

Published by ALA TechSource, an imprint of the American Library Association.
Copyright Statement | ALA Privacy Policy