Skip to main content

Artificial Intelligence for matching MARC records

Can AI be used for matching MARC records? We match bibliographic records from different institutions for deduplication. Most of the records we process have OCLC numbers, but records lacking them can’t be deduplicated. We have no automated process for auditing whether OCLC numbers were correctly assigned when cataloged. After years of relying on OCLC numbers, we began exploring additional methods for matching MARC records. We’ve experimented with fuzzy matching and natural language processing techniques in conjunction with a variety of machine learning models (logistic regression, random forests, decision trees, and neural networks). This talk will discuss our exploration of AI and our current model, share our obstacles, and outline our next steps.


11:25 AM
20 minutes