Chapter 3: Working with MARC Data

Cuttering Tools

MarcEdit includes tools to generate general LC-based cutters or cutters based on the Sanborn 3 cuttering table.  These functions cannot be embedded into a task.

Generate Call Numbers

The generate call numbers tool leverages OCLC’s Classify API to allow users to automatically generate Dewey or LC based call numbers.  To tool provides a range of options that enable users to customize call number creation, output and evaluation.  This function cannot be embedded into a Task.

Build Linked Records

This tool utilizes a rules file to locate and embed linked data endpoints into your records.  This tool can be embedded in a task.

RDA Helper

The RDA Helper was created to provide a simple workflow to facilitate the process of generating RDA formatted records.  The RDA Helper not only facilitates field creation and updating, but also facilitates abbreviation expansion.  This tool can be embedded in a task.

Record Deduplication

The Record Deduplication tool can be used to locate and remove duplicate records using a wide range of matching criteria.  This tool cannot be embedded as a task.

Reporting Functionality

MarcEdit provides the ability to generate a wide range of reports about your data.  While reports will be discussed in more detail in a later chapter, a brief description of each type of report is found below.

Field Count

Field count reports provide an output of field usage both overall, and within records.   Users can further interrogate this report, by right clicking on a specific field to retrieve information regarding the indicator and subfield usage within a particular field – or one can generate a report – and retrieve all field/indicator usage within the specified file.

Material Types

The Material types report provides users with the ability to return not only counts, but isolate records that appear to be a particular material type.

Validators

MarcEdit includes a number of different types of validation reports.  There is the MarcValidator, which utilizes a rules file to determine field usage guidelines, and will report any fields that are in violation of those rules.  By default, MarcEdit uses a rules file that corresponds to RDA cataloging rules.  Additionally, the tool provides ISBN and ISSN validators.  These are mathematical validators that determine if an ISBN or ISSN is structurally valid.  These functions do not actually check to see if the ISBN or ISSN is assigned to the specific item represented by the record.

Headings Validation

Build on top of MarcEdit’s linked data framework, the Validate Headings tool allows users to validate 1xx, 6xx, and 7xx data against the U.S. Library of Congress’s authority file.  The tool generates a report, as well as has options to automatically download authorities, and generate brief personal authority records.

Integrations

One of the areas of significant development in MarcEdit in 2016, was around how MarcEdit integrates with other tools and services.  Over 2016, I developed integrations between MarcEdit and OCLC, Alma, OpenRefine, and enhanced the Koha integrations.  Likewise, I improved interoperability with Microsoft Office’s Excel and Access, to enable users to move data easily between programs.  In 2017, this work continued, with the development of a plugin synchronizing content between Internet Archive and the HathiTrust, as well as a plugin for generating KBART data from MARC records to simplify the sharing of local holdings data.