Step 3.0

Revision as of 14:24, 14 April 2021

Authority Cleanup Overview

The first phase of MARS 2.0 Authority Control comprises a battery of routines that update and correct individual subfields and contiguous pairs of subfields, the purpose of which is to increase the likelihood of finding the appropriate authority match.

These corrections are based on a number of subfield update tables, maintained by MARS 2.0 authorities librarians.

MARS 2.0 subfield correction routines include:

  • Updating obsolete forms of subdivisions to the current form
  • Correcting common typographical errors
  • Expanding abbreviations in subject subfields to their fuller form
  • Converting common direct geographic subdivisions to their indirect form
  • Deleting subject subdivisions which have been canceled or discontinued
  • Correcting spacing, capitalization, and punctuation

Update obsolete subdivisions

MARS 2.0 uses a number of subfield correction tables to correct common errors in LC subfields, such as obsolete forms of subdivisions:

 — Relations (General) with the United States

   changes to:
 — Relations — United States

Correct typographical errors

MARS 2.0 also uses the subfield correction tables to correct common spelling errors in LC subdivisions:

Error | Changes to | In Field / Subfield
Histroy | History | LC 6XX $x
Untied States | United States | LC 650 $z and 651 $a

Expand abbreviations

The subfield correction tables also support the expansion of outdated or invalid abbreviations in LC headings to the full form. Changes are made only when the outdated or invalid form is the entire text of the subfield:

Outdated / Invalid | Changes to | In Field / Subfield
Hist. & crit. | History and criticism | LC 6XX $x
U.S. | United States | LC 651 $a, X10 $a, and 6XX $z
Econ. cond. | Economic conditions | LC 6XX $x
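
The whole-subfield rule can be illustrated with a minimal Python sketch. The table contents are taken from the examples above; the names CORRECTIONS, correct_subfield, and correct_heading are hypothetical, field tag checks are omitted, and nothing here reflects the actual MARS 2.0 implementation.

```python
# Hypothetical correction table keyed by (subfield code, exact subfield text).
# The entries mirror the examples above; the real MARS 2.0 tables are not shown here.
CORRECTIONS = {
    ("x", "Histroy"): "History",
    ("z", "Untied States"): "United States",
    ("x", "Hist. & crit."): "History and criticism",
    ("x", "Econ. cond."): "Economic conditions",
    ("z", "U.S."): "United States",
}


def correct_subfield(code, text):
    """Return the corrected text, changing it only on a whole-subfield match."""
    return CORRECTIONS.get((code, text.strip()), text)


def correct_heading(subfields):
    """Apply the table to every (code, text) pair in a heading."""
    return [(code, correct_subfield(code, text)) for code, text in subfields]
```

Because the lookup key is the entire subfield text, a subfield such as "U.S. history" is left alone, while a subfield consisting only of "U.S." is expanded.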

Direct-to-indirect geographic conversion

MARS 2.0 uses a table to convert direct geographic subdivisions to the indirect form. Changes are made by the direct-to-indirect subfield conversion program only when the invalid form is the entire text of the $z and there is only one $z in the heading:

Direct Subdivision | Changes to | In Field / Subfield
$z Paris | $z France $z Paris | LC 6XX fields
$z Jefferson Co., Kan. | $z Kansas $z Jefferson County | LC 6XX fields
$z Jefferson County, Kan. | $z Kansas $z Jefferson County | LC 6XX fields
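
The two conditions (the direct form is the entire $z, and the heading has exactly one $z) might be sketched as follows. This is an illustration only: DIRECT_TO_INDIRECT and to_indirect are hypothetical names, and the table holds just the three examples above.

```python
# Hypothetical table: direct form of $z -> list of indirect $z values.
DIRECT_TO_INDIRECT = {
    "Paris": ["France", "Paris"],
    "Jefferson Co., Kan.": ["Kansas", "Jefferson County"],
    "Jefferson County, Kan.": ["Kansas", "Jefferson County"],
}


def to_indirect(subfields):
    """Convert a lone direct $z to its indirect form; otherwise return the heading unchanged."""
    z_count = sum(1 for code, _ in subfields if code == "z")
    if z_count != 1:
        return list(subfields)  # zero or several $z: leave the heading alone
    result = []
    for code, text in subfields:
        if code == "z" and text in DIRECT_TO_INDIRECT:
            # Whole-subfield match: replace one $z with the indirect sequence.
            result.extend(("z", part) for part in DIRECT_TO_INDIRECT[text])
        else:
            result.append((code, text))
    return result
```

A heading that already carries two $z subfields, such as $z France $z Paris, passes through unchanged.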


Chronological conversion

MARS 2.0 uses a table to convert chronological headings $y to their correct form. Corrections are made to spelling and punctuation as well as to format:

Subdivision | Changes to | In Field / Subfield
$y Twentieth century | $y 20th century | LC 6XX fields
$z 20th century | $y 20th century | LC 6XX fields
$y 20th centry | $y 20th century | LC 6XX fields
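
Unlike the text-only corrections, a chronological conversion can change the subfield code as well as the text. A minimal sketch, assuming a table keyed by the full (code, text) pair; CHRONOLOGICAL and fix_chronological are hypothetical names and the entries are only the three examples above:

```python
# Hypothetical table: (code, text) as found -> (code, text) as corrected.
CHRONOLOGICAL = {
    ("y", "Twentieth century"): ("y", "20th century"),
    ("z", "20th century"): ("y", "20th century"),
    ("y", "20th centry"): ("y", "20th century"),
}


def fix_chronological(subfields):
    """Replace each (code, text) pair found in the table; leave all others as they are."""
    return [CHRONOLOGICAL.get(pair, pair) for pair in subfields]
```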


Delete obsolete subdivisions

MARS 2.0 uses a subfield deletion table to eliminate canceled subfields from LC bibliographic headings, deleting subfields only when the invalid form is the entire text of the subfield:

Deleted | Field / Subfield
Addresses, essays, lectures | 6XX $x
Addresses, sermons, etc. | 6XX $x
Collected works | 6XX $x


The subfield deletion table also includes common misspellings and typographical errors:

Error | Field / Subfield
Adresses, essays, lectures | 6XX $x
Addressses, essays, lectures | 6XX $x
Collected work | 6XX $x


Retain selected subdivisions

The subfield deletion table includes a section that prevents subfield conversions and deletions in headings meeting specific criteria. For example:

 
 — Yearbooks
 
    changes to:
 — Periodicals
    except when  — Yearbooks is part of the subfield pair:
    — Students — Yearbooks
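
One way to picture how deletions, conversions, and the retain exception interact is the sketch below. It assumes the exception is expressed as a contiguous pair of $x subfields, as in the Students / Yearbooks example; DELETE, CONVERT, RETAIN_PAIRS, and clean_subdivisions are hypothetical names, not MARS 2.0 identifiers.

```python
# Hypothetical tables built from the examples above.
DELETE = {"Addresses, essays, lectures", "Addresses, sermons, etc.", "Collected works"}
CONVERT = {"Yearbooks": "Periodicals"}
# Contiguous $x pairs whose second member must be left untouched.
RETAIN_PAIRS = {("Students", "Yearbooks")}


def clean_subdivisions(subdivisions):
    """Apply deletions and conversions to a list of $x texts, honoring retained pairs."""
    result = []
    for i, text in enumerate(subdivisions):
        previous = subdivisions[i - 1] if i > 0 else None
        if (previous, text) in RETAIN_PAIRS:
            result.append(text)  # protected by the preceding subfield
        elif text in DELETE:
            continue  # canceled subdivision: drop it
        else:
            result.append(CONVERT.get(text, text))
    return result
```

The retain check runs first, so a protected pair is never touched by the later deletion or conversion steps.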

Correct spacing, capitalization, and punctuation

Most errors in spacing, capitalization and punctuation are corrected as an integral part of the authority cleanup and authority matching processes. Routines are also run to correct spacing and punctuation on the following fields:

 
 1XX, 240, 243, 245, 260, 4XX, 6XX, 7XX, 8XX

These processes eliminate any excess spaces in each field, ensure each field has the correct punctuation within and between subfields, and ensure each field has ending punctuation.

 
 original headings:
 100  10 $a Black, Adam, $d 1974-   .$t Crested geckos
 111  20 $a IEEE 1394 (FireWire) Workshop $d (2001, $c Berlin, Germany)
   
 updated headings:
 100  10 $a Black, Adam, $d 1974- $t Crested geckos
 111  20 $a IEEE 1394 (FireWire) Workshop $d (2001 :$c Berlin, Germany)

Authority Matching Overview

While Authority Cleanup improves authority-controlled headings using proprietary MARS 2.0 correction tables, authority matching compares each authority-controlled heading in your bibliographic records against authority record headings from any of a number of national and other authority files.

National authority files

Abbrev. | National Authority File | Updates | Size
NAF | Library of Congress Names | Weekly | 8,212,000
SAF | Library of Congress Subjects | Weekly | 8,611,000
CHILD | Library of Congress Annotated Card Program Subjects | Weekly | 1,000
LCGFT | Library of Congress Genre Form Terms | Weekly | 2,000
MESH | National Library of Medicine | Annual | 616,000
NLC-N | Library and Archives Canada Names | Frozen | 653,000
NLC-S | Library and Archives Canada Subjects | Frozen | 659,000
AAT | Art & Architecture Thesaurus | Frozen | 35,000
RBMS | Rare Books and Manuscripts Section Vocabularies | Frozen | 1,600
TGM | Thesaurus for Graphic Materials | Frozen | 7,900
GSAFD | Guidelines on Subject Access to Individual Works of Fiction, Drama, etc | Frozen | 160
FAST | Faceted Application of Subject Terminology | Quarterly | 1,700,000
NASA | NASA Thesaurus | TBD | 18,336
OLACVGGT | OLAC Video Game Genre Vocabulary | Frozen | 66
EMBNE | National Library of Spain | Frozen | 4,085,978
QLSP | Queens Library Spanish Language Subject Headings | Frozen | 11,745
ERIC | Education Resources Information Center (ERIC) Thesaurus | TBD | 4,539
HOMOIT | Homosaurus | TBD | 1,626

NOTE: As of September 2018, the Canadiana Authorities product from Library and Archives Canada has been discontinued. Library and Archives Canada is joining NACO and authority records are expected to be distributed via LC.

Goals of authority matching

Authority matching uses the headings in authority records to update or correct the bibliographic headings so they conform to current standards.

Authority matching is also the basis for providing full authority records for your local system. The goals of authority matching are to:

  • Update invalid headings to valid forms based on cross-references found in authority records (convert to the established form of heading)
  • Modify headings that have incorrect spacing, punctuation, indicators, or subfield codes to the correct form based on matches found
  • Update invalid higher levels of a heading to their valid forms, based on cross-references found in authority records
  • Distribute matched authority records to your institution
  • Identify headings requiring more attention by your staff, through the use of MARS 2.0 reports

Fields under authority control

MARS 2.0 corrects and updates the full range of authority controlled headings. The following bibliographic headings / fields are included in MARS 2.0 authority control processing:

Name, Title, and Series Authority Controlled Headings
Personal Names | 100, 700
Corporate Names | 110, 710
Conference Names | 111, 711
Uniform Titles | 130, 240, 730
Uniform Titles in a $t | 600, 610, 611, 700, 710, 711
Series | 400, 410, 411, 440, 800, 810, 811, 830

Subject Authority Controlled Headings
Personal Names | 600
Corporate Names | 610
Conference Names | 611
Uniform Titles | 630
Topical | 650
Geographic | 651
Genre | 655

Subfields disregarded

A number of MARC subfields are disregarded during MARS 2.0 authority matching. In the following headings, the volume designations in fields 810 and 440 $v, the heading linkage information in field 130 $6, and the ISSN in field 440 $x, are all examples of subfield information which is not under authority control:

 
 810   2 $a John Bartholomew and Son. $t Bartholomew world travel series ;$v 10.
 130   0 $6 880-01 $a ”Hsuuan lai his kan” his lieh.
 440   4 $a Romanica Gothoburgensia ;$v 12, 16 $x 0080-3863

Subfields matched or ignored

The table below shows the subfields that are included in the MARS 2.0 authority matching process:

MARC Field | Subfields Retained During Matching | Subfields Ignored During Matching
100, 400, 700, 800 | a b c d f g h k l m n o p q r s t y z | e u v w x 2 3 4 5 6
110, 410, 710, 810 | a b c d f g h k l m n o p r s t y z | e u v w x 2 3 4 5 6
111, 411, 711, 811 | a b c d e f g h k l m n o p q r s t u y z | v w x 2 3 4 5 6
130, 830 | a d f g h k l m n o p r s t x y z | v w 2 3 5 6
730 | a d f g h k l m n o p r s t y z | v w x 2 3 5 6
440 | a n p | v w x 6
600 | a b c d f g h k l m n o p q r s t v x y z | e u w 2 3 4 5 6
610 | a b c d f g h k l m n o p r s t v x y z | e u w 2 3 4 5 6
611 | a b c d e f g h k l m n o p q r s t u v x y z | 2 3 4 5 6
630 | a d f g h k l m n o p r s t v x y z | w 2 3 5 6
650 | a b c d v x y z | e w 2 3 6
651 | a b v x y z | w 2 3 6
655 | a b c v x y z | e w 2 3 6
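
A minimal sketch of how such a table could drive the selection of subfields before matching. Only three rows are reproduced, a heading is modeled as a list of (code, text) pairs, and RETAINED and subfields_for_matching are hypothetical names chosen for illustration.

```python
# Retained subfield codes per field tag; only three rows of the table are reproduced here.
RETAINED = {
    "440": set("anp"),
    "650": set("abcdvxyz"),
    "651": set("abvxyz"),
}


def subfields_for_matching(tag, subfields):
    """Keep only the (code, text) pairs whose code is retained for this field tag."""
    retained = RETAINED[tag]
    return [(code, text) for code, text in subfields if code in retained]
```

Applied to the 440 example above, the volume designation in $v and the ISSN in $x are dropped and only $a Romanica Gothoburgensia takes part in matching.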

Normalization

Headings from both your bibliographic records and the MARS 2.0 national authority files are normalized before they are compared for matching. MARS 2.0 uses the NACO normalization standard. During normalization:

  • Alphabetic characters are converted to uppercase
  • The first comma in $a is retained for personal name headings
  • All other punctuation is removed
  • Certain diacritics and hyphens are retained for JACKPHY normalization
  • All other diacritics are removed
  • Special characters are replaced by an alphabetic equivalent
  • Subfield codes are removed
  • Subfield delimiters are retained for all but the first subfield

 bib record heading:
 $a Architecture $z Brazil $x S'ao Paulo (State)
 
 normalized heading:
 ARCHITECTURE $ BRAZIL $ SAO PAULO STATE

Notice the subfield codes and the diacritic in S'ao Paulo have been discarded when constructing the normalized form of the heading.

Because subfield codes are ignored during the authority matching process, invalid subfield codes do not affect the matching and may be corrected during the process.

For this example, the normalized form of the established heading 1XX in the authority record is:

 authority heading:
 $a Architecture $z Brazil $z S~ao Paulo (State)
 
 normalized heading:
 ARCHITECTURE $ BRAZIL $ SAO PAULO STATE

Since the normalized forms of the bibliographic and authority headings are the same, the established form in the authority record replaces the form in the bibliographic record.

The bibliographic heading then contains the correct diacritic ~ instead of ' and the subfield code for S'ao Paulo has been corrected to $z.
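
The comparison described above can be approximated in a few lines of Python. This is a simplified sketch, not the NACO normalization standard itself: it omits the first-comma rule for personal names, the JACKPHY exceptions, and special-character replacement. Diacritics are represented as Unicode characters rather than the ' and ~ notation used in the example, and normalize_heading and match_and_replace are hypothetical names.

```python
import re
import unicodedata


def normalize_heading(subfields):
    """Build a simplified normalized form from a list of (code, text) pairs."""
    parts = []
    for _code, text in subfields:  # subfield codes are dropped
        decomposed = unicodedata.normalize("NFD", text)
        no_marks = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
        no_punct = re.sub(r"[^\w\s]", " ", no_marks)  # punctuation becomes a space
        parts.append(" ".join(no_punct.upper().split()))
    return " $ ".join(parts)  # a delimiter is kept between subfields


def match_and_replace(bib, auth):
    """Return the authority form when the normalized forms agree, else the bib form."""
    return auth if normalize_heading(bib) == normalize_heading(auth) else bib


# Bib heading with the wrong diacritic and the wrong subfield code ($x).
bib = [("a", "Architecture"), ("z", "Brazil"), ("x", "Sáo Paulo (State)")]
# Established heading from the authority record.
auth = [("a", "Architecture"), ("z", "Brazil"), ("z", "São Paulo (State)")]
```

Both headings normalize to the same string, so match_and_replace(bib, auth) returns the authority form, carrying the corrected diacritic and the $z subfield code with it.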

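Topics

  • Step 3.1 - Generic Name Headings
  • Step 3.2 - Tag Flipping
  • Step 3.3 - Partial Matches
  • Step 3.4 - Split Headings
  • Step 3.5 - Series Processing
  • Step 3.6 - Subdivision Updates
  • Step 3.7 - Childrens Matching
  • Step 3.8 - MESH Matching
  • Step 3.9 - Canadian Matching
  • Step 3.10 - Local Bibliographic Subject Matching
  • Step 3.11 - Genre Form Matching
  • Step 3.12 - Local Fields in Authority Records
  • Step 3.13 - Local Authority Master
  • Step 3.15 - LCSH to FAST Conversion
  • Step 3.16 - JACKPHY Vernacular
  • Step 3.17 - Pseudonyms
  • Step 3.18 - Deblinding XRefs

The rest of Step 3 details the matching options for your bibliographic and authority records. As with each step of this profile, these options are suggestions, and each one can be customized according to your preference.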
