Difference between revisions of "Dedupe 2.0"

From AC Wiki
Jump to: navigation, search
(Created page with "==LINKS== <center><font size="4">2.1 - 2.2 - 2.3 - 2.4 - 2.5 - 2.6 - 2.7 ...")
 
Line 1: Line 1:
==LINKS==
+
==Dedupe 2.0: Group 1==
 +
Section 2.0 of the dedupe profile will guide you through the verification and parameters used for hitting on the 010/020/022 fields.
 +
===Group 1 Hit Fields===
 +
Grouping allows the user to have different parameters for different potential matches. Since numeric fields such as the LCCN, ISBN, and ISSN are fairly reliable in most cases, they are grouped together and will have the same verification parameters.
 +
 
 +
====Library of Congress Control Number (LCCN/010)====
 +
The Library of Congress control number assigned to a catalogued item is recorded in the 010 tag. This number is used to  to distinguish each record from every other record in the database Library of Congress database.
 +
 
 +
The LCCN has three parts: the prefix, a year (represented by two or four digits) and a serial number (six digits) followed by another space in the case of pre-2001 LCCNs. Suffixes and revision dates following some printed LCCNs may or may-not be keyed into MARC record.
 +
 
 +
The following subfields are valid in the 010 tag:
 +
*'''a''' - LC control number
 +
*'''b''' - NUCMC control number -- This subfield is used only in archival/manuscripts format
 +
*'''z''' - Cancelled/invalid LC control number
 +
 
 +
  010 $a___91001938_
 +
  010 $a__2001012884
 +
  010 $a___08000123_$z___80000123_
 +
 
 +
====International Standard Book Number (ISBN/020)====
 +
This field records the International Standard Book Number(s) assigned to a catalogued item. Each valid ISBN is entered in a separate 020 tag; two or more invalid or cancelled ISBNs may be recorded in a single 020 tag.
 +
 
 +
The following subfields are valid in the 020 tag:
 +
*'''a''' - International Standard Book Number
 +
*'''c''' - Terms of availability
 +
*'''z''' - Cancelled/invalid ISBN
 +
 
 +
Valid ISBNs are always ten or thirteen digits long; all ISBNs are assumed valid unless they have too many or too few digits, or unless a shelflist card specifically identifies an ISBN as cancelled or invalid:
 +
  020 $a0049812187
 +
  020 $a9780049853217
 +
 
 +
The only letter that is ever part of an ISBN is X (roman numeral 10); it must always be capitalized:
 +
  020 $a012817409X
 +
 
 +
Subfield a may contain qualifying information (publisher, binding, format, volume numbers). This information is usually entered within parentheses; separate pieces of information with space-colon-space:
 +
  020 $a001281947X (pbk.)
 +
  020 $a0018942113 (Bally Bros. : pbk.)
 +
  020 $a0137183911 (large print)
 +
 
 +
Prices appearing after ISBNs are catalogued in subfield c:
 +
  020 $a0174620684 :$c$21.95
 +
  020 $a0049812187 (pbk.) :$c$17.40
 +
 
 +
====International Standard Serial Number (ISSN/022)====
 +
An International Standard Serial Number (ISSN) is an identification number assigned to a serial (the entire serial, not just a particular issue).  The ISSN is similar in function to the ISBN assigned to books.
 +
The 022 tag is repeated whenever a serial has two or more valid ISSNs.  This sometimes happens when a serial changes its title and a new ISSN is assigned;  when a record is created for the new title, both the new and the old ISSNs (both still valid) are entered in the new record.
 +
 
 +
The following subfields are valid in the 022 tag:
 +
*'''a''' - International Standard Serial Number
 +
*'''l''' - ISSN-L
 +
*'''m''' - Cancelled ISSN-L
 +
*'''y''' - Incorrect ISSN
 +
*'''z''' - Cancelled ISSN
 +
 
 +
This field only contains digits except the last digit may be ''X'' (roman numeral 10):
 +
  022 $a1234-5678
 +
  022 $a9876-123X
 +
 
 +
Subfield l links together various media versions of a continuing resource:
 +
  022 $a1234-5678$▼l1234-1231
 +
 
 +
 
 +
==links==
 
<center><font size="4">[[Dedupe 2.1|2.1]] - [[Dedupe 2.2|2.2]] - [[Dedupe 2.3|2.3]] - [[Dedupe 2.4|2.4]] - [[Dedupe 2.5|2.5]] - [[Dedupe 2.6|2.6]] - [[Dedupe 2.7|2.7]] - [[Dedupe 2.8|2.8]] - [[Dedupe 2.9|2.9]] - [[Dedupe 2.10|2.10]] - [[Dedupe 2.11|2.11]] - [[Dedupe 2.12|2.12]]
 
<center><font size="4">[[Dedupe 2.1|2.1]] - [[Dedupe 2.2|2.2]] - [[Dedupe 2.3|2.3]] - [[Dedupe 2.4|2.4]] - [[Dedupe 2.5|2.5]] - [[Dedupe 2.6|2.6]] - [[Dedupe 2.7|2.7]] - [[Dedupe 2.8|2.8]] - [[Dedupe 2.9|2.9]] - [[Dedupe 2.10|2.10]] - [[Dedupe 2.11|2.11]] - [[Dedupe 2.12|2.12]]
 
<hr>
 
<hr>
 
[[Dedupe 1.0|1.0]] - [[Dedupe 2.0|2.0]] - [[Dedupe 3.0|3.0]] - [[Dedupe 4.0|4.0]] - [[Dedupe 5.0|5.0]] - [[Dedupe 6.0|6.0]]</font></center>
 
[[Dedupe 1.0|1.0]] - [[Dedupe 2.0|2.0]] - [[Dedupe 3.0|3.0]] - [[Dedupe 4.0|4.0]] - [[Dedupe 5.0|5.0]] - [[Dedupe 6.0|6.0]]</font></center>
 
[[category:Profile Guide]]
 
[[category:Profile Guide]]

Revision as of 13:08, 29 March 2013

Dedupe 2.0: Group 1

Section 2.0 of the dedupe profile will guide you through the verification and parameters used for hitting on the 010/020/022 fields.

Group 1 Hit Fields

Grouping allows the user to have different parameters for different potential matches. Since numeric fields such as the LCCN, ISBN, and ISSN are fairly reliable in most cases, they are grouped together and will have the same verification parameters.

Library of Congress Control Number (LCCN/010)

The Library of Congress control number assigned to a catalogued item is recorded in the 010 tag. This number is used to to distinguish each record from every other record in the database Library of Congress database.

The LCCN has three parts: the prefix, a year (represented by two or four digits) and a serial number (six digits) followed by another space in the case of pre-2001 LCCNs. Suffixes and revision dates following some printed LCCNs may or may-not be keyed into MARC record.

The following subfields are valid in the 010 tag:

  • a - LC control number
  • b - NUCMC control number -- This subfield is used only in archival/manuscripts format
  • z - Cancelled/invalid LC control number
 010 $a___91001938_
 010 $a__2001012884
 010 $a___08000123_$z___80000123_

International Standard Book Number (ISBN/020)

This field records the International Standard Book Number(s) assigned to a catalogued item. Each valid ISBN is entered in a separate 020 tag; two or more invalid or cancelled ISBNs may be recorded in a single 020 tag.

The following subfields are valid in the 020 tag:

  • a - International Standard Book Number
  • c - Terms of availability
  • z - Cancelled/invalid ISBN

Valid ISBNs are always ten or thirteen digits long; all ISBNs are assumed valid unless they have too many or too few digits, or unless a shelflist card specifically identifies an ISBN as cancelled or invalid:

 020 $a0049812187
 020 $a9780049853217

The only letter that is ever part of an ISBN is X (roman numeral 10); it must always be capitalized:

 020 $a012817409X

Subfield a may contain qualifying information (publisher, binding, format, volume numbers). This information is usually entered within parentheses; separate pieces of information with space-colon-space:

 020 $a001281947X (pbk.)
 020 $a0018942113 (Bally Bros. : pbk.)
 020 $a0137183911 (large print)

Prices appearing after ISBNs are catalogued in subfield c:

 020 $a0174620684 :$c$21.95
 020 $a0049812187 (pbk.) :$c$17.40

International Standard Serial Number (ISSN/022)

An International Standard Serial Number (ISSN) is an identification number assigned to a serial (the entire serial, not just a particular issue). The ISSN is similar in function to the ISBN assigned to books. The 022 tag is repeated whenever a serial has two or more valid ISSNs. This sometimes happens when a serial changes its title and a new ISSN is assigned; when a record is created for the new title, both the new and the old ISSNs (both still valid) are entered in the new record.

The following subfields are valid in the 022 tag:

  • a - International Standard Serial Number
  • l - ISSN-L
  • m - Cancelled ISSN-L
  • y - Incorrect ISSN
  • z - Cancelled ISSN

This field only contains digits except the last digit may be X (roman numeral 10):

 022 $a1234-5678
 022 $a9876-123X

Subfield l links together various media versions of a continuing resource:

 022 $a1234-5678$▼l1234-1231


links

2.1 - 2.2 - 2.3 - 2.4 - 2.5 - 2.6 - 2.7 - 2.8 - 2.9 - 2.10 - 2.11 - 2.12
1.0 - 2.0 - 3.0 - 4.0 - 5.0 - 6.0