Difference between revisions of "Dedupe 2.6"

From AC Wiki
(Length)
(links)
 
(12 intermediate revisions by 2 users not shown)
Line 1: Line 1:
 +
==Dedupe 2.6: Group 1 - 1XX $a - Main entry (Verify)==
 
[[Image:d2-6.png]]<br><br>
 
== Description ==
+
=== Description ===
 
When an author (personal, corporate, or meeting) or a uniform title is used as the main entry of the record, the MARC record contains a 1XX tag.
 
  
Line 12: Line 13:
 
Only '''subfield a''' will be used for this verification parameter. If you want to use other subfields, please put a note in the online dedupe profile section 2-13.
 
  
== Verify Method ==
+
=== Length ===
*FULL - Full compares the full verify string up to the verify length.
+
This pertains to the number of characters to be used in the verification for the 1XX field within the verify method chosen above. The max number of characters that can be used is 2048.
*PARTIAL - Partial truncates the compare strings to the length of the shortest string, then does a full compare. With "The fox in the hound" in one record and "The fox" in the other, both are truncated to "The fox" and compared.
+
====Full====
*WITHIN - Within searches each compare string, truncated at the verify length, against the full un-truncated string of the other field. "Cat" will find a potential match on "The cat in the hat."
+
Using Full verification with a length of <font color="red">5</font>:
 +
  <font size="3">
 +
    1---5----10
 +
  $a <font color="red">Johns</font>on, Mark</font>
 +
If the record had '''$a Johnson, Mark''', then '''Johns''' would be used as the verify string.
  
== Normalization ==
+
This would mean that it would verify against all main entries that had '''$a Johns''' regardless of what followed. So '''$a Johnson, Mark''' would return a positive verify with '''$a Johns, Jason'''. In this case choosing a longer length, or using the default ''all'' may be more desirable.
*NACO/CJK retains spaces and subfield delimiters.
+
*FULL is NACO normalization with all spaces and subfield delimiters removed.
+
 
+
== Length ==
+
This pertains to the number of characters to be used in the verification for the 1XX field within the verify method chosen above. The max number of characters that can be used is 2048.
+
  
Examples of ''length'' being used with the different verify methods:
+
====Partial====
  FULL verification with a length of 5: if the record had '''100$aJohnson, Mark''', then Johns would be used as the verify string.
+
Using Partial verification with a length of <font color="red">11</font>:
 +
  <font size="3">
 +
    1---5----11
 +
  $a <font color="red">Backstage L</font>ibrary Works</font>
 +
If the record had '''$a Backstage Library Works''', then '''Backstage L''' would be used as the verify string.
  
This would mean that it would verify against all main entries that had '''100$aJohns''' regardless of what followed. So '''100$aJohnson, Mark''' would return a positive verify with '''100$aJohns, Jason'''. In this case choosing a longer length, or using the default ''all'' may be more desirable.
+
Since the PARTIAL method was chosen, this would match against '''$a Backstage''' or '''$a Back'''.  But it would not match against '''$a Backstage Travel'''.
 +
====Within====
 +
Using Within verification with a length of <font color="red">5</font>:
 +
  <font size="3">
 +
    1---5 1---5 1---5
 +
  $a <font color="red">Card,</font> <font color="blue">Orson</font> <font color="green">Scott</font></font>
  
  PARTIAL verification with a length of 11 being used:
+
If the record had '''$a Card, Orson Scott''', then any 5-character segment could be used as the verify string: '''Orson''', '''Scott''', or '''Card''' (counting its trailing comma as the fifth character).
  If record had, '''110$aBackstage Library Works''', then '''Backstage L''' would be used as the verify string.
+
  
Since the PARTIAL method was chosen, this would match against '''110$aBackstage''' or '''100$aBack'''.  But it would not match against '''1XX$aBackstage Travel'''.
+
=== Words ===
 +
This pertains to the number of words to be used in the verification for the 1XX field within the verify method chosen above.  
  
  WITHIN verification with length of 5 being used:
+
== Default==
  If the record had '''100$aCard, Orson Scott''', then any 5-character segment could be used as the verify string: Orson, Scott, or Card (counting its trailing comma as the fifth character).
+
{| border="0" cellspacing="0" cellpadding="5" align="left" style="border-collapse:collapse;"
 +
! style="background:lightgray" align="left" colspan="2" | Group 1 (010, 020, 022)
 +
|- style="background:#CCFFFF; font-size: 110%;"
 +
| || 1XX $a main entry is not used as a default verification point
 +
|}
 +
<div style=clear:both></div><br>
  
 
==links==
 
<center><font size="4">[[Dedupe_2.1|2.1]] - [[Dedupe_2.2|2.2]] - [[Dedupe_2.3|2.3]] - [[Dedupe_2.4|2.4]] - [[Dedupe_2.5|2.5]] - [[Dedupe_2.6|2.6]] - [[Dedupe_2.7|2.7]] - [[Dedupe_2.8|2.8]] - [[Dedupe_2.9|2.9]] - [[Dedupe_2.10|2.10]] - [[Dedupe_2.11|2.11]] - [[Dedupe_2.12|2.12]]
+
<center><font size="4">[[Dedupe_2.1|2.1]] - [[Dedupe_2.2|2.2]] - [[Dedupe_2.3|2.3]] - [[Dedupe_2.4|2.4]] - [[Dedupe_2.5|2.5]] - [[Dedupe_2.6|2.6]] - [[Dedupe_2.7|2.7]] - [[Dedupe_2.8|2.8]] - [[Dedupe_2.9|2.9]] - [[Dedupe_2.10|2.10]] - [[Dedupe_2.11|2.11]] - [[Dedupe_2.12|2.12]] - [[Dedupe_2.13|2.13]]
 
<hr>
 
 
[[Dedupe_1.0|1.0]] - [[Dedupe_2.0|2.0]] - [[Dedupe_3.0|3.0]] - [[Dedupe_4.0|4.0]] - [[Dedupe_5.0|5.0]] - [[Dedupe_6.0|6.0]]</font></center>
 
[[Dedupe_1.0|1.0]] - [[Dedupe_2.0|2.0]] - [[Dedupe_3.0|3.0]] - [[Dedupe_4.0|4.0]] - [[Dedupe_5.0|5.0]] - [[Dedupe_6.0|6.0]]</font></center>
 
[[category:Dedupe Profile Guide]]
 

Latest revision as of 11:04, 2 April 2013

Dedupe 2.6: Group 1 - 1XX $a - Main entry (Verify)


Description

When an author (personal, corporate, or meeting) or a uniform title is used as the main entry of the record, the MARC record contains a 1XX tag.

  • 100 Main entry -- Personal name
  • 110 Main entry -- Corporate name
  • 111 Main entry -- Meeting name
  • 130 Main entry -- Uniform title

A MARC record may have only one 1XX tag, or no 1XX tag at all (in the case of a title main entry).

Only subfield a will be used for this verification parameter. If you want to use other subfields, please put a note in the online dedupe profile section 2-13.
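To make the one-or-none rule and the subfield a restriction concrete, here is a minimal sketch in Python. It assumes a record is held as a plain list of (tag, subfields) pairs; that layout and the name `main_entry_subfield_a` are hypothetical and are not part of the dedupe software.

```python
# Hypothetical record layout: a list of (tag, {subfield_code: value}) pairs.
MAIN_ENTRY_TAGS = {
    "100": "Personal name",
    "110": "Corporate name",
    "111": "Meeting name",
    "130": "Uniform title",
}

def main_entry_subfield_a(fields):
    """Return (tag, $a value) for the record's 1XX field, or None if absent."""
    matches = [(tag, subs) for tag, subs in fields if tag in MAIN_ENTRY_TAGS]
    if not matches:
        return None  # title main entry: the record has no 1XX tag
    if len(matches) > 1:
        raise ValueError("a MARC record may have only one 1XX tag")
    tag, subs = matches[0]
    return tag, subs.get("a")  # only subfield a is used for verification

record = [
    ("100", {"a": "Johnson, Mark"}),
    ("245", {"a": "An example title"}),
]
print(main_entry_subfield_a(record))  # ('100', 'Johnson, Mark')
```

A record with a title main entry simply yields `None`, so there is nothing to verify against for this parameter.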

Length

This pertains to the number of characters of the 1XX field to be used in the verification, within the verify method chosen above. The maximum number of characters that can be used is 2048.

Full

Using Full verification with a length of 5:

 
    1---5----10
 $a Johnson, Mark

If the record had $a Johnson, Mark, then Johns would be used as the verify string.

With this setting, the record would verify against every main entry whose $a begins with Johns, regardless of what follows. So $a Johnson, Mark would return a positive verify with $a Johns, Jason. In this case a longer length, or the default of all, may be more desirable.
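The effect of the length setting on a Full compare can be sketched as follows. This is an illustration only: the function name `full_verify` is hypothetical, normalization is ignored, and the actual dedupe software may behave differently in detail.

```python
MAX_LENGTH = 2048  # maximum number of characters, per this page

def full_verify(a, b, length=None):
    """Compare two $a strings on their first `length` characters.

    length=None stands in for the default of "all" (no truncation).
    """
    if length is not None:
        if not 1 <= length <= MAX_LENGTH:
            raise ValueError("length must be between 1 and 2048")
        a, b = a[:length], b[:length]
    return a == b

# With a length of 5, both strings reduce to "Johns" and verify.
print(full_verify("Johnson, Mark", "Johns, Jason", length=5))  # True
# With the default of all, the full strings differ.
print(full_verify("Johnson, Mark", "Johns, Jason"))            # False
```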

Partial

Using Partial verification with a length of 11:

 
    1---5----11
 $a Backstage Library Works

If the record had $a Backstage Library Works, then Backstage L would be used as the verify string.

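Since the PARTIAL method was chosen, both strings are truncated to the length of the shorter one before they are compared, so this would match against $a Backstage or $a Back. It would not match against $a Backstage Travel, because Backstage T differs from Backstage L at the 11th character.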
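A minimal sketch of a Partial compare, assuming it works as an earlier revision of this page described: truncate the compare strings to the shortest string, then do a full compare. The name `partial_verify` and the order of the two truncation steps are assumptions made for this example.

```python
def partial_verify(a, b, length):
    """Partial compare of two $a strings under a verify length."""
    # Step 1 (assumed order): cut both strings at the verify length.
    a, b = a[:length], b[:length]
    # Step 2: truncate both to the shorter string, then compare in full.
    shortest = min(len(a), len(b))
    return a[:shortest] == b[:shortest]

source = "Backstage Library Works"  # verify string at length 11: "Backstage L"
print(partial_verify(source, "Backstage", 11))         # True
print(partial_verify(source, "Back", 11))              # True
print(partial_verify(source, "Backstage Travel", 11))  # False
```

In the last call both strings are 11 characters long after the first step, so "Backstage L" is compared with "Backstage T" and the verify fails.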

Within

Using Within verification with a length of 5:

 
    1---5 1---5 1---5
 $a Card, Orson Scott

If the record had $a Card, Orson Scott, then any 5-character segment could be used as the verify string: Orson, Scott, or Card (counting its trailing comma as the fifth character).
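One way to picture a Within compare is as a substring search. The sketch below assumes the behaviour given in an earlier revision of this page (each compare string, truncated at the verify length, is searched for in the full string of the other field); the name `within_verify` is hypothetical and normalization is left out.

```python
def within_verify(a, b, length):
    """Within compare: look for each truncated string inside the other field."""
    if not a or not b:
        return False  # an empty string would otherwise match everything
    return a[:length] in b or b[:length] in a

source = "Card, Orson Scott"
for other in ("Orson", "Scott", "Card,", "Smith"):
    print(other, within_verify(source, other, 5))
```

Under these assumptions Orson, Scott, and Card, are each found within the source string, while Smith is not.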

Words

This pertains to the number of words to be used in the verification for the 1XX field within the verify method chosen above.
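The page does not spell out how words are counted. As a rough sketch only, the example below assumes that words are separated by whitespace and that the first N words form the verify string; both points, and the name `first_words`, are assumptions rather than documented behaviour.

```python
def first_words(value, words):
    """Return the first `words` whitespace-separated words of a $a string."""
    return " ".join(value.split()[:words])

print(first_words("Backstage Library Works", 2))  # Backstage Library
```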

Default

Group 1 (010, 020, 022)
1XX $a main entry is not used as a default verification point
