Difference between revisions of "Dedupe 2.6"

From AC Wiki
Jump to: navigation, search
(links)
 
(8 intermediate revisions by 2 users not shown)
Line 12: Line 12:
  
 
Only '''subfield a''' will be used for this verification parameter. If you want to use other subfields, please put a note in the online dedupe profile section 2-13.
 
Only '''subfield a''' will be used for this verification parameter. If you want to use other subfields, please put a note in the online dedupe profile section 2-13.
 
=== Verify Method ===
 
*FULL - Full compares the full verify string up to the verify length.
 
*PARTIAL - Partial truncates the compare strings to the shortest string, then does a full compare. "The fox in the hound" in one record, "The fox" on the other record : both truncated to "The fox" and compared.
 
*WITHIN - Withing searches each compare string truncated at verify length against the full un-truncated string of the other field. "Cat" will ind a potential match on "The cat in the hat."
 
 
=== Normalization ===
 
*NACO/CJK retains spaces and subfield delimiters.
 
*FULL is NACO normalization with all spaces and subfield delimiters removed.
 
  
 
=== Length ===
 
=== Length ===
 
This pertains to the number of characters to be used in the verification for the 1XX field within the verify method chosen above. The max number of characters that can be used is 2048.
 
This pertains to the number of characters to be used in the verification for the 1XX field within the verify method chosen above. The max number of characters that can be used is 2048.
 
+
====Full====
Examples of ''length'' being used with the different verify methods:
+
Using Full verification with a length of <font color="red">5</font>:
 
   <font size="3">
 
   <font size="3">
  FULL verification with length of 5 being used: If record had, '''100$aJohnson, Mark''', then Johns would be used as the verify string.</font>
+
    1---5----10
 +
  $a <font color="red">Johns</font>on, Mark</font>
 +
If the record had '''$a Johnson, Mark''', then '''Johns''' would be used as the verify string.
  
This would mean that it would verify against all main entries that had '''100$aJohns''' regardless of what followed. So '''100$Johnson, Mark''' would return a positive verify with '''100$aJohns, Jason'''. In this case choosing a longer length, or using the default ''all'' may be more desirable.
+
This would mean that it would verify against all main entries that had '''$a Johns''' regardless of what followed. So '''$a Johnson, Mark''' would return a positive verify with '''$a Johns, Jason'''. In this case choosing a longer length, or using the default ''all'' may be more desirable.'''
  
 +
====Partial====
 +
Using Partial verification with a length of <font color="red">11</font>
 
   <font size="3">
 
   <font size="3">
   PARTIAL verification with length of l0 being used:
+
    1---5----11
  If record had, '''110$aBackstage Library Works''', then '''Backstage L''' would be used as the verify string.</font>
+
   $a <font color="red">Backstage L</font>ibrary Works</font>
 
+
If the record had '''$a Backstage Library Works''', then '''Backstage L''' would be used as the verify string.
Since the PARTIAL method was chosen, this would match against '''110$aBackstage''' or '''100$aBack'''.  But it would not match against '''1XX$aBackstage Travel'''.
+
  
 +
Since the PARTIAL method was chosen, this would match against '''$a Backstage''' or '''$a Back'''.  But it would not match against '''$a Backstage Travel'''.
 +
====Within====
 +
Using Within verification with a length of <font color="red">5</font>:
 
   <font size="3">
 
   <font size="3">
  WITHIN verification with length of 5 being used:
+
    1---5 1---5 1---5
   If record had, '''100$aCard, Orson Scott''', then any combination of 5 characters could be used; Orson, Scott, or Card could be used.</font>
+
   $a <font color="red">Card,</font> <font color="blue">Orson</font> <font color="green">Scott</font></font>
 +
 
 +
If record had, '''$a Card, Orson Scott''', then any combination of 5 characters could be used; '''Orson''', '''Scott''', or '''Card''' could be used.
  
 
=== Words ===
 
=== Words ===
 
This pertains to the number of words to be used in the verification for the 1XX field within the verify method chosen above.  
 
This pertains to the number of words to be used in the verification for the 1XX field within the verify method chosen above.  
  
=== Defaults ===
+
== Default==
The default for verifying on the 1XX $a main entry within Hit Group 1 (010/020/022) is as follows:
+
{| border="0" cellspacing="0" cellpadding="5" align="left" style="border-collapse:collapse;"
 
+
! style="background:lightgray" align="left" colspan="2" | Group 1 (010, 020, 022)
*Do not use as a verification parameter
+
|- style="background:#CCFFFF; font-size: 110%;"
 +
| || 1XX $a main entry is not used as a default verification point
 +
|}
 +
<div style=clear:both></div><br>
  
 
==links==
 
==links==
<center><font size="4">[[Dedupe_2.1|2.1]] - [[Dedupe_2.2|2.2]] - [[Dedupe_2.3|2.3]] - [[Dedupe_2.4|2.4]] - [[Dedupe_2.5|2.5]] - [[Dedupe_2.6|2.6]] - [[Dedupe_2.7|2.7]] - [[Dedupe_2.8|2.8]] - [[Dedupe_2.9|2.9]] - [[Dedupe_2.10|2.10]] - [[Dedupe_2.11|2.11]] - [[Dedupe_2.12|2.12]]
+
<center><font size="4">[[Dedupe_2.1|2.1]] - [[Dedupe_2.2|2.2]] - [[Dedupe_2.3|2.3]] - [[Dedupe_2.4|2.4]] - [[Dedupe_2.5|2.5]] - [[Dedupe_2.6|2.6]] - [[Dedupe_2.7|2.7]] - [[Dedupe_2.8|2.8]] - [[Dedupe_2.9|2.9]] - [[Dedupe_2.10|2.10]] - [[Dedupe_2.11|2.11]] - [[Dedupe_2.12|2.12]] - [[Dedupe_2.13|2.13]]
 
<hr>
 
<hr>
 
[[Dedupe_1.0|1.0]] - [[Dedupe_2.0|2.0]] - [[Dedupe_3.0|3.0]] - [[Dedupe_4.0|4.0]] - [[Dedupe_5.0|5.0]] - [[Dedupe_6.0|6.0]]</font></center>
 
[[Dedupe_1.0|1.0]] - [[Dedupe_2.0|2.0]] - [[Dedupe_3.0|3.0]] - [[Dedupe_4.0|4.0]] - [[Dedupe_5.0|5.0]] - [[Dedupe_6.0|6.0]]</font></center>
 
[[category:Dedupe Profile Guide]]
 
[[category:Dedupe Profile Guide]]

Latest revision as of 10:04, 2 April 2013

Dedupe 2.6: Group 1 - 1XX $a - Main entry (Verify)

D2-6.png

Description

When an author (personal, corporate, or meeting) or a uniform title is used as the main entry of the record, the MARC record contains a 1XX tag.

  • 100 Main entry -- Personal name
  • 110 Main entry -- Corporate name
  • 111 Main entry -- Meeting name
  • 130 Main entry -- Uniform title

A MARC record may have only one 1XX tag, or no 1XX tag at all (in the case of a title main entry).

Only subfield a will be used for this verification parameter. If you want to use other subfields, please put a note in the online dedupe profile section 2-13.

Length

This pertains to the number of characters to be used in the verification for the 1XX field within the verify method chosen above. The max number of characters that can be used is 2048.

Full

Using Full verification with a length of 5:

 
    1---5----10
 $a Johnson, Mark

If the record had $a Johnson, Mark, then Johns would be used as the verify string.

This would mean that it would verify against all main entries that had $a Johns regardless of what followed. So $a Johnson, Mark would return a positive verify with $a Johns, Jason. In this case choosing a longer length, or using the default all may be more desirable.

Partial

Using Partial verification with a length of 11

 
    1---5----11
 $a Backstage Library Works

If the record had $a Backstage Library Works, then Backstage L would be used as the verify string.

Since the PARTIAL method was chosen, this would match against $a Backstage or $a Back. But it would not match against $a Backstage Travel.

Within

Using Within verification with a length of 5:

 
    1---5 1---5 1---5
 $a Card, Orson Scott

If record had, $a Card, Orson Scott, then any combination of 5 characters could be used; Orson, Scott, or Card could be used.

Words

This pertains to the number of words to be used in the verification for the 1XX field within the verify method chosen above.

Default

Group 1 (010, 020, 022)
1XX $a main entry is not used as a default verification point

links

2.1 - 2.2 - 2.3 - 2.4 - 2.5 - 2.6 - 2.7 - 2.8 - 2.9 - 2.10 - 2.11 - 2.12 - 2.13
1.0 - 2.0 - 3.0 - 4.0 - 5.0 - 6.0