DQM function 282 calls the IBM Global Name Recognition
component libraries to parse the name, add the appropriate cultures
to the name, and generate name variant hashes, which are added to
the incoming UMF record as enhancements to the <NAME> segment.
In product version 8.0 fix pack 2 and later, DQM
function 282 is part of the IBM Global Name Recognition Name Hasher.
It replaces the web application servlet version of the Name Hasher that
was available in previous product releases. The web application servlet used
DQM function 660 on the <NAME> segment.
For
example, applying DQM function 282 to the incoming UMF name
record of
<NAME>
<NAME_TYPE>M</NAME_TYPE>
<LAST_NAME>RASUL</LAST_NAME>
<FIRST_NAME>MOHAMMED</FIRST_NAME>
<MID_NAME></MID_NAME>
<NAME_GEN></NAME_GEN>
</NAME>
results in the following UMF name record:
<NAME>
<NAME_TYPE>M</NAME_TYPE>
<LAST_NAME>RASUL</LAST_NAME>
<FIRST_NAME>MOHAMMED</FIRST_NAME>
<MID_NAME></MID_NAME>
<NAME_GEN></NAME_GEN>
<CULTURE>SOUTHWESTASIAN</CULTURE>
<SN_CULTURE>SOUTHWESTASIAN</SN_CULTURE>
<GN_CULTURE>SOUTHWESTASIAN</GN_CULTURE>
<ATTRIBUTE>
<ATTR_TYPE>GNR_HASH</ATTR_TYPE>
<ATTR_VALUE>ABDLRSL|MOHMD</ATTR_VALUE>
</ATTRIBUTE>
<ATTRIBUTE>
<ATTR_TYPE>GNR_HASH</ATTR_TYPE>
<ATTR_VALUE>ABDLRSL|MUHMD</ATTR_VALUE>
</ATTRIBUTE>
</NAME>
The values and subtags that are created
by this DQM function are then added to the UMF record to enhance the
NAME data:
- <CULTURE>
- Indicates the overall culture assigned to the full name
- <SN_CULTURE>
- Indicates the culture assigned to the surname
- <GN_CULTURE>
- Indicates the culture assigned to the given name
- <Name_hash_variant_tag_name>
- Indicates the name variant hashes that are generated for the full
name. By default, the Name Hasher configuration uses <GNR_HASH>
as the value of the name hash variant UMF tag. However, you can
configure this value to use a different tag name.
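As an illustration of how a downstream consumer might read these enhancements, the following minimal Python sketch parses the enhanced <NAME> segment from the example above and collects the culture tags and the name variant hashes. The function name read_enhancements and its hash_attr_type argument are hypothetical and are not part of the product; the sketch assumes that the hashes appear as <ATTRIBUTE> entries whose <ATTR_TYPE> matches the configured value, as in the example record.

```python
import xml.etree.ElementTree as ET

# Enhanced NAME segment, copied from the example output above.
ENHANCED_NAME = """
<NAME>
  <NAME_TYPE>M</NAME_TYPE>
  <LAST_NAME>RASUL</LAST_NAME>
  <FIRST_NAME>MOHAMMED</FIRST_NAME>
  <CULTURE>SOUTHWESTASIAN</CULTURE>
  <SN_CULTURE>SOUTHWESTASIAN</SN_CULTURE>
  <GN_CULTURE>SOUTHWESTASIAN</GN_CULTURE>
  <ATTRIBUTE>
    <ATTR_TYPE>GNR_HASH</ATTR_TYPE>
    <ATTR_VALUE>ABDLRSL|MOHMD</ATTR_VALUE>
  </ATTRIBUTE>
  <ATTRIBUTE>
    <ATTR_TYPE>GNR_HASH</ATTR_TYPE>
    <ATTR_VALUE>ABDLRSL|MUHMD</ATTR_VALUE>
  </ATTRIBUTE>
</NAME>
"""


def read_enhancements(name_xml, hash_attr_type="GNR_HASH"):
    """Hypothetical helper: return (cultures, hashes) for one NAME segment."""
    name = ET.fromstring(name_xml.strip())
    # Culture tags added by the DQM function; None if a tag is absent.
    cultures = {
        tag: name.findtext(tag)
        for tag in ("CULTURE", "SN_CULTURE", "GN_CULTURE")
    }
    # Keep only the attributes whose type matches the configured hash type.
    hashes = [
        attr.findtext("ATTR_VALUE")
        for attr in name.findall("ATTRIBUTE")
        if attr.findtext("ATTR_TYPE") == hash_attr_type
    ]
    return cultures, hashes


cultures, hashes = read_enhancements(ENHANCED_NAME)
print(cultures)
print(hashes)
```

If the hash attribute type is configured to a value other than GNR_HASH, pass that value as hash_attr_type.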
If the Name Manager NAMESIFTER
system parameter is enabled, this DQM function creates the name variant
hashes differently for personal and organizational names:
- Personal names
- If the name is categorized as personal, name variant hashes are
created based on combining pairs of the variants for the given names
and surnames.
- Organizational names
- If the name is categorized as organizational, name variant hashes
are created by using pair-wise combinations of the words in the organization
name.
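The difference between the two strategies can be sketched in Python as follows. This is only an illustration of the pairing logic described above, not the Name Hasher implementation: the function names are hypothetical, the surname-first ordering is inferred from the example record, the organization words are invented sample data, and the real component also derives the variant forms themselves, which this sketch takes as given input.

```python
from itertools import combinations, product


def personal_hashes(surname_variants, given_variants):
    """Pair every surname variant with every given-name variant."""
    return [f"{sn}|{gn}" for sn, gn in product(surname_variants, given_variants)]


def organizational_hashes(org_words):
    """Build one two-part hash for each pair of words in the name."""
    return [f"{a}|{b}" for a, b in combinations(org_words, 2)]


# Variants taken from the example record above.
print(personal_hashes(["ABDLRSL"], ["MOHMD", "MUHMD"]))
# Hypothetical organization name split into words.
print(organizational_hashes(["ACME", "TRADING", "GROUP"]))
```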
Parameters
- Maximum Part Length
- Determines the maximum length of each part of the two-part name
hash. Each hash consists of two name parts, separated by a | symbol.
Attention: Changing this parameter after processing
UMF data can cause entity resolution to miss name matches. Consult
IBM Services or Support if you are considering changing this parameter
from the current setting or the default setting.
- When you install the product, the default value is set based on
whether your installation is new or an upgrade installation:
- 5
- Default value for new installations. Indicates that the maximum
length of each part of the two-part hash is five characters. For example,
a maximum part length of 5 alters the name hash of
ALXNDR|JOHNTHN
to
ALXND|JOHNT
- 0
- Default value for upgrade installations. Indicates that there
is no maximum part length.
- Internal Name Attribute Type
- Specifies the attribute types created for the name by the Name
Hasher.
- The default value is GNR_HASH.
- maxVariants
- Determines the maximum number of name variants that DQM function 282 generates.
- The valid range is 1 or greater, with no upper limit.
- The default value is 5.
Tip: If you used DQM function 660 before product version 8.0
fix pack 2, set this parameter to the same value that you used for
the maxVariants parameter of DQM function 660.
- variantScoreThreshold
- Specifies the scoring threshold to determine which name variants
are kept, and which name variants are filtered out.
- The valid range is 1 to 100.
- The default value is 10.
Tip: If you used DQM function 660 before product version 8.0
fix pack 2, set this parameter to the same value that you used for
the variantScoreThreshold parameter of DQM function
660.
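To make the effect of these parameters concrete, the following Python sketch models them under stated assumptions. The function names and the sample scores are hypothetical. The truncation follows the Maximum Part Length example above, with 0 meaning no limit. The selection step assumes that higher scores are better, that variants scoring at or above variantScoreThreshold are kept, and that the highest-scoring variants are retained when maxVariants is exceeded; the documentation does not state these details, so treat them as guesses rather than product behavior.

```python
def truncate_hash(name_hash, max_part_length):
    """Trim each part of a two-part hash; 0 means no maximum length."""
    if max_part_length == 0:
        return name_hash
    return "|".join(part[:max_part_length] for part in name_hash.split("|"))


def select_variants(scored_variants, max_variants=5, variant_score_threshold=10):
    """Keep variants at or above the threshold, best first, up to the cap.

    scored_variants is a list of (variant, score) pairs. Both the score
    direction and the tie handling are assumptions made for this sketch.
    """
    kept = [(v, s) for v, s in scored_variants if s >= variant_score_threshold]
    kept.sort(key=lambda pair: pair[1], reverse=True)
    return [v for v, _ in kept[:max_variants]]


print(truncate_hash("ALXNDR|JOHNTHN", 5))
print(truncate_hash("ALXNDR|JOHNTHN", 0))
# Hypothetical scores for three given-name variants.
print(select_variants([("MOHMD", 95), ("MHMD", 5), ("MUHMD", 80)]))
```

Because truncation changes the stored hash values, a change to the maximum part length after data is loaded would leave previously stored hashes and newly generated hashes in different forms, which is consistent with the Attention note above.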
Exception codes
Exception code | Description | Severity level
-2 | Indicates that the UMF in the incoming name is invalid, for example, when the incoming name contains both the <FULL_NAME> tag and the individual name piece <FIRST_NAME> and <LAST_NAME> tags. | 3
-1 | Indicates that the Name Hasher process returned an error when processing the name. Among the possible causes of this exception are an empty incoming name and an incoming conjoined name. | 3
0 | Indicates that no name hash variants were generated. | 0
1 | Indicates that name hash variants were generated. | 0
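As a rough illustration, the table above can be represented as a lookup, together with a pre-check for the one invalid-UMF condition that the description of exception code -2 names. The names EXCEPTION_CODES and has_conflicting_name_tags are hypothetical. The sketch assumes that a tag counts as present only when it contains non-empty text and that either name piece alongside <FULL_NAME> is a conflict; the product might apply different or additional rules.

```python
import xml.etree.ElementTree as ET

# Exception code -> (short description, severity level), from the table above.
EXCEPTION_CODES = {
    -2: ("Invalid UMF in the incoming name", 3),
    -1: ("Name Hasher returned an error", 3),
    0: ("No name hash variants were generated", 0),
    1: ("Name hash variants were generated", 0),
}


def has_conflicting_name_tags(name_xml):
    """Return True if FULL_NAME is populated together with a name piece."""
    name = ET.fromstring(name_xml.strip())

    def populated(tag):
        # Assumption: an empty tag such as <MID_NAME></MID_NAME> is ignored.
        return bool((name.findtext(tag) or "").strip())

    has_pieces = populated("FIRST_NAME") or populated("LAST_NAME")
    return populated("FULL_NAME") and has_pieces


conflicting = """
<NAME>
  <FULL_NAME>MOHAMMED RASUL</FULL_NAME>
  <LAST_NAME>RASUL</LAST_NAME>
  <FIRST_NAME>MOHAMMED</FIRST_NAME>
</NAME>
"""
pieces_only = """
<NAME>
  <LAST_NAME>RASUL</LAST_NAME>
  <FIRST_NAME>MOHAMMED</FIRST_NAME>
</NAME>
"""

print(has_conflicting_name_tags(conflicting))
print(has_conflicting_name_tags(pieces_only))
print(EXCEPTION_CODES[-2])
```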