IM InfoSphere Identity Insight, Version 8.0

DQM function 282: Name variant hashes

DQM function 282 calls the IBM Global Name Recognition component libraries to parse the name, add the appropriate cultures to the name, and generate name variant hashes, which are added to the incoming UMF record as enhancements to the <NAME> segment.

As of product version 8.0 fix pack 2 or later, the DQM function 282 is part of the IBM Global Name Recognition Name Hasher, replacing the web application servlet version of the Name Hasher available in previous product releases. The web application servlet used the DQM function 660 on the <NAME> segment.

replaces

For example, when you apply DQM function 282 to the incoming UMF name record of

<NAME>
   <NAME_TYPE>M</NAME_TYPE>
   <LAST_NAME>RASUL</LAST_NAME>
   <FIRST_NAME>MOHAMMED</FIRST_NAME>
   <MID_NAME></MID_NAME>
   <NAME_GEN></NAME_GEN>
</NAME>

results in the following UMF name record:

<NAME>
   <NAME_TYPE>M</NAME_TYPE>
   <LAST_NAME>RASUL</LAST_NAME>
   <FIRST_NAME>MOHAMMED</FIRST_NAME>
<MID_NAME></MID_NAME>
   <NAME_GEN></NAME_GEN>
   <CULTURE>SOUTHWESTASIAN</CULTURE>
   <SN_CULTURE>SOUTHWESTASIAN</SN_CULTURE>
   <GN_CULTURE>SOUTHWESTASIAN</GN_CULTURE>
   <ATTRIBUTE>
         <ATTR_TYPE>GNR_HASH</ATTR_TYPE>
         <ATTR_VALUE>ABDLRSL|MOHMD</ATTR_VALUE>
    </ATTRIBUTE>
   <ATTRIBUTE>
         <ATTR_TYPE>GNR_HASH</ATTR_TYPE>
         <ATTR_VALUE>ABDLRSL|MUHMD</ATTR_VALUE>
    </ATTRIBUTE>
</NAME>

The values and subtags that are created by this DQM function are then added to the UMF record to enhance the NAME data:
<CULTURE>
Indicates the overall culture assigned to the full name
<SN_CULTURE>
Indicates the culture assigned to the surname
<GN_CULTURE>
Indicates the culture assigned to the given name.
<Name_hash_variant_tag_name>
Indicates the name variant hashes that are generated for the full name. By default, the Name Hasher configuration uses <GNR_HASH> for as the value of the name hash variant UMF tag. However, you can configure this tag value to another tag name value.
If the Name Manager NAMESIFTER system parameter is enabled, this DQM function creates the name variant hashes differently for personal and organizational names:
Personal names
If the name is categorized as personal, name variant hashes are created based on combining pairs of the variants for the given names and surnames.
Organizational names
If the name is categorized as organizational, name variant hashes are created by using pair-wise combinations of the words in the organization name

Parameters

Maximum Part Length
Determines the maximum length of each part of the two-part name hash. Each hash consists of two name parts, separated by a | symbol.
Attention: Changing this parameter after processing UMF data can cause entity resolution to miss name matches. Consult IBM Services or Support if you are considering changing this parameter from the current setting or the default setting.
When you install the product, the default value is set based on whether your installation is new or an upgrade installation:
5
Default value for new installations. Indicates that the maximum length of each part of the two-part hash is five characters. For example, a maximum part length of 5 alters the name hash of

ALXNDR|JOHNTHN
to
ALXND|JOHNT

0
Default value for upgrade installations. Indicates that there is no maximum part length.
Internal Name Attribute Type
Specifies the attribute types created for the name by the Name Hasher.
The default value is GNR_HASH.
maxVariants
Determines the maximum number of name variants that DQM 282 generated.
The valid range is 1 to infinity.
The default value is 5.
Tip: If prior to product version 8.0 fix pack 2, you used DQM function 660, set the parameter to the same value used in the maxVariants parameter of DQM function 660.
variantScoreThreshold
Specifies the scoring threshold to determine which name variants are kept, and which name variants are filtered out.
The valid range is 1 to 100.
The default value is 10.
Tip: If prior to product version 8.0 fix pack 2, you used DQM function 660, set the parameter to the same value used in the variantScoreThreshold parameter of DQM function 660.

Exception codes

Exception code Description Severity level
-2 Indicates that the UMF in the incoming name is invalid. For example, if the incoming name contains both the <FULL_NAME> tag and the individual name piece <FIRST_NAME> and <LAST_NAME> tags. 3
-1 Indicates that the Name Hasher process returned an error when processing the name. Among the possible causes of this exception are an empty incoming name and an incoming conjoined name. 3
0 Indicates that no name hash variants were generated. 0
1 Indicates that name hash variants were generated. 0


Feedback

Last updated: 2011