Enabling the IBM Global Name Recognition Name Hasher feature
for UMF <NAME> segment data quality processing
can improve name parsing, culture classification, and name hash generation.
Before you begin
If you are enabling the Name Hasher on an existing installation
for the first time, contact IBM Services or Support for assistance.
All existing data from all data sources must be reloaded to prevent
the entity resolution of new data from failing against the existing
data in the entity database.
About this task
These instructions are summaries of the tasks that must be
completed to enable the Name Hasher. All steps are completed using
the Configuration Console. For step-by-step instructions, see the
topic for each task.
Procedure
- Enable the DQM function 282 to create name hashes. This function
turns on the Name Hasher functionality within the pipelines. If
you used the Name Hasher prior to product version 8.0 fix pack 2,
see the instructions for migrating to the upgraded
Name Hasher. You might want to reuse some of the parameters
used by DQM 282.
- Enable the DQM function 610, so that the Name Hasher can create composite
name hash attributes.
- Configure the Default w/ Name only candidate builder for enhanced name
hashing.
- Configure each data source for enhanced name hashing.
- Disable full name parsing in DQM function 252. The Name
Hasher creates name hash variants for all name parts, not just the
full name.
- Configure DQM rule 255 for enhanced name hashing. By completing
this step, you keep the name standardization capability of DQM 255,
but you disable its standard name hashing in favor of the enhanced name
hashing of the Name Hasher. You also ensure that the pipeline validation
check that verifies that DQM 255 is enabled does not fail and shut
down the pipelines.
- Enable the DQM function 260 for the UMF <NAME> segment. This DQM function
assigns name cultures to incoming name data. The Name Hasher requires the
name culture to apply multi-cultural expertise to enhanced name hashing.
Make sure that Name Manager is turned on. (Typically, Name Manager is on.)
If you enable DQM rule 260 and Name Manager is not turned on, DQM rule 260
fails, and the pipelines shut down.
- Set the system parameters for the Name Hasher. By completing
this step, you configure the necessary system parameters for the pipelines
used during enhanced name hashing.
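
The dependencies among the steps above can be summarized as a simple
checklist. The following Python sketch is illustrative only: the
PipelineConfig class, its field names, and the check function are
hypothetical and are not part of the product or the Configuration Console.
It assumes that you have recorded the relevant settings by hand, and it
encodes only the conditions stated in this topic (DQM 282, 610, 255, and
260 enabled, Name Manager on when DQM 260 is enabled, and full name parsing
disabled in DQM 252).

```python
from dataclasses import dataclass, field
from typing import List, Set


@dataclass
class PipelineConfig:
    """Hypothetical, hand-recorded snapshot of settings (not a product API)."""
    enabled_dqm: Set[int] = field(default_factory=set)
    name_manager_on: bool = True          # typically on, per this topic
    full_name_parsing_252: bool = True    # should be disabled for the Name Hasher


def check_name_hasher_prereqs(cfg: PipelineConfig) -> List[str]:
    """Return a list of problems; an empty list means the checklist passes."""
    problems = []
    # DQM functions that this topic says must be enabled.
    for rule in (282, 610, 255, 260):
        if rule not in cfg.enabled_dqm:
            problems.append(f"DQM {rule} is not enabled")
    # DQM 260 fails and the pipelines shut down if Name Manager is off.
    if 260 in cfg.enabled_dqm and not cfg.name_manager_on:
        problems.append("DQM 260 is enabled but Name Manager is off")
    # The Name Hasher replaces full name parsing in DQM 252.
    if cfg.full_name_parsing_252:
        problems.append("Full name parsing in DQM 252 is still enabled")
    return problems


if __name__ == "__main__":
    cfg = PipelineConfig(
        enabled_dqm={282, 610, 255, 260},
        name_manager_on=False,
        full_name_parsing_252=False,
    )
    for problem in check_name_hasher_prereqs(cfg):
        print(problem)
```

A checklist like this does not replace the pipeline validation that the
product performs. It only makes the ordering constraints explicit before
you restart the pipelines.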