The IBM Global Name Recognition Name Hasher feature enhances the name hygiene and candidate list building capabilities of IBM Relationship Resolution.
The IBM Global Name Recognition Name Hasher feature enables improved name parsing, culture classification, and name hash generation processes of IBM Relationship Resolution by leveraging IBM Global Name Recognition technology. The improved capabilities of the name hygiene process enable the IBM Relationship Resolution pipelines to build better candidate lists. The generation of more accurate and robust candidate lists directly improves entity resolution.
GNR Name Hasher is deployed as a servlet which is called by DQM function 660. The DQM 660 rule passes the UMF <NAME> segments to the IBM Global Name Recognition Name Hasher servlet, which then returns the enhanced UMF <NAME> segments.
Using the IBM Global Name Recognition Name Hasher feature with an IBM Relationship Resolution installation which includes the IBM Entity Analytics Solutions Name Manager add-on product provides the ability to classify names for culture and to accurately compare and score names on the candidate list in a culturally sensitive context.
Testing has shown that enabling the IBM Global Name Recognition Name Hasher feature significantly reduces performance while providing a benefit of an 11% reduction in false negatives and a 5% decrease in false positives.
Customers should not enable this feature on an existing IBM Relationship Resolution version 4.2 installation. If the IBM Global Name Recognition Name Hasher feature is enabled on an existing installation without reloading all existing data in the system from the data sources, entity resolution of new data will fail against the existing data in the system. The required installation method for customers wanting to enable the IBM Global Name Recognition Name Hasher feature is to perform a clean IBM Relationship Resolution version 4.2 installation, then install the IBM Relationship Resolution version 4.2 fix pack 1 or fix pack 2, then manually enable the IBM Global Name Recognition Name Hasher feature, and finally load all of the data into the system.
The IBM Global Name Recognition Name Hasher feature is not enabled by default. You use the configuration console to enable the IBM Global Name Recognition Name Hasher feature.