During entity resolution, the pipelines resolve identities into
entities. After the data values in the identity records have been cleansed,
standardized or enhanced, the pipeline uses sophisticated search algorithms
to compare the data values in the incoming identity record against existing
entities in the entity database to determine if they are the same entity.
Resolving entities involves these phases:
- Generating candidate lists
- The system uses the information on the incoming identity record to match
against entities already in the entity database to create a list of potential
entity resolution candidates. Each candidate shares enough attribute values
to continue evaluating the candidate for entity resolution. You can configure
the criteria that the system uses to generate the candidate lists.
- Performing entity resolution
- After generating candidate lists, the system then applies the resolution
rules to each entity on the candidate list, using a scoring method that calculates
a resolution score to determine if the incoming identity and the existing
entity should be resolved. You can configure resolution rules and set the
thresholds for the resolution scores to determine how closely the attribute
values must be for the incoming identity and the candidate entity to be resolved
into one entity.