IM Relationship Resolution Information Center, Version 4.2

Pipelines

Pipelines are the component that perform name and address hygiene standardization, data quality management, and entity resolution. The pipelines also perform relationship resolution and generate alerts, based on the system configuration.

Pipelines perform three core processes:

Pipelines are hosted by pipeline nodes.

You can configure pipelines for parallel processing, so that one pipeline command spawns multiple parallel pipeline processing threads, which enables the system to concurrently process multiple data requests. This feature can help improve system performance, reduce data processing time, and mitigate hardware memory constraints.

The parallel pipeline processing feature is configured in two places:


Feedback

Last updated: 2009