This reference contains detailed information about
DQM rules, transports, pipelines, and the UMF specification.
Data Quality Management (DQM)
Data quality management (DQM) is the pipeline process that checks
the data for required values, valid data types, and valid codes. You can also
configure DQM to correct the data by providing default values, formatting
numbers and dates, and adding new codes.
Transports
Transports move data from one place to another – between
acquisition programs and pipelines, between pipelines and the entity
database, and even between pipelines and external systems.
Pipelines
Pipelines are the component that perform name and address hygiene
standardization, data quality management, and entity resolution. The pipelines
also perform relationship resolution and generate alerts, based on the system
configuration.
Default UMF Specification
Universal Message Format (UMF) is a standard markup language, based
on XML, for structuring data source files. Before data can be loaded or processed
into the entity database, it must be formatted in UMF and follow the UMF specification.
Expanded Service API
The Expanded Service API updates IBM® InfoSphere
Identity Insight Web services by providing an object-rich SOAP API
and corresponding UMF API of Web services operations. It also includes
pipeline load balancing functionality for IBM InfoSphere
Identity Insight Web services, which allows for pipeline redundancy
and enhanced performance.