Data Warehouse Center Administration Guide

Chapter 12. Maintaining the Data Warehouse Center

This chapter explains concepts and procedures that are associated with maintaining the Data Warehouse Center. This chapter covers the following topics:


Backing up the Data Warehouse Center

Back up your warehouse control database, as well as all your warehouse databases, at regular intervals.

To back up the warehouse control database, use the standard procedures for DB2 backup and recovery.

You need to stop the Data Warehouse Center before backing up the control database. To shut down the Data Warehouse Center:

  1. On the workstation that contains the warehouse server, click Settings --> Control Panel --> Services. The Services window opens.
  2. Select a warehouse service, and click Stop.
  3. Repeat this step for each warehouse service that is listed in the Services window.

    The warehouse logger and server are linked; if you stop the warehouse logger, the server also stops. However, if you stop the warehouse server, the logger does not stop.

  4. When all warehouse services are stopped, click Close.

To back up your warehouse databases, use the backup and recovery procedures provided by your database manager.


Expanding your warehouse

With the Data Warehouse Center, you can expand your network configuration as your warehouse grows. You can add new workstations with new warehouse agents and administrative interfaces, or delete agents and administrative interfaces from your existing configuration.

You can also create a warehouse control database. If you create a warehouse control database, you must run the initialization process again. For more information, see Initializing a second warehouse control database.

To add administrative interfaces or warehouse agents to your Data Warehouse Center configuration, follow the installation process for those components. To delete administrative interfaces or warehouse agents from your Data Warehouse Center configuration, remove those components.

For more information about installing and removing Data Warehouse Center components, see the DB2 Universal Database Quick Beginnings for your operating system and the DB2 Warehouse Manager Installation Guide.


Exporting and importing Data Warehouse Center metadata

You can use the Data Warehouse Center export and import capabilities to export and import object definitions within the following operating systems:

Export processes and import processes use a large amount of system resources. You might want to limit the use of other programs while you are exporting and importing object definitions.

Because the import and export formats are release-dependent, you cannot use exported files from a previous release to migrate from one release of the Data Warehouse Center to another. If you want to migrate the Data Warehouse Center, see the DB2 Universal Database Quick Beginnings for your operating system.

Alternatively, if you want to make a copy of your warehouse control database (such as when you want separate test and production systems), you can use the DB2 Universal Database export and load utilities to copy the data. However, both databases must have a user ID and password that matches the default warehouse user, so that users can log on to the Data Warehouse Center.

Exporting metadata to another Data Warehouse Center

When you export metadata to a tag language file, the Data Warehouse Center finds the objects that you want to export and produces tag language statements to represent the objects. It then places the tag language statements into files that you can import into another Data Warehouse Center.

Planning to export metadata

If you export a process, you might export a large volume of metadata. The following objects are exported with a process:

Exporting the metadata to a tag language file

To export a tag language file:

  1. From the Data Warehouse Center, right-click the Warehouse node, and click Export Metadata --> Interchange File.

    The Export Metadata window opens.

  2. Specify the tag language file to which you want to export the metadata. You can:
  3. In the Available objects list, select the objects that you want to export, and click >. The object moves to the Selected objects list. Repeat this step for all objects that you want to export.

    If you want to export all available objects in a tree, click >>. The objects move to the Selected objects list. If a node on the tree does not contain objects within it, it is not moved.

  4. If you do not want to export all definitions of warehouse sources, tables, and columns that are related to exported steps, clear the Export dependent source properties check box.
  5. If you do not want to export schedule information related to the processes that you are exporting (including steps and processes related through task flow and notification), clear the Include schedules check box.
  6. Click OK.

The Export Information window opens and displays a message that tells you whether the export process completed successfully. If errors occurred, the Export Metadata window remains open so that you can try the export procedure again. You can find more information about a particular error message in the IBM DB2 Universal Database Message Reference.

Several files can be created during a single export process. For example, when you export metadata definitions for BLOB data, multiple tag language files are created. The first file created in the export process has an extension of .tag. If multiple files are created, the file name that is generated for each supplementary file has the same name as the tag language file with a numeric extension.

For example, if the tag language file name you specified is e:\tag\steps.tag, the supplementary tag language files are named e:\tag\steps.1, e:\tag\steps.2, and so on. Only the file extension is used to identify the supplementary files within the base tag language file, so you can move the files to another directory. However, you should not rename the files. You must always keep the files in the same directory, otherwise you will not be able to import the files successfully.

Importing metadata

You can import object definitions for use in your Data Warehouse Center system. You might want to import sample data into a warehouse or you might want to import data if you are creating a prototype for a new warehouse.

Planning to import objects

When you import a tag language file, the metadata that it contains is stored in the warehouse control database. The Data Warehouse Center will accept a tag language file that is not generated by another Data Warehouse Center system. For example, you can import a tag language file that is generated by an ETI product.

Any object with an unknown OBJTYPE in a non-Data Warehouse Center tag language file will be ignored, and processing will continue. If an object is of a known OBJTYPE, but does not contain all the properties required by the Data Warehouse Center, an error will occur.

Initializing the control database

If you are using the import utility to establish a new Data Warehouse Center, you must initialize a new warehouse control database in the target system. After you complete this task, you can import as many tag language files as you want.

For more information about how to initialize a new control database, see Initializing a second warehouse control database.

Importing from one Data Warehouse Center system to another

After you complete the export process, and a tag language file is created, the file is ready to import to a target system.

Before you the import the file, consider the following issues:

Do not use the import function to migrate from a previous version of the Data Warehouse Center. You must always import and export tag language files using the same version of the Data Warehouse Center. For information about migrating from a previous version of the Data Warehouse Center, see the DB2 Universal Database Quick Beginnings for your operating system.

Importing the metadata into Data Warehouse Center

To import the metadata into a Data Warehouse Center:

  1. From the Data Warehouse Center, right-click on the Warehouse node, and click Import metadata.

    The Import Metadata window opens.

  2. In the File name field, type the name of the tag language file that you want to import. If you want to search for a file to import, click ... . The File window opens.
  3. Select the tag language file that you want to import, and click OK. The file name that you selected is displayed in the File name field of the Import Metadata window.
  4. Click OK. The metadata that is represented by the tag language file is imported into the warehouse control database.

After a successful import procedure, you receive a message with return code 0 or a warning return code, and the Import Metadata window closes. No changes are committed to the control database when an error occurs.

Import errors

If an error occurs during the import process, you receive an error message with an error return code and the Import Metadata window remains open.

Common errors include:

Check the log file to determine what object or relationship was being imported when the error occurred. If the object is not listed in the message, it should be the last object that is listed before the return code and error message information at the end of the log file. Fix the error, then import the object definitions again. If you cannot determine the error or you cannot fix it, contact IBM Software Support or the vendor who provided the tag language file.

Post-import considerations

After a successful import, you receive a message with a return code of 0, and the Import Metadata window closes.

An imported subject area, warehouse source, or warehouse target will not be displayed in the tree view unless you belong to a warehouse group that has access to the object. A warehouse group that is associated with exported warehouse sources or warehouse targets must be related to a process that uses those objects; some warehouse groups might not be exported. If the warehouse groups that you want your warehouse sources and warehouse targets to be related to are not imported, you need to add the warehouse groups to the Data Warehouse Center system. Metadata for individual users is not exported. You must add users to the Data Warehouse Center.

The Data Warehouse Center assigns development mode to all imported steps so that you can verify that the steps are complete and accurate before you promote them to test or production mode.

Before you use imported objects: Update database information for warehouse sources and warehouse targets (database name, user ID, and password), if necessary. If any new agent site definitions were imported, their passwords also need to be updated. If you imported any programs that have password type parameters, you must reset those values.

Add your user ID to a warehouse group that has access to the imported processes and warehouse targets.

Before you run imported steps:

  1. Ensure that all the warehouse source and warehouse target databases that are referred to by the imported steps are defined and configured in your Data Warehouse Center system. Verify all the definitions that you imported and change any parameters to match the new environment.

    After you import a tag language file that includes steps that include joins, you might notice the tables are not placed as you expected in the Process Model or Warehouse Schema Model. Some information cannot be imported from one system to another. You will need to reposition your steps and star schemas using the Process or Warehouse Schema Model window.

  2. Promote steps as necessary.

Importing tag language files again If you import an existing step again, it must be in development mode, or an error will occur. You must demote steps to development mode before importing them again.

Most tag language files can be imported more than once without errors. If a tag language file includes primary and foreign keys, the keys must be deleted before the file can be imported again.

If the names or order of columns in a table or file have changed (for example, a warehouse source or warehouse target table is redefined), delete the warehouse source table, file, or step definition (including the target table in the warehouse target) from the target system before you import the same table or step again. At a minimum, verify that the column or field information is imported as you expected.


Publishing Data Warehouse Center metadata

Publishing metadata is the process of transferring metadata from the Data Warehouse Center to the Information Catalog Manager. An information catalog is the set of tables managed by the Information Catalog Manager that contains business metadata that helps users identify and locate data and information available to them in the organization. When you publish metadata to the information catalog, you provide end users and system analysts with a fast path to seeing what is in the warehouse.

From the Data Warehouse Center, you can publish:

This chapter discusses publishing Data Warehouse Center metadata only. For information about publishing OLAP server metadata, see the Information Catalog Manager Administration Guide.

To publish metadata, you must install the DB2 Warehouse Manager package with the Information Catalog Manager. The Information Catalog Manager runs on Windows 32-bit operating systems. For information about the DB2 Universal Database platforms on which an information catalog can reside, see the Information Catalog Manager Administration Guide.

Planning to publish metadata

Before you publish metadata, you need to ensure that the correct environment is established.

  1. Ensure that you installed and configured the necessary warehouse components on the correct workstations:

    The Information Catalog Manager administrator function must be installed both on the warehouse server site and on the Data Warehouse Center administrative interface component if they are on different workstations.

    The Data Warehouse Center administrative interface must have DB2 connectivity to the information catalog APIs.

  2. Ensure that both the administrator user IDs for the Information Catalog Manager and Data Warehouse Center have Windows NT administrator privileges.

Selecting metadata to publish

Table 18 provides the mapping between object types in the Data Warehouse Center and information catalog. The Data Warehouse Center uses this mapping when you export the metadata to the information catalog. See Appendix B, Metadata mappings for a detailed mapping of object types and object type properties.

Table 18. Mapping between Data Warehouse Center and information catalog object types
Data Warehouse Center object type Information catalog object type
Process DWC Process
Step Transformation (at the table or column level)
Column or field Columns or fields
Warehouse source or warehouse target Databases, IMS database definitions
Subject area Business subject areas
Table or file Relational tables and views, Files, IMS segments
Warehouse schema Star Schema

When you publish a process, the following objects are published with it:

Requirement: Before you publish metadata, make sure that you are not logged on to the Information Catalog Manager as an administrator.

To select the metadata that you want to publish:

  1. From the Data Warehouse Center, right-click on the Warehouse node, and click Publish metadata. The Publish Metadata window opens.
  2. Right-click on the white space in the list, and click Data Warehouse Center metadata. The Publish Data Warehouse Center Metadata notebook opens.
  3. In the Available objects list, select the objects that you want to publish, and click >

    The object moves to the Selected objects list. Repeat this step for all the objects that you want to publish.

    If you want to publish all available objects, click >>. If a node on the tree does not contain objects within it, it is not moved.

  4. Click the Synchronization Settings tab.
  5. In the Catalog name field, type the name of the information catalog to which you want to publish metadata.
  6. In the Administrator user ID field, type a user ID that has access to the information catalog. The user ID can be the ID of the information catalog administrator or that of an information catalog user who has certain administrative privileges.
  7. In the Administrator password field, type the password that corresponds to the user ID.
  8. Type the password again in the Verify password field.
  9. Select one of the options from the Map source tables to target warehouse list to specify how you want to map processes and their source tables when they are published in the information catalog:
  10. Optional: Select one of the following check boxes to determine how published metadata will be updated:
  11. Optional: You can make the tree structure shown in the Information Catalog Manager less detailed by limiting the number of levels of objects published and displayed in the Tree view:
    1. Select Limit the levels of objects in the tree to limit the number of levels of objects that are published and displayed in the Tree view. If you do not select this check box, all objects under the selected object are published and displayed. If you specify a limit, only a subset of the objects under a selected object is published and displayed.
    2. Click the up or down arrow to specify a limit for the number of levels of objects that are published and displayed in the tree view. The default limit value is 1. The lower the limit that you specify, the less detailed the tree structure will be.

      If you set a limit, the number of levels of objects that are published and displayed in the tree view is limited to the number of levels that you specify.

  12. Click OK.

The Publish Information window opens and displays a message that tells you whether the publish process completed successfully. The Publication Information window displays the contents of a log file that lists which metadata objects were published successfully. If errors occurred, the notebook remains open when you close the Publication Information window so that you can try to publish objects again. You can find more information about a particular error message in the IBM DB2 Universal Database Message Reference.

If the publication is successful, the notebook closes when you close the Publication Information window, and two publication objects are added to the list of publications in the Publish Metadata window. One publication updates control metadata, the other updates definition metadata. Control metadata is metadata that describes changes to data in the warehouse. Examples of control metadata are the date and time that a table is updated by the processing of a step. Definition metadata is metadata that describes the objects in the warehouse, the sources of data, and the transformations that are applied to the data. Examples of definition metadata are column names, table names, and database names. The first time that you publish metadata, both control and definition metadata is registered in the information catalog. When you update metadata, you can choose to update the control metadata, the definition metadata, or both types.

Updating published metadata

After you publish metadata, you can use the publication objects (created during the publish process) periodically to propagate to the information catalog the changes that are made in the Data Warehouse Center. This process is called metadata synchronization.

When you synchronize metadata, the metadata for an object that is registered in the information catalog is updated either when you run the publication or based on a schedule that you create for the publication. Metadata is not updated in the information catalog in the following situations:

Requirement: If you plan to synchronize metadata, you must use the Data Warehouse Center warehouse control database as your information catalog database.

To update published metadata:

  1. In the List of Publications in the Publish Metadata window, right-click on the publication that you want to run.
  2. Click Run.

Scheduling regular updates of published metadata

You can automate updates to previously published metadata by creating a schedule.

To schedule regular updates of metadata to the information catalog:

  1. Open the Publish Metadata window.
  2. In the List of publications, right-click the publication that you want to schedule, and click Schedule. The Schedule page of the Schedule notebook opens so that you can create a schedule on which to run the publication.

See Scheduling warehouse processes for information about using the Schedule notebook.

Viewing log files for the publication updates

You can view the log files that show the result of the update to the publication. The files are located in the directory that is specified by the VWS_LOGGING environment variable. The default value of the VWS_LOGGING variable for Windows NT is \sqllib\logging, where x is the drive where the DB2 Universal Database is installed. The following log file names are used:
Type of metadata Purpose Log file name
Update of Data Warehouse Center control metadata Describes changes to objects in the warehouse ICMDWCD.OUT
Update of Data Warehouse Center definition metadata Describes objects in the warehouse, the sources of data, and the transformations that are applied to the data ICMXCHG.OUT

Propagating deleted objects to the information catalog

When an object is deleted in Data Warehouse Center, information about the deleted object is stored in the warehouse control database. When the metadata is updated during the synchronization process, the Data Warehouse Center propagates these deletions to the information catalog before importing other changes into the information catalog. When metadata synchronization completes successfully, the Data Warehouse Center removes the entries in the warehouse control database. Because the Data Warehouse Center removes the entries, deletions can be propagated to only one information catalog. If you need to make the deletions to a second information catalog, you must delete those items manually.

If you change the name of a warehouse object that you previously published to the information catalog, you must publish the object again to update the information catalog. The object with the old name is not overwritten, so both objects exist in the information catalog after metadata synchronization; you must delete the old object manually.


Initializing a second warehouse control database

When you install the warehouse server, the warehouse control database that you specify during installation is initialized. Initialization is the process in which the Data Warehouse Center creates the control tables that are required to store Data Warehouse Center metadata. If you have more than one warehouse control database, you can use the Data Warehouse Center--Control Database Management window to initialize the second warehouse control database. However, only one warehouse control database can be active at a time.

The Data Warehouse Center will create the database that you specify on the warehouse server workstation if the database does not already exist on the workstation. If you want to use a remote database, create the database on the remote system and catalog it on the warehouse server workstation.

To initialize a second warehouse control database:

  1. To open the Warehouse Control Database window, click Start --> Programs --> IBM DB2 --> Warehouse Control Database Management.
  2. In the New control database field, type the name of the new control database that you want to create.
  3. In the Schema field, type the name of the schema to use for the database.
  4. In the User ID field, type the name of the user ID that has access the database.
  5. In the Password field, type the password for the user ID.
  6. In the Verify Password field, type the password again.
  7. Click OK.

    The window remains open. The Messages field displays messages that indicate the status of the creation and migration process.

  8. After the process is complete, click Cancel to close the window.

Configuring the Data Warehouse Center

You can use the Data Warehouse Center Properties notebook to change global settings for your Data Warehouse Center installation. You can override many global settings in the objects that use them. For example, you can use the Properties notebook to specify the default behavior of a processing step when the warehouse agent finds no rows in the source table or file. You can override this global setting in a particular step.

You can use the configuration tool only if the Data Warehouse Center server is installed on the workstation (as well as the administrative client).

Important: Some of the fields in the Properties notebook should be changed only on the recommendation of IBM Software Support. For more information, see the online help for the Data Warehouse Center Properties notebook.

Changing the Data Warehouse Center properties

Use the Data Warehouse Center Properties notebook to change the Data Warehouse Center configuration parameters. For example, you can change the parameters for the default notification messages to suit your particular business situation.

You must have administrative privilege to change Data Warehouse Center properties.

To change the Data Warehouse Center properties:

Open the Data Warehouse Center Properties notebook:

  1. Right-click the Warehouse object, and select Properties from the pop-up menu.
  2. Change any of the following settings:
  3. Click OK to save your changes.

See the online help for the Data Warehouse Center Properties notebook for detailed descriptions of the settings in the notebook.

You can also use the Data Warehouse Center Properties notebook to run a Data Warehouse Center component trace. For more information, see Component trace data.


[ Top of Page | Previous Page | Next Page ]