Data Movement Utilities Guide and Reference
Following is some information to consider before using the AutoLoader
utility:
- Familiarize yourself with AutoLoader operations by using the utility with
small amounts of data.
- If the input data is already sorted, or in some chosen order, and you wish
to maintain that order during the loading process, only one database partition
should be used for splitting. Parallel splitting cannot guarantee that
the data will be loaded in the same order it was received.
- If large objects (LOBs) are being loaded from separate files (that is, if
you are using the lobsinfile modifier through the load utility),
all directories containing the LOB files must be read-accessible to all the
database partitions where loading is taking place. The LOAD
lob-path parameter must be fully qualified when working with
LOBs.
- All temporary AutoLoader files reside in the directory specified through
the LOGFILE AutoLoader configuration parameter. This directory must be
network-accessible with both read and write access to all partitions where
splitting is to be done. By specifying different directories for
temporary files, you can run multiple concurrent AutoLoader jobs to load data
into separate tables in different table spaces.
- The maximum number of active database connections in an AutoLoader job is
the number of loading partitions defined in the OUTPUT_NODES AutoLoader
configuration parameter. Ensure that the maxxappls (maximum
number of active applications) database configuration parameter has been set
high enough.
- You can force an AutoLoader job to continue even if the AutoLoader detects
(at startup time) that some loading partitions or associated table spaces are
offline, by specifying FORCE=YES in the AutoLoader configuration
file.
- Use the STATUS_INTERVAL AutoLoader configuration parameter to monitor
the progress of an AutoLoader job. AutoLoader returns messages at
specified intervals, indicating how many megabytes of data have been
processed.
- Better performance can be expected if the splitting partitions (as defined
by the SPLIT_NODES parameter) are different from the loading partitions (as
defined by the OUTPUT_NODES parameter), since there is less contention for
CPU cycles. The AutoLoader utility itself should be invoked on a
database partition that is not participating in either the splitting or the
loading operation. On an SMP system, you can improve performance by
ensuring that there is at least one splitter task for every available
CPU.
- AutoLoader ignores the MESSAGES parameter in the LOAD command, and directs
all messages from the LOAD command into the file
load_log.XXX; this file contains messages from
the load process on database partition XXX. AutoLoader also
creates a file called splt_log.XXX; this file
contains messages from the split process on database partition
XXX. The utility also creates a file called
autoload.log, containing messages from the main AutoLoader
script. Check it to ensure that all pipes and temporary directories
have been set up correctly.
- AutoLoader chooses only one output database partition on which to collect
statistics. The RUN_STAT_NODE AutoLoader configuration parameter
can be used to specify that partition.
- Multiple invocations of AutoLoader can be used to load data simultaneously
into separate tables. Ensure that:
- The tables reside in separate table spaces.
- All of the AutoLoader operations are invoked from separate
directories.
- The data file name used to create temporary pipes is unique for each
AutoLoader operation.
[ Top of Page | Previous Page | Next Page | Table of Contents | Index ]
[ DB2 List of Books |
Search the DB2 Books ]