PQ75605, 4.0.4, 4.0.5, 4.0.6: Periodic loss of
Administration
Downloadable files
Abstract
Administration server hangs and the admin server will
error; OutOfMemory.
This APAR marks only remote nodes off-line,
Download Description
USERS AFFECTED:
WebSphere Application Server 4.0 users with a heavily loaded system,(more
than 50 application server).
PROBLEM DESCRIPTION:
Limited to no administration capabilities which appear to stem from an
Admin Server hanging.
Intermittent hangs occur when a system is heavily loaded (more than 50
application servers).
RECOMMENDATION:
On heavily loaded systems, the admin server will hang in the
checkOutOfOrders method waiting for an epoch change, which will never
happen. It was determined that whenever a module failed to start, an epoch
change was erroneously being made by the node, resulting in the node
marking itself offline. Then, during subsequent application server starts,
the admin server will wait for an epoch change on the node, which never
takes place.
PROBLEM CONCLUSION:
Two modifications were made:
1. The code will prevent a node from marking itself offline.
2. The code was modified so the indefinite wait in CheckOutOfOrders module
was changed to a timed wait and rollback.
Prerequisites
-WebSphere Application Server 4.0.5 and 4.0.6 the interm-fix can be
applied as is.
-WebSphere Application Server 4.0.4 the Interm-fix will need the
prerequisite SM cumulative ifix:
WAS_SysMgmt_05-01-2003_4.0.5-4.0.4-4.0.3-4.0.2_AE_Solaris_cumulative
APAR
Installation instructions
Please review the readme.txt for detailed instructions.