PQ81567: NodeAgent crashes after repeated start/stop of clusters and/or servers
 Downloadable files
 
Abstract
The crash is caused by a timing window (race condition) in the admin code that handles process management and monitoring.
 
Download Description
The problem is a timing window that exists when servers are started by a node agent in a multi-node configuration. If the servers are stopped within the first twenty minutes after they are started, the node agent may crash because memory is deallocated while it is still in use. The window is fairly narrow, but it can be hit.
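
The fix itself is not described here. As a generic illustration of this class of bug only, the sketch below shows one common guard against using a record after it has been freed: reference counting under a lock. It assumes a hypothetical ProcessRecord class and is not the node agent's actual code or the actual fix.

```python
import threading


class ProcessRecord:
    """Hypothetical bookkeeping entry for one managed server process."""

    def __init__(self, name):
        self.name = name
        self._lock = threading.Lock()
        self._refs = 1       # the start path (the owner) holds one reference
        self.freed = False

    def acquire(self):
        """Take a reference; returns False if the record is already freed."""
        with self._lock:
            if self.freed:
                return False
            self._refs += 1
            return True

    def release(self):
        """Drop a reference; the record is freed only when the last one goes."""
        with self._lock:
            self._refs -= 1
            if self._refs == 0:
                self.freed = True

    def status(self):
        """Read the record; fails loudly if it has already been freed."""
        if self.freed:
            raise RuntimeError("record used after it was freed")
        return self.name + ": running"


record = ProcessRecord("server1")
monitor_has_ref = record.acquire()   # the monitor takes its own reference
record.release()                     # the stop path drops the owner's reference
status_during_stop = record.status() # still valid: the monitor holds a reference
record.release()                     # the monitor is done; the record is now freed
```

Without the monitor's own reference, the stop path would free the record while the monitor was still reading it, which is the kind of window described above.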

A common way to reproduce it is to create a cluster of servers, start them, and then stop the cluster as soon as it reports that all servers are up. Repeat this and you will eventually hit the problem. It may happen within the first few restarts, or it may take days of repeated starts and stops within the twenty-minute window.
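
The reproduction steps above can be sketched as a simple loop. This is a minimal sketch only: the four callables (start_cluster, stop_cluster, all_servers_up, nodeagent_alive) are hypothetical hooks that you would have to implement with your own administrative tooling, and the function name and parameters are assumptions, not part of the product.

```python
import time


def reproduce(start_cluster, stop_cluster, all_servers_up, nodeagent_alive,
              max_cycles=100, poll_seconds=5.0, sleep=time.sleep):
    """Start and stop the cluster repeatedly until the node agent dies.

    All four callables are hypothetical hooks supplied by the caller.
    Returns the 1-based cycle on which the crash was seen, or None if
    max_cycles completed without a crash.
    """
    for cycle in range(1, max_cycles + 1):
        start_cluster()
        # Wait until every server reports that it is up.
        while not all_servers_up():
            if not nodeagent_alive():
                return cycle
            sleep(poll_seconds)
        # Stop right away so the stop lands inside the timing window.
        stop_cluster()
        if not nodeagent_alive():
            return cycle
    return None
```

Because the window is narrow, a run that returns None does not show that the problem is absent. The wait loop has no timeout, so add one if the servers might never come up.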
 
Prerequisites
Please download the UpdateInstaller below to install this fix.
 
URL                        LANGUAGE     SIZE (Bytes)
UpdateInstaller - 5.0.x    US English   7000000
UpdateInstaller - 5.1.x    US English   4000000
 
 
Installation Instructions
Please review the readme.txt for detailed installation instructions.
 
URL       LANGUAGE     SIZE (Bytes)
Readme    US English   2574
 
Download package
Download           RELEASE DATE   LANGUAGE     SIZE (Bytes)   Download Options
PQ81567 - 5.0.x    1/7/2004       US English   24004          FTP DD
PQ81567 - 5.1.x    1/7/2004       US English   29547          FTP DD
 
Technical support
1-800-IBM-SERV (U.S. Only)
 
Cross Reference information
Segment               Product                        Component   Platform   Version   Edition
Application Servers   Runtimes for Java Technology   Java SDK
Problems (APARS) fixed
PQ81567
 
 


Document Information


Product categories: Software > Application Servers > Distributed Application & Web Servers > WebSphere Application Server > Servlet Engine/Web Container
Operating system(s): Windows
Software version: 5.1
Software edition:
Reference #: 4006180
IBM Group: Software Group
Modified date: Aug 17, 2004