PQ69308: AFTER "NO PING RECEIVED" MSG, APPSERVER PROCESS KILLED BUT NOT BE RESTARTED

APAR status
Closed as program error.

Error description
After I implemented code for 
PQ68450 to ensure that the a
hanging application server gets terminated as it should,
an additional problem was noted.  After the application server
was terminated by the admin server in response to the "No Ping
Received" message, I was unable to restart the failed
application server. I had to kill the nanny and admin server
processes, and the application server was restarted
automatically, and console stops/starts worked OK after that.
Local fix Problem summary
****************************************************************
* USERS AFFECTED: WebSphere Application Server users.          *
****************************************************************
* PROBLEM DESCRIPTION: When user tries to stop an              *
*                      application server via admin console,   *
*                      wscp, or XMLConfig, sometimes the       *
*                      application server process continues    *
*                      to run.                                 *
****************************************************************
* RECOMMENDATION:                                              *
****************************************************************
This is identical to 
PQ68450.  Here is summary from that APAR:

When user stops an application server via any of the
WebSphere client programs, WebSphere reports that the
application is stopped but looking at the process table
of the system (e.g., "ps -ef"), the process remains active.
Problem conclusion
Changed the code so that process is terminated with SIGKILL.
Original code did something like:

if (kill(app server pid, SIGTERM)!= 0)
{
. . .
kill(app server pid, SIGKILL);

Problem was that return code of "0" from kill() does not
mean that target process is terminated - only that target
pid exists, user has permission to send signal to target
process, and signal number is valid.
Temporary fix
All system management cumulative eFixes dated 20Dec02 or
later.
Comments
APAR information
APAR number PQ69308
Reported component name WEBSPHERE AE AI
Reported component ID 5630A2200
Reported release 400
Status CLOSED PER
PE NoPE
HIPER NoHIPER
Submitted date 2002-12-19
Closed date 2003-01-21
Last modified date 2003-01-21

APAR is sysrouted FROM one or more of the following:

APAR is sysrouted TO one or more of the following:

Modules/Macros
ADMINSVR          

Fix information
Fixed component name WEBSPHERE AE AI
Fixed component ID 5630A2200

Applicable component levels
R400 PSY    UP


Document Information


Product categories: Software > Application Servers > Distributed Application & Web Servers > WebSphere Application Server > General
Operating system(s):
Software version: 400
Software edition:
Reference #: PQ69308
IBM Group: Software Group
Modified date: Jan 21, 2003