Platform Computing Corp.

lsb_suspreason()

Explains why a job was suspended.

DESCRIPTION

lsb_suspreason() uses the SBD to explain why a system-suspended or user-suspended job was suspended.

SYNOPSIS

#include <lsf/lsbatch.h> 
char *lsb_suspreason(int reasons, int subreasons,
                     struct loadIndexLog *ld);
struct loadIndexLog {
    int nIdx;
    char **name;
}; 

PARAMETERS

reasons

Reasons a job is suspended:

SUSP_HOST_LOCK

The LSF administrator has locked the execution host.

SUSP_LOAD_REASON

A load index exceeds its threshold. The subreasons field indicates which indices.

SUSP_MBD_PREEMPT

The job was preempted by mbatchd because of a higher priority job.

SUSP_QUEUE_WINDOW

The run window of the queue is closed.

SUSP_RESCHED_PREEMPT

Suspended after preemption. The system needs to re-allocate CPU utilization by job priority.

SUSP_SBD_PREEMPT

Preempted by sbatchd. The job limit of the host/user has been reached.

SUSP_USER_RESUME

The job is waiting to be re-scheduled after being resumed by the user.

SUSP_USER_STOP

The user suspended the job.

SUSP_ADMIN_STOP

The job was suspended by root or the LSF administrator.

SUSP_SBD_STARTUP

The job is suspended while the sbatchd is restarting.

SUSP_HOST_LOCK_MASTER

The execution host is locked by the master LIM.

SUSP_QUE_STOP_CONDITION

The suspend conditions of the queue, as specified by the STOP_COND parameter in lsb.queues, are true.

SUSP_QUE_RESUME_CONDITION

The resume conditions of the queue, as specified by the RESUME_COND parameter in lsb.queues, are false.

SUSP_RES_RESERVE

The job's requirements for resource reservation are not satisfied.

SUSP_RES_LIMIT

The job is terminated due to resource limit.

SUSP_PG_IT

The job was suspended due to the paging rate and the host is not idle yet.

SUSP_REASON_RESET

Resets the previous reason.

SUSP_MBD_LOCK

The job is locked by the mbatchd.

SUSP_LOAD_UNAVAIL

Load information on the execution hosts is unavailable.

subreasons

If reasons is SUSP_LOAD_REASON, subreasons indicates the load indices that are out of bounds. The integer values for the load indices are found in lsf.h.

If reasons is SUSP_RES_LIMIT, subreasons indicates which resource limit the job reached. The integer values for these subreasons are found in lsbatch.h.

Subreasons when reasons is SUSP_LOAD_REASON:

R15S

15-second CPU run queue length

R1M

1-minute CPU run queue length

R15M

15-minute CPU run queue length

UT

1-minute CPU utilization

PG

Paging rate

IO

Disk I/O rate

LS

Number of login sessions

IT

Idle time

TMP

Available temporary space

SWP

Available swap space

MEM

Available memory

USR1

Describes unavailable or out-of-bounds user-defined load information from an external dynamic load index on execution hosts.

USR2

Describes unavailable or out-of-bounds user-defined load information from an external dynamic load index on execution hosts.

Subreasons when reasons is SUSP_RES_LIMIT:

SUB_REASON_RUNLIMIT

The run limit was reached.

SUB_REASON_DEADLINE

The deadline was reached.

SUB_REASON_PROCESSLIMIT

The process limit was reached.

SUB_REASON_CPULIMIT

The CPU limit was reached.

SUB_REASON_MEMLIMIT

The memory limit was reached.

ld

When reasons is SUSP_LOAD_REASON, ld is used to determine the name of any external load indices. ld uses the most recent load index log in the lsb.events file.

The loadIndexLog structure contains the following fields:

nIdx

Number of load indices.

name

Names of load indices.
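
As a minimal sketch of how the two fields fit together, the helper below looks up a load index name by position. The struct is repeated from the SYNOPSIS only so that the fragment compiles without the LSF headers; real code would include <lsf/lsbatch.h> instead. The helper load_index_name() is hypothetical and not part of the LSF API, and it assumes that name holds nIdx entries.

```c
#include <stddef.h>

/* Repeated from the SYNOPSIS so the sketch compiles on its own;
 * real code gets this definition from <lsf/lsbatch.h>. */
struct loadIndexLog {
    int nIdx;      /* number of load indices */
    char **name;   /* names of load indices (assumed to hold nIdx entries) */
};

/* Hypothetical helper (not part of LSF): returns the name stored at
 * position idx, or NULL when ld is missing or idx is out of range. */
static const char *load_index_name(const struct loadIndexLog *ld, int idx)
{
    if (ld == NULL || ld->name == NULL)
        return NULL;
    if (idx < 0 || idx >= ld->nIdx)
        return NULL;
    return ld->name[idx];
}
```

Bounds-checking against nIdx before indexing name keeps a stale or unexpected index value from reading past the end of the array.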

RETURN VALUES

char: reasons

Returns the suspending reason string.

char:NULL

The function failed. The reason code is bad.
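
Because a bad reason code yields NULL, callers may want to guard the result before printing it. The following is a minimal sketch of such a guard; reason_or_default() is a hypothetical helper, not part of LSF, and the fallback text is an arbitrary choice.

```c
#include <stddef.h>

/* Hypothetical helper (not part of LSF): substitutes a fallback message
 * when the string returned by lsb_suspreason() is NULL. */
static const char *reason_or_default(const char *reason)
{
    return reason != NULL ? reason : "unknown suspend reason";
}
```

A call site could then pass the result of lsb_suspreason(reasons, subreasons, ld) through reason_or_default() before handing it to printf() or a logging routine.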

ERRORS

No error handling

SEE ALSO

Related API

lsb_pendreason() - Explains why a job is pending

Equivalent line command

bjobs -s

Environment Variable

LSB_SUSP_REASONS

Files

lsb.queues

lsb.events


Platform Computing Inc.
www.platform.com