Release Date: January
2011
The following bugs have been
fixed in LSF Version 8.0 since the September 2009 update (LSF 7
Update 6):
138040 |
Date |
2009-11-18 |
|
Description |
You remove EGO_DAEMONS_CPUS and run "lsadmin limrestart", but lim is still bound to the CPU. |
|
Component |
lim |
|
Platform |
Linux 2.6 |
|
Impact |
The configuration change does not take effect. |
145947 |
Date |
2010-03-17 |
|
Description |
The vemkd socket connection keeps growing until the file descriptor limit is exhausted. |
|
Component |
vemkd |
|
Platform |
All |
|
Impact |
The file descriptors on the LSF master host are exhausted. |
152787 |
Date |
2010-09-21 |
|
Description |
The |
|
Component |
sbatchd |
|
Platform |
Linux/UNIX |
|
Impact |
Some of the jobs do not use the spool directory and job files are left over. |
140403 |
Date |
2010-01-12 |
|
Description |
The mbatchd daemon does not give a warning message for fairshare configuration errors. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
A job may not be charged to the correct fairshare account. |
156734 |
Date |
2010-09-30 |
|
Description |
The bqueues -l command shows job runTime overflow. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
The counter for fairshare charges is incorrect. |
154126 |
Date |
2010-08-20 |
|
Description |
The bjobs -W command sometimes displays negative CPU time. |
|
Component |
bjobs |
|
Platform |
All |
|
Impact |
The bjobs -W command output is incorrect. |
138888 |
Date |
2009-11-12 |
|
Description |
The bjobs -A command shows jobs from other users when NEWJOB_REFRESH=Y is set in lsb.params. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
Confusion and possible network congestion when other users’ jobs are shown. |
145426 |
Date |
2010-06-01 |
|
Description |
Improve mbschd performance for job arrays. |
|
Component |
mbatchd schmod_limit.so mbschd |
|
Platform |
All |
|
Impact |
Improve mbschd performance for job arrays. |
144960 |
Date |
2010-04-07 |
|
Description |
Hosts become unlicensed for a period of time after a daemon is down. |
|
Component |
lim |
|
Platform |
All |
|
Impact |
Hosts become unlicensed for a period of time. |
140317 |
Date |
2009-12-25 |
|
Description |
The lspasswd command does not validate the passwd.lsfuser file. |
|
Component |
lspasswd.exe |
|
Platform |
Windows |
|
Impact |
Jobs pass the lspasswd check, then pend because the password is wrong. |
143648 |
Date |
2010-03-17 |
|
Description |
Jobs sometimes pend by mistake because ENFORCE_ |
|
Component |
schmod_limit.so |
|
Platform |
All |
|
Impact |
Jobs are in the pending state when they should not be. |
135419 |
Date |
2009-09-29 |
|
Description |
The bmod command cannot modify the resource of a job submitted from a floating client host for several minutes after cluster reconfiguration. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
This affects usability and performance. |
151841 |
Date |
2010-07-06 |
|
Description |
The pwm.sys sometimes causes Windows to hang. |
|
Component |
pwm.sys |
|
Platform |
Windows |
|
Impact |
The host does not respond. |
144432 |
Date |
2010-03-15 |
|
Description |
Hosts become unlicensed for a short period of time. |
|
Component |
lim |
|
Platform |
All |
|
Impact |
Client requests are rejected while the host is unlicensed. |
137028 |
Date |
2009-10-27 |
|
Description |
Suspended jobs with a memory requirement are not resumed. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
A job stays suspended even when resources are available. |
143328 |
Date |
2010-01-28 |
|
Description |
The CPU load is not reduced when a job is pending with PEND_NO_MAPPING. |
|
Component |
sbatchd |
|
Platform |
Linux 2.6 |
|
Impact |
Job binding is unbalanced. |
142540 |
Date |
2010-01-19 |
|
Description |
The lim daemon reports the wrong maximum memory when physical memory is bigger than 4 GB. |
|
Component |
lim |
|
Platform |
MacOS |
|
Impact |
Memory size is incorrect. |
135524 |
Date |
2009-09-25 |
|
Description |
Jobs are killed when you use a port scanner. |
|
Component |
res |
|
Platform |
All |
|
Impact |
LSF jobs cannot run under a port scanner. |
139350 |
Date |
2009-11-27 |
|
Description |
The child process of lim inherits the CPU binding when you run lsload. |
|
Component |
lim |
|
Platform |
Linux 2.6 |
|
Impact |
The child process of lim runs on the same CPU as lim does, the master lim performance is affected. |
133552 |
Date |
2009-09-01 |
|
Description |
The badmin perfmon view displays Total Queries, but it is not a total for user queries, and it is not the total of the given columns or rows. It is the total number of requests processed by mbatchd, which includes user queries (b* command queries) as well as internal transactions. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
The term "total queries" is confusing. |
139318 |
Date |
2010-04-28 |
|
Description |
A License Scheduler job cannot be resumed after the duration expires. |
|
Component |
sbatchd mbatchd |
|
Platform |
All |
|
Impact |
The job cannot be resumed. |
143145 |
Date |
2010-01-29 |
|
Description |
You cannot submit a job from Linux to Windows using the special characters “ >”, “<”, or “|”. |
|
Component |
bsub |
|
Platform |
UNIX |
|
Impact |
Your application cannot use the special characters “ >”, “<”, or “|”. |
140575 |
Date |
2009-12-11 |
|
Description |
The bhist command displays a format error for the EVENT_ |
|
Component |
bhist |
|
Platform |
All |
|
Impact |
This affects the viewing and collecting of job information. |
148839 |
Date |
2010-05-18 |
|
Description |
You define rusage[resourceA||resourceB], but if the limit for resourceA is reached, resourceB is not used. |
|
Component |
schmod_default.so |
|
Platform |
All |
|
Impact |
The "OR" relationship does not take effect for a sequential job. |
139810 |
Date |
2009-12-01 |
|
Description |
CLEARCASE_ROOT is not set to the correct path, and the job fails. Information such as job ID and time stamp are not logged by daemons.wrap. |
|
Component |
daemons.wrap |
|
Platform |
UNIX |
|
Impact |
If CLEARCASE_ROOT is not set correctly, jobs fail, and the log messages are not clear. |
154813 |
Date |
2010-08-27 |
|
Description |
The bjobs command does not work as expected. |
|
Component |
bjobs mbatchd |
|
Platform |
All |
|
Impact |
The bjobs command does not work as expected. |
136780 |
Date |
2009-10-14 |
|
Description |
You cannot close an unavailable host. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
You cannot close an unavailable host. |
152172 |
Date |
2010-12-08 |
|
Description |
There are some errors message in mbschd.log, and mbschd consumes a lot of memory. |
|
Component |
mbschd |
|
Platform |
All |
|
Impact |
The mbschd daemon cannot dispatch jobs normally. |
158777 |
Date |
2010-11-17 |
|
Description |
The master lim hangs on Solaris 8. |
|
Component |
lim |
|
Platform |
All |
|
Impact |
The master lim hangs. |
153499 |
Date |
2010-07-30 |
|
Description |
The lspasswd command times out when the secondary master host is unavailable. |
|
Component |
lim |
|
Platform |
All |
|
Impact |
You cannot update your LSF password. |
151103 |
Date |
2010-07-02 |
|
Description |
You delete the C$ share and run the installer, the error is "Cannot start LSF Workflow Monitor service on host wxp64." |
|
Component |
Installer |
|
Platform |
All |
|
Impact |
You cannot install LSF. |
137919 |
Date |
2009-11-19 |
|
Description |
With two EGO SLA consumers, one consumer has to reclaim resources to run jobs. The command “ego consumer alloc” shows resources have been allocated, but jobs are very slow to start. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
Priority jobs do not start on time, job submission takes 25-30 minutes with reclaim instead of 1-2 minutes without it. |
138036 |
Date |
2009-11-01 |
|
Description |
The lim and license management software logs show false error messages indicating that features are not available or that the client and license server license files are not synchronized. |
|
Component |
lim |
|
Platform |
All |
|
Impact |
You see error messages but there are no errors. |
140674 |
Date |
2009-12-15 |
|
Description |
To address performance and scalability issues with job scheduling, add timing debug messages in mbatchd and mbschd. |
|
Component |
mbatchd mbschd schmod_default.so schmod_fairshare.so schmod_limit.so |
|
Platform |
All |
|
Impact |
This improves debugging performance. |
138798 |
Date |
2009-11-10 |
|
Description |
Loading libptmalloc3.so in mbatchd may cause a problem with egroup. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
The egroup could fail. |
150106 |
Date |
2010-06-08 |
|
Description |
A job's reservation of a numeric resource is lost after mbatchd reconfiguration, restart, or if another resource reservation expired. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
A job fails because the resource reservation is lost. |
146803 |
Date |
2010-04-07 |
|
Description |
Memory leak. |
|
Component |
libbat.a |
|
Platform |
All |
|
Impact |
Memory leak. |
147515 |
Date |
2010-04-14 |
|
Description |
ENFORCE_ |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
The limit applies to all user groups. |
146311 |
Date |
2010-04-16 |
|
Description |
Over-preemption happens when there are high priority pending jobs with unsatisfied resource requirements. |
|
Component |
schmod_preemption.so |
|
Platform |
All |
|
Impact |
The time to finish all jobs is much longer than usual. |
139186 |
Date |
2009-12-16 |
|
Description |
When you run badmin reconfig, bld restarts automatically. |
|
Component |
mbatchd |
|
Platform |
UNIX |
|
Impact |
If there are multiple clusters sharing bld, reconfiguring one cluster impacts the other clusters. |
148500 |
Date |
2010-05-07 |
|
Description |
If there is space in the process name, pim gets an incorrect process snapshot. |
|
Component |
pim |
|
Platform |
Linux |
|
Impact |
Cannot get the correct process information. |
138899 |
Date |
2009-11-18 |
|
Description |
A job submitted from a floating client without a type could pend forever if any of the server hosts have an exclusive resource defined. |
|
Component |
mbschd schmod_default.so |
|
Platform |
All |
|
Impact |
A job submitted from a floating client could pend forever. |
142921 |
Date |
2010-01-27 |
|
Description |
The command lshosts displays the wrong number of CPUs for a dual-core Itanium 2 host. |
|
Component |
lim |
|
Platform |
ia64 linux2.4 |
|
Impact |
Over-licensing. |
148608 |
Date |
2010-07-21 |
|
Description |
Some events are overwritten. |
|
Component |
mbatchd.exe |
|
Platform |
Windows |
|
Impact |
JFD may fail to read the lsb.events file and some of the flow cannot go smoothly. It can also cause job loss and bhist command errors. |
137465 |
Date |
2009-10-27 |
|
Description |
If HOME is set in lstcsh, $home is set to $cwd. |
|
Component |
lstcsh |
|
Platform |
All |
|
Impact |
The HOME environment variable is set incorrectly, and the ~cd command does not work as expected. |
70861 |
Date |
2010-05-18 |
|
Description |
Preemption does not work normally due to over-reservation caused by cancelling dispatch decisions. |
|
Component |
mbschd schmod_preemption.so |
|
Platform |
All |
|
Impact |
Preemption does not work as expected. |
148923 |
Date |
2010-05-23 |
|
Description |
Checkpointable jobs sometimes fail to restart because the job file cannot be found. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
Restarted checkpointable jobs fail to start. |
157160 |
Date |
2010-10-18 |
|
Description |
After you modify the lsf.cluster file, hostsetup recognizes the machine as a client. |
|
Component |
Installer |
|
Platform |
All |
|
Impact |
Cannot run hostsetup. |
144936 |
Date |
2010-02-28 |
|
Description |
The command ./daemons.wrap -V does not show the version of the binary for several clusters on different platforms. |
|
Component |
daemons.wrap |
|
Platform |
All |
|
Impact |
Difficult to know the version of daemons.wrap. |
157770 |
Date |
2010-10-26 |
|
Description |
The lim fails when a child exits because of a signal from the function in child_handler. |
|
Component |
lim |
|
Platform |
All |
|
Impact |
The system is not stable. |
159933 |
Date |
2010-11-25 |
|
Description |
The mbatchd daemon sets the wrong pending reason for a job if there are over 299 resources in the LSF resource table. |
|
Component |
lim mbatchd |
|
Platform |
All |
|
Impact |
The command bjobs shows the wrong pending reason. |
144641 |
Date |
2010-02-28 |
|
Description |
Recursive job submission causes the last job to fail. |
|
Component |
sbatchd res |
|
Platform |
All |
|
Impact |
Jobs fail. |
134276 |
Date |
2009-09-02 |
|
Description |
The ls index is incorrect. |
|
Component |
lim |
|
Platform |
All |
|
Impact |
This can affect the availability of a host. |
141326 |
Date |
2009-12-31 |
|
Description |
The command bpeek fails if $HOME/.lsbatch/ is not accessible with the local spool directory configured. |
|
Component |
bpeek |
|
Platform |
All |
|
Impact |
The bpeek command fails on non-execution hosts. |
143160 |
Date |
2010-01-26 |
|
Description |
The mbatchd log file contains some characters that cannot be displayed. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
Cannot read the log file, problems with the command "wc". |
67806 |
Date |
2010-06-30 |
|
Description |
The bhosts command shows the host status as “closed_Busy” when it should be “ok”. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
The bhosts command output is incorrect. |
136446 |
Date |
2009-10-19 |
|
Description |
The mbatchd daemon sends out more data than necessary when "bjobs -w" has more than one job ID. |
|
Component |
bjobs |
|
Platform |
All |
|
Impact |
The bjobs -w queries in job scripts cause very high network load. |
136498 |
Date |
2009-10-14 |
|
Description |
The command blaunch fails to run a command which has space in its path. |
|
Component |
blaunch |
|
Platform |
All |
|
Impact |
This problem impacts PMPI 7.1 integration, the PMPI default installation path has spaces. |
139193 |
Date |
2009-11-20 |
|
Description |
The pim daemon slows down on Linux hosts. |
|
Component |
pim |
|
Platform |
Linux |
|
Impact |
LSF does not report a job's rusage. |
|
Parameter |
LSF_PIM_LINUX_ENHANCE
in lsf.conf |
135812 |
Date |
2009-09-27 |
|
Description |
Jobs submitted via bsub have a trailing colon added to the LD_LIBRARY_PATH. |
|
Component |
sbatchd |
|
Platform |
All |
|
Impact |
This may cause problems with the job application. |
148956 |
Date |
2010-05-20 |
|
Description |
Sometimes mbatchd dies when lim is being restarted. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
The system becomes unavailable for a short time. |
154291 |
Date |
2010-09-02 |
|
Description |
The mbatchd daemon is killed by sbatchd when egroup does not return on time. |
|
Component |
sbatchd |
|
Platform |
All |
|
Impact |
The system cannot start properly. |
148528 |
Date |
2010-05-06 |
|
Description |
Using bsub with esub.password is slow when there are a lot of unavailable hosts in the cluster. |
|
Component |
esub.password.exe lspassword.exe |
|
Platform |
Windows |
|
Impact |
Performance slows down. |
155646 |
Date |
2010-09-13 |
|
Description |
The mbatchd daemon fails because of the file system error "Interrupted system call". |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
The system is not stable. |
117503 |
Date |
2009-08-13 |
|
Description |
The command “bhist –l all” is case sensitive. |
|
Component |
bmod.exe bstop.exe bhist.exe libbat.lib liblsf.lib bresume.exe bjobs.exe eauth.exe bkill.exe bsub.exe |
|
Platform |
Windows |
|
Impact |
The command “bhist –l all” is case sensitive. |
140131 |
Date |
2009-12-09 |
|
Description |
In a MultiCluster environment, if schmod_mc is not set in lsb.modules, there is no error in the log file. |
|
Component |
mbschd |
|
Platform |
All |
|
Impact |
The log file does not indicate improper configuration. |
145350 |
Date |
2010-03-18 |
|
Description |
Dynamic user priority becomes 100 after you change |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
Some users have a high priority. |
158241 |
Date |
2010-10-27 |
|
Description |
LSB_SUB_QUEUE is not set in $LSB_SUB_PARM_ |
|
Component |
liblsf.a liblsf.so libbat.a brestart libbat.so bsub |
|
Platform |
All |
|
Impact |
Unable to check queues in $LSB_SUB_PARM_ |
122721 |
Date |
2010-06-22 |
|
Description |
A Perl script running in LSF on an x64 system crashes the machine. |
|
Component |
sbatchd.exe |
|
Platform |
win2003-x64 |
|
Impact |
This job cannot run on an x64 system. |
147709 |
Date |
2010-04-21 |
|
Description |
The PA administrator cannot delete the lsb.stream file. |
|
Component |
mbatchd |
|
Platform |
Linux/UNIX/Solaris |
|
Impact |
The Analytics data loader has to run under the primary LSF administrator account, but this is not realistic since the PA administrator and the LSF administrator are different IT roles. |
137023 |
Date |
2009-11-03 |
|
Description |
Even with available hosts, jobs take a long time (3 minutes) to start running if LSB_SLOT_RESERVE_ENHANCE=Y. |
|
Component |
schmod_reserve.so mbschd schmod_default.so |
|
Platform |
All |
|
Impact |
Performance is poor and job dispatching is slow. |
140133 |
Date |
2009-12-10 |
|
Description |
The lspasswd.exe program contacts LSF Windows client hosts in the cluster to check the password. |
|
Component |
lspasswd.exe |
|
Platform |
Windows |
|
Impact |
It takes a long time for lspasswd.exe to finish if it runs on a client host. |
154718 |
Date |
2010-09-10 |
|
Description |
The mbatchd daemon cannot start if an operation generates the wrong event file. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
The mbatchd daemon cannot start if an operation generates the wrong event file. |
149947 |
Date |
2010-06-10 |
|
Description |
The lspasswd -C option does not require the password as input. |
|
Component |
lspasswd.exe |
|
Platform |
Windows |
|
Impact |
The lspasswd -C option does not require the password as input. |
138969 |
Date |
2009-12-03 |
|
Description |
The lsrcp program fails to copy a file when the file name contains the '@' character. |
|
Component |
lsrcp |
|
Platform |
All |
|
Impact |
The lsrcp program cannot copy a file if the name is in the format “my@dirlist user@host:/xxx”. |
134611 |
Date |
2009-09-25 |
|
Description |
The MELIM kills all processes in the system. |
|
Component |
melim |
|
Platform |
All |
|
Impact |
The MELIM brings the host down. |
147851 |
Date |
2010-05-05 |
|
Description |
Jobs are not dispatched by mbschd and you see the error
" |
|
Component |
mbschd schmod_default.so |
|
Platform |
All |
|
Impact |
Jobs pend unless you run badmin reconfig. |
155627 |
Date |
2010-09-10 |
|
Description |
The lsload command reports 0 MB memory and 0 MB swap. |
|
Component |
lim |
|
Platform |
All |
|
Impact |
The lsload command provides the wrong information. |
159001 |
Date |
2010-11-11 |
|
Description |
The return value of lspasswd -c is 0 although the command fails. |
|
Component |
lspasswd.exe |
|
Platform |
Windows |
|
Impact |
You cannot use the environment variable %errorlevel% to get the lspasswd execution result. |
136366 |
Date |
2009-10-12 |
|
Description |
The bjobs command calls the master lim. |
|
Component |
bjobs |
|
Platform |
All |
|
Impact |
Cluster performance is affected. |
139036 |
Date |
2009-11-16 |
|
Description |
The command badmin perfmon view cannot show information related to the file descriptor on x86-64-sol10. |
|
Component |
mbatchd |
|
Platform |
Solaris10 x86-64 |
|
Impact |
You cannot get the information related to the file descriptor. |
147435 |
Date |
2010-07-08 |
|
Description |
The command badmin ckconfig does not report an error message if the USER_SHARE parameter is defined incorrectly. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
Cannot troubleshoot the problem. |
157173 |
Date |
2010-10-14 |
|
Description |
The lsb.stream file may be lost in the stream directory. |
|
Component |
liblsbstram.so |
|
Platform |
UNIX/Linux |
|
Impact |
The PA cannot get the data from LSF. |
149326 |
Date |
2010-06-02 |
|
Description |
LSF jobs get finished twice because of EGO reclaim. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
The job result is affected when the post-done job reruns again. |
132433 |
Date |
2009-11-20 |
|
Description |
Using "," in the job-level select resource requirement string causes LSF to ignore queue level resource requirements. |
|
Component |
bsub libbat.a libbat.so |
|
Platform |
All |
|
Impact |
Jobs are dispatched to the wrong host. |
157635 |
Date |
2010-10-20 |
|
Description |
The mbschd daemon has a memory leak when removing a host from a host group. |
|
Component |
mbschd |
|
Platform |
All |
|
Impact |
The mbschd daemon consumes more and more memory. |
160950 |
Date |
2010-12-02 |
|
Description |
Using fork mode to switch events, there is error
information about root in the |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
Problems writing to lsb.events file. |
136656 |
Date |
2009-10-21 |
|
Description |
LSF 7.0.5 does not always work, sometimes res is not bound. |
|
Component |
sbatchd |
|
Platform |
All |
|
Impact |
The processor binding facility does not work. |
139963 |
Date |
2009-12-23 |
|
Description |
The LD_LIBRARY_PATH parameter is not set. |
|
Component |
profile.js |
|
Platform |
HPUX IA64 |
|
Impact |
Unable to source the JS environment. |
144633 |
Date |
2010-02-26 |
|
Description |
When USE_SERVER is used in a license file, lim fails to validate the lsf_manager feature and turns on core-based licensing. |
|
Component |
lim |
|
Platform |
All |
|
Impact |
You cannot use the USE_SERVER keyword in a license file. |
159518 |
Date |
2010-11-12 |
|
Description |
Job dependency does not work as expected when a job is requeued. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
Job dependency does not work as expected. |
139919 |
Date |
2009-12-07 |
|
Description |
A client |
|
Component |
liblsf.a libslf.so libbat.a libbat.so |
|
Platform |
All |
|
Impact |
The master lim takes a long time to recognize the new host. |
135974 |
Date |
2009-09-29 |
|
Description |
The command lsadmin limdebug does not respect the LC_PERFM debug class. |
|
Component |
lsadmin |
|
Platform |
All |
|
Impact |
You cannot retrieve information related to network packets dynamically, you must restart the master lim twice for data collection. |
143950 |
Date |
2010-02-05 |
|
Description |
CPU load is not reduced when a job pends with PEND_NO_MAPPING. |
|
Component |
sbatchd |
|
Platform |
Linux |
|
Impact |
A job may run on a highly loaded core, and take a longer time to complete. |
133672 |
Date |
2009-08-24 |
|
Description |
When the command bmig fails for a rerunnable job, the job mail is confusing. |
|
Component |
sbatchd |
|
Platform |
All |
|
Impact |
Job mail is confusing. |
149030 |
Date |
2010-05-19 |
|
Description |
The lsrcp program or lsrun program or parallel jobs fail with error "Request from non-LSF host rejected". |
|
Component |
lim |
|
Platform |
All |
|
Impact |
Cannot run any jobs. |
144584 |
Date |
2010-02-15 |
|
Description |
The slave lim reports an external resource value even though the resource is defined as [all]. |
|
Component |
lim |
|
Platform |
All |
|
Impact |
The UDP buffer overflows and the cluster does not work properly. |
132514 |
Date |
2009-08-13 |
|
Description |
LSF reports the total number of cores on a PPC64 machine
even though LSF is running on an |
|
Component |
lim |
|
Platform |
linux2.6_glibc2.3_ppc64 |
|
Impact |
LSF overcharges for licenses and job scheduling is affected. |
147378 |
Date |
2010-04-13 |
|
Description |
The overwriting operation fails when you use the lsrcp command to transfer files to a 64-bit Windows host. |
|
Component |
lsrcp |
|
Platform |
All |
|
Impact |
The lsrcp command does not work. |
142530 |
Date |
2010-01-19 |
|
Description |
The lim log shows an error on Mac OS X 10.6: idletime(): open(/var/run/utmp) failed, No such file or directory. |
|
Component |
lim |
|
Platform |
MacOS |
|
Impact |
The lim cannot get the right idle time resource. |
136686 |
Date |
2009-12-07 |
|
Description |
|
|
Component |
mbatchd blimits |
|
Platform |
All |
|
Impact |
The host is overused. |
135681 |
Date |
2009-10-20 |
|
Description |
The command bsub –L CShell fails if an error exists in the .login file. |
|
Component |
sbatchd |
|
Platform |
Linux/UNIX |
|
Impact |
A job fails without a meaningful error message. |
150608 |
Date |
2010-06-18 |
|
Description |
The command bhosts –w does not display all remote hosts. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
You cannot get the host partition from bhosts -w. |
133545 |
Date |
2009-08-31 |
|
Description |
You use lsrcp and lsid on AIX and the wrong message displays: “callLimUdp_(): got reply LIME_WRONG_MASTER”. |
|
Component |
lsrcp |
|
Platform |
All |
|
Impact |
The file transfer is successful but the message is not clear. |
149990 |
Date |
2010-12-17 |
|
Description |
The "lim -t" and "lshosts" commands report the wrong CPU or core number. |
|
Component |
lim |
|
Platform |
Linux |
|
Impact |
LSF licensing and host slots are affected. |
134007 |
Date |
2010-02-02 |
|
Description |
A job's processes are not bound to particular cores. |
|
Component |
sbatchd |
|
Platform |
linux2.6 Nehalem |
|
Impact |
You cannot use the LSF job binding feature on a Nehalem processor. |
138460 |
Date |
2009-11-16 |
|
Description |
A job is forwarded to the wrong cluster. |
|
Component |
mbschd schmod_mc.so mbatchd |
|
Platform |
All |
|
Impact |
Job forwarding does not work as expected. |
145645 |
Date |
2010-04-19 |
|
Description |
There is an xlsbatch memory leak. |
|
Component |
libbat.so) batch_lib(libbat.a xlsbatch |
|
Platform |
Linux |
|
Impact |
The xlsbatch uses more and more memory. |
140595 |
Date |
2009-12-16 |
|
Description |
The command lsadmin ckconfig -v does not report a configuration error if the lsf.cluster.clustername file resource map section is missing "End ResourceMap". |
|
Component |
lsadmin lim |
|
Platform |
All |
|
Impact |
Configuration problems are not reported on reconfiguration. |
124628 |
Date |
2010-07-29 |
|
Description |
Root mbatchd cannot start while a child mbatchd holds the batch port. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
Root mbatchd cannot start. |
141173 |
Date |
2010-02-03 |
|
Description |
The number of CPUs and cores reported by lim is wrong. |
|
Component |
lim |
|
Platform |
MacOS |
|
Impact |
Licensing is incorrect. |
144640 |
Date |
2010-03-04 |
|
Description |
A requeued block mode job fails in 7.0.6. |
|
Component |
res nios sbatchd |
|
Platform |
All |
|
Impact |
A job fails. |
134887 |
Date |
2009-09-09 |
|
Description |
A job cannot be submitted if the dependency condition is larger than 2050 characters. |
|
Component |
liblsf.so libbat.a liblsf.a libbat.so bsub |
|
Platform |
All |
|
Impact |
A job cannot be submitted. |
150743 |
Date |
2010-06-23 |
|
Description |
You run bmod –q, but the modified job is not dispatched in the correct sequence. |
|
Component |
mbschd |
|
Platform |
All |
|
Impact |
The job scheduling sequence is not consistent with the bjobs output. |
135264 |
Date |
2009-11-26 |
|
Description |
You must manually delete MultiCluster information from lsb.lease.state after MultiCluster is disabled. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
Expired MultiCluster information is visible. |
135982 |
Date |
2009-10-11 |
|
Description |
The lim runs, but lim log shows an error message if LC_TRACE and LOG_DEBUG are defined. |
|
Component |
lim |
|
Platform |
Windows |
|
Impact |
A false error is shown. |
127919 |
Date |
2009-09-14 |
|
Description |
A running job referred to an open AR keeps running when the AR expires, but an open AR job gets suspended by other pending jobs once the AR window closes. |
|
Component |
mbschd mbatchd bparams |
|
Platform |
All |
|
Impact |
Large parallel jobs fail. |
141620 |
Date |
2010-01-15 |
|
Description |
When using compute units, host groups and resource leasing in a MultiCluster environment, jobs submitted with a host specified are dispatched to the wrong host. |
|
Component |
mbschd |
|
Platform |
All |
|
Impact |
Jobs run on the wrong hosts even though the -m option is specified during job submission. |
152682 |
Date |
2010-07-21 |
|
Description |
Array elements are moved to the wrong queue if you run bswitch, bmod and badmin mbdrestart. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
An array job is sent to the wrong queue. |
144504 |
Date |
2010-02-11 |
|
Description |
You change ENFORCE_ |
|
Component |
mbschd mbatchd |
|
Platform |
All |
|
Impact |
You cannot use badmin reconfig to change the parameter
ENFORCE_ |
138171 |
Date |
2009-11-12 |
|
Description |
You cannot run LSF commands beginning with “b” from a floating client during master failover if the host order is not consistent in LSF_MASTER_LIST and LSF_SERVER_HOSTS. |
|
Component |
lim |
|
Platform |
All |
|
Impact |
The floating client will not work for a while. |
134286 |
Date |
2009-10-23 |
|
Description |
In a MultiCluster environment, rerunnable jobs in "UNKWN" state fail to update status in the submission cluster. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
Job Status does not match in the submission and execution clusters. |
149713 |
Date |
2010-07-27 |
|
Description |
The Windows installer grants special rights to the LSF Admin account even if you run the LSF daemon under the Local System Account. |
|
Component |
Installer |
|
Platform |
All |
|
Impact |
The Windows installer grants special rights to the LSF Admin account. |
152925 |
Date |
2010-07-24 |
|
Description |
The lsload output cannot show more than 9999 hosts. |
|
Component |
lsload lsplace mbatchd |
|
Platform |
All |
|
Impact |
Scalability is limited. |
156242 |
Date |
2010-10-11 |
|
Description |
The defined resource limit does not take effect on newly added dynamic hosts. |
|
Component |
mbatchd |
|
Platform |
Linux |
|
Impact |
You must run badmin reconfig to work around the issue. |
137457 |
Date |
2009-10-28 |
|
Description |
The esubs for MPICH2 and Intel MPI do not handle the “same” keyword in the resource requirement. |
|
Component |
esub.intelmpi esub.mpich2 |
|
Platform |
All |
|
Impact |
The "same" resource requirement string cannot be used for Intel MPI and MPICH2 jobs. |
135254 |
Date |
2009-12-17 |
|
Description |
The mbschd daemon reserves incorrect slots when preemption is configured with fairshare. |
|
Component |
mbschd schmod_preemption.so |
|
Platform |
All |
|
Impact |
The bqueues and bhosts commands show incorrect information and jobs are affected. |
149997 |
Date |
2010-07-05 |
|
Description |
The program eauth_userpass.exe cannot update passwd.lsfuser because the file contains the character '\r'. |
|
Component |
Installer |
|
Platform |
All |
|
Impact |
The program eauth_userpass.exe cannot update passwd.lsfuser. |
140675 |
Date |
2009-12-16 |
|
Description |
The host slot limit is not working. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
The host slot limit is not working. |
157978 |
Date |
2010-10-29 |
|
Description |
The daemon sbatchd logs error messages and rla cannot start if both LSF_CPUSETLIB and LSF_ASPLUGIN are set in lsf.conf. |
|
Component |
rla liblsf.so libbat.a libbat.so liblsf.a sbatchd pam |
|
Platform |
linux2.6-glibc2.3-x86_64, linux2.6-glibc2.4-sn-ipf |
|
Impact |
Features related to |
147705 |
Date |
2010-04-26 |
|
Description |
If you define LSF_DEBUG_MBD="LC_TRACE", or forgot to remove it from the lsf.conf file, mbatchd has slow response. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
The mbatchd daemon does not respond in a high load cluster. |
148579 |
Date |
2010-05-12 |
|
Description |
After enabling daemons.wrap, some sbatchd debug messages are not logged. |
|
Component |
sbatchd mbatchd |
|
Platform |
All |
|
Impact |
Troubleshooting is not convenient. |
149270 |
Date |
2010-05-26 |
|
Description |
The command bacct always reports memory and swap for a job as zero. |
|
Component |
sbatchd |
|
Platform |
All |
|
Impact |
PA reports cannot be generated because the data in lsb.acct is incorrect. |
136762 |
Date |
2009-11-04 |
|
Description |
With Asian language settings, if the registry key HKEY_CURRENT_USER\Console\LoadConIme is 1, the tssub job does not finish because the conime.exe process does not finish automatically |
|
Component |
lstsmgr.exe |
|
Platform |
Windows |
|
Impact |
The tssub job does not finish. |
103932 |
Date |
2009-09-27 |
|
Description |
Advance reservation files are not created under LSB_LOCALDIR when the duplicate event log feature is enabled. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
You may lose the advance reservation definition. |
132935 |
Date |
2010-01-20 |
|
Description |
LSF does not handle single and double quotes in the job command line on Windows hosts. |
|
Component |
preservestarter sbatchd bsub |
|
Platform |
Linux / Windows |
|
Impact |
The user job flow is broken because the job command line is not interpreted correctly. |
|
Parameter |
|
131751 |
Date |
2009-07-28 |
|
Description |
After changing a Windows server to a static client and restarting the daemons on the client, sbatchd cannot die automatically. |
|
Component |
sbatchd |
|
Platform |
All |
|
Impact |
The sbatchd daemon keeps running on the static client host. |
143961 |
Date |
2010-04-20 |
|
Description |
When bld exits abnormally, mlim does not log anything. |
|
Component |
lim |
|
Platform |
All |
|
Impact |
Troubleshooting is not convenient. |
143248 |
Date |
2010-02-23 |
|
Description |
A job was not dispatched because the job file was owned by root. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
The job remains pending, you must force it to run manually. |
140213 |
Date |
2009-12-15 |
|
Description |
A zombie pending job with a dependency exists in mbschd after the job gets killed in mbatchd. |
|
Component |
mbschd |
|
Platform |
All |
|
Impact |
There is a potential performance impact because mbschd still tries to schedule the job. |
134777 |
Date |
2009-09-15 |
|
Description |
The query child mbatchd is bound to the same CPU as the parent mbatchd if LSF_DAEMONS_CPUS is set. |
|
Component |
mbatchd |
|
Platform |
linux2.6 |
|
Impact |
This affects performance. |
136484 |
Date |
2009-10-14 |
|
Description |
When you call the lsb_submit |
|
Component |
liblsf.lib |
|
Platform |
All |
|
Impact |
You see windows you do not use. |
147782 |
Date |
2010-04-22 |
|
Description |
You set " |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
You cannot submit such a large job array. |
140673 |
Date |
2009-12-17 |
|
Description |
DAEMON_WRAP_DEBUG is not defined but every job logs a message to /tmp/daemons.wrap.log. |
|
Component |
daemons.wrap |
|
Platform |
All |
|
Impact |
The /tmp file system eventually runs out of disk space and memory, which can lead to other system problems. |
146411 |
Date |
2010-04-07 |
|
Description |
The esub.password.exe program cannot validate passwords in a mixed cluster with a UNIX master host. |
|
Component |
esub.password.exe |
|
Platform |
Windows |
|
Impact |
The esub.password.exe program does not work. |
140621 |
Date |
2010-01-15 |
|
Description |
The hostsetup program shows the error: “Cannot determine BINARY_TYPE”. |
|
Component |
hostsetup |
|
Platform |
All |
|
Impact |
You cannot run hostsetup or lsfinstall on Ubuntu 9.10. |
154994 |
Date |
2010-09-07 |
|
Description |
Some jobs in a large job array pend on a Windows host because they failed to get the password. |
|
Component |
sbatchd |
|
Platform |
Windows |
|
Impact |
The failed jobs pend. |
138410 |
Date |
2009-11-03 |
|
Description |
The |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
User-defined scripts cannot handle the |
153404 |
Date |
2010-08-04 |
|
Description |
There is a performance problem with mbatchd reconfiguration when many SLAs are configured. |
|
Component |
mbatchd liblsf.a libbat.so liblsf.so |
|
Platform |
All |
|
Impact |
There is a performance problem with mbatchd reconfiguration. |
153992 |
Date |
2010-08-10 |
|
Description |
The mbschd daemon may crash when bmod and bkill are executed at the same time. |
|
Component |
mbschd |
|
Platform |
All |
|
Impact |
The mbschd daemon may crash. |
147028 |
Date |
2010-04-02 |
|
Description |
The master lim becomes out of service after overwhelming requests from the slave lim when LSF_REJECT_NONLSFHOST=Y |
|
Component |
lim |
|
Platform |
All |
|
Impact |
The system is down. |
144512 |
Date |
2010-02-19 |
|
Description |
In a mixed cluster, you must specify the password file location using a parameter. |
|
Component |
lim |
|
Platform |
All |
|
Impact |
It is difficult to manage a mixed cluster. |
141438 |
Date |
2010-01-03 |
|
Description |
The sbatchd daemon logs messages such as “A socket operation has failed” and sbatchd CPU usage is 100%. |
|
Component |
sbatchd |
|
Platform |
All |
|
Impact |
The host cannot be used, you must restart sbatchd manually. |
150063 |
Date |
2010-07-08 |
|
Description |
CPU binding does not work on some non-Nehalem hosts. |
|
Component |
sbatchd |
|
Platform |
All |
|
Impact |
The CPU binding feature does not work. |
147626 |
Date |
2010-04-19 |
|
Description |
ENFORCE_ |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
ENFORCE_ |
146925 |
Date |
2010-04-13 |
|
Description |
A mistake in a job script might lead to errors in lsb.events. |
|
Component |
libbat.a mbatchd libbat.so |
|
Platform |
All |
|
Impact |
After you restart mbatchd, the job is gone, the bjobs command no longer shows the job. |
151584 |
Date |
2010-07-02 |
|
Description |
A job is mistakenly forwarded to the wrong cluster. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
The job forwarding feature does not work. |
158439 |
Date |
2010-10-29 |
|
Description |
The mbatchd daemon uses up to 2 GB memory after 2-3 days. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
You must run badmin mbdrestart. |
141644 |
Date |
2010-01-08 |
|
Description |
A program linked with the LSF |
|
Component |
liblsf.lib libbat.lib libbat.dll liblsf.dll libbatw2k.dll libbatw2k.lib liblsfw2k.dll liblsfw2k.lib |
|
Platform |
Windows |
|
Impact |
A program cannot run as a service. |
133940 |
Date |
2009-09-02 |
|
Description |
The mbatchd deamon logs duplicate event records for the same job ID in lsb.acct, with incorrect timestamps. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
This affects Platform Analytics. You see negative pending reasons for jobs. |
130564 |
Date |
2009-10-14 |
|
Description |
You cannot install all LSF features if the license file does not include the keyword FEATURE. |
|
Component |
Installer |
|
Platform |
All |
|
Impact |
LSF features are not enabled. |
149090 |
Date |
2010-05-19 |
|
Description |
The lim daemon does not detect two cores on the dual-core CPU. |
|
Component |
lim |
|
Platform |
All |
|
Impact |
The number of job slots is incorrect. |
140089 |
Date |
2009-12-28 |
|
Description |
The command bjobs shows the incorrect pending order after you run bmig. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
You cannot forecast the dispatch time based on bjobs. |
139281 |
Date |
2009-11-25 |
|
Description |
It takes over 30 seconds to finish a Session Scheduler job. |
|
Component |
libvem.so |
|
Platform |
Linux |
|
Impact |
It takes over 30 seconds to finish a Session Scheduler job. |
151614 |
Date |
2010-07-09 |
|
Description |
After you run "badmin reconfig", sbatchd dies on some slave hosts. |
|
Component |
sbatchd |
|
Platform |
All |
|
Impact |
You must start sbatchd manually on these hosts. |
157247 |
Date |
2010-10-09 |
|
Description |
The sbatchd daemon spawns a child res
with -PTY_ |
|
Component |
sbatchd |
|
Platform |
Linux/UNIX |
|
Impact |
An interactive job hangs. |
139538 |
Date |
2009-11-24 |
|
Description |
Jobs are dispatched to hosts not in the compute unit when ptile='!' is specified. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
The compute unit does not work properly. |
147379 |
Date |
2010-04-20 |
|
Description |
When LSB_MIXED_PATH_ENABLE=y, job submission using a script fails. |
|
Component |
sbatchd |
|
Platform |
All |
|
Impact |
Job submission by script fails. |
147827 |
Date |
2010-05-05 |
|
Description |
The job does not requeue to the top if LSB_REQUEUE_TO_BOTTOM is 0. |
|
Component |
mbschd |
|
Platform |
All |
|
Impact |
The job execution order is not as expected. |
138051 |
Date |
2009-11-18 |
|
Description |
An LSF job does not release its slot if it is suspended by
License Scheduler. LSF_ |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
Underutilization of slots. |
147837 |
Date |
2010-04-22 |
|
Description |
Add a log class for mkExecSibling log messages. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
The mbatchd log file size grows. |
149174 |
Date |
2010-05-25 |
|
Description |
The dual core license cannot be checked out, even though FEATURE lsf_dualcore_x86 is specified in the license file. |
|
Component |
lim |
|
Platform |
All |
|
Impact |
Dual-core machines are unlicensed. |
148825 |
Date |
2010-05-20 |
|
Description |
After configuring LSB_LOCALDIR, "badmin showconf mbd" displays an incorrect value for LSB_SHAREDIR. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
The badmin command shows incorrect information. |
137221 |
Date |
2009-10-28 |
|
Description |
The bjobs command shows unexpected results because of a threading issue in the child mbatchd. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
Job queries are interrupted. |
150860 |
Date |
2010-06-23 |
|
Description |
After submitting a simple Perl job to a win2008/win2003 64 bit host, the execution host hangs and is unreachable until the daemons restart. |
|
Component |
sbatchd |
|
Platform |
Windows |
|
Impact |
The execution host is down. |
145617 |
Date |
2010-03-28 |
|
Description |
Job submission fails if LSB_SUB_MODIFY_ |
|
Component |
liblsf.a liblsf.so bsub libbat.so |
|
Platform |
All |
|
Impact |
Job submission fails. |
133276 |
Date |
2009-08-28 |
|
Description |
LSF_BIND_ |
|
Component |
sbatchd |
|
Platform |
Linux 2.6 |
|
Impact |
CPU utilization is not balanced. |
157215 |
Date |
2010-10-10 |
|
Description |
A job submitted from a floating client pends with "Not the same type as the submission host" after restarting LSF. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
Job execution fails. |
146690 |
Date |
2010-03-31 |
|
Description |
Some dynamic hosts cannot be recognized by mbatchd, but can be recognized by lim. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
Some dynamic hosts cannot join the cluster. |
153987 |
Date |
2010-08-11 |
|
Description |
The mbatchd daemon does not log which host dropped the connection when logging this error message: do_queueInfoReq(): b_write_fix(204941860) failed, Connection reset by peer. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
Troubleshooting is difficult. |
142065 |
Date |
2010-01-19 |
|
Description |
Jobs are not dispatched to a lightly loaded host ordered by r15s:pg as default if there is no order string defined explicitly. |
|
Component |
mbschd |
|
Platform |
All |
|
Impact |
Jobs cannot be dispatched as expected, and the cluster's load is not balanced. |
137739 |
Date |
2009-11-17 |
|
Description |
The PATH environment variable is not expanded correctly while using preservestarter.exe. |
|
Component |
preservestarter.exe |
|
Platform |
Windows |
|
Impact |
Cannot find correct path. |
135490 |
Date |
2009-10-14 |
|
Description |
The lsb.stream parsing library has a memory leak, which causes the PA loader to use a lot of memory. |
|
Component |
liblsbstream.so |
|
Platform |
All |
|
Impact |
The PA loader uses a lot of memory. |
143229 |
Date |
2010-01-27 |
|
Description |
The bsub command crashes at runBatchEsub() if the requested host list is longer than 1024 characters. |
|
Component |
bsub |
|
Platform |
All |
|
Impact |
A job cannot be submitted. |
139352 |
Date |
2009-12-03 |
|
Description |
The bclusters command shows “disc” when LSB_ |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
Clusters are disconnected. Jobs cannot run on a remote cluster. |
147957 |
Date |
2010-04-29 |
|
Description |
The sbatchd daemon fills /tmp with millions of temp files. |
|
Component |
clearcase/daemons.wrap sbatchd |
|
Platform |
All |
|
Impact |
The directory /tmp is filled with temp files. |
145612 |
Date |
2010-03-23 |
|
Description |
Memory usage for a multi-thread job is not reported correctly. |
|
Component |
sbatchd res |
|
Platform |
Linux |
|
Impact |
Job accounting information is incorrect, it may affect job scheduling based on memory. |
156717 |
Date |
2010-10-29 |
|
Description |
The mbatchd daemon waits for 5 minutes to restart mbschd. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
It takes a long time for the cluster to work normally. |
158033 |
Date |
2010-10-25 |
|
Description |
The lshosts command can not update the swap value after swap changes. |
|
Component |
lim |
|
Platform |
Linux |
|
Impact |
You must restart lim. |
145641 |
Date |
2010-03-18 |
|
Description |
The variable LSB_DJOB_HOSTFILE contains a mix of \ and / characters; the / characters are not recognized by Windows. |
|
Component |
sbatchd.exe |
|
Platform |
All |
|
Impact |
The LSB_DJOB_HOSTFILE cannot be recognized by Windows. |
157291 |
Date |
2010-10-26 |
|
Description |
On hosts with more than 256 virtual CPUs, the core number per processor may be incorrect. |
|
Component |
lim |
|
Platform |
Linux |
|
Impact |
The lim daemon reports incorrect core values, which affects licensing. |
151193 |
Date |
2010-06-24 |
|
Description |
In a MultiCluster environment, the rescheduling feature did not work on AIX. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
Jobs are not rescheduled. |
124220 |
Date |
2009-09-25 |
|
Description |
The command lsmake fails while building the openWRT project. |
|
Component |
lsmakerm lsmake |
|
Platform |
Linux |
|
Impact |
Unable to build the openWRT source tree. |
143068 |
Date |
2010-01-27 |
|
Description |
Improper log classes make collection of mbatchd debug data difficult. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
The cluster is not responding. |
140592 |
Date |
2009-12-13 |
|
Description |
The allocation of vemkd and mbatchd do not match. |
|
Component |
mbatchd lib2vemkd.so |
|
Platform |
All |
|
Impact |
LSF jobs pend even with enough idle CPUs in the |
147725 |
Date |
2010-04-19 |
|
Description |
Memory leak. |
|
Component |
libbat.a |
|
Platform |
All |
|
Impact |
Memory leak. |
152925 |
Date |
2010-07-24 |
|
Description |
The command lsload cannot show more than 9999 hosts. |
|
Component |
lsload lsplace mbatchd |
|
Platform |
All |
|
Impact |
Scalability is limited. |
154479 |
Date |
2010-08-23 |
|
Description |
LSB_SUB_ |
|
Component |
bsub |
|
Platform |
All |
|
Impact |
Cannot get the value in esub. |
115080 |
Date |
2010-01-26 |
|
Description |
The bjobs –u all command output does not show the condensed host list. |
|
Component |
bjobs mbatchd |
|
Platform |
All |
|
Impact |
The bjobs –u all command output does not show the condensed host list. |
134040 |
Date |
2009-08-28 |
|
Description |
The handle and thread counts keep increasing when loading or unloading the LSF library. |
|
Component |
liblsf.lib |
|
Platform |
Windows |
|
Impact |
Your program will not work as expected. |
135702 |
Date |
2009-09-25 |
|
Description |
The lspasswd command does not work. |
|
Component |
lsf.shared |
|
Platform |
UNIX/Linux |
|
Impact |
The lspasswd command does not work. |
149157 |
Date |
2010-05-25 |
|
Description |
Incorrect calculation of the memory usage of the process. |
|
Component |
pim |
|
Platform |
Linux |
|
Impact |
Incorrect calculation of the memory usage of the process. |
145892 |
Date |
2010-05-06 |
|
Description |
|
|
Component |
lim |
|
Platform |
All |
|
Impact |
The cluster does not work. |
152097 |
Date |
2010-07-23 |
|
Description |
Running lspasswd from an LSF 7.0 host automatically alters the permissions of the passwd.lsfuser file to 600, and causes lsrcp to fail on an LSF 6.2 Windows host. |
|
Component |
lim lspasswd.exe |
|
Platform |
Windows |
|
Impact |
The lsrcp programs fails in a mixed cluster if the master runs LSF 7.0 EP 6, and the slave runs LSF 6.2. |
133477 |
Date |
2009-09-02 |
|
Description |
The LIBPATH environment variable on AIX must be set by
profile.lsf. |
|
Component |
sbatchd |
|
Platform |
All |
|
Impact |
None. |
151637 |
Date |
2010-07-11 |
|
Description |
The mbatchd daemon core dumps when a dynamic host is removed. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
The mbatchd daemon core dumps. |
143059 |
Date |
2010-02-08 |
|
Description |
The mbatchd daemon logs "updUserData1: numPEND is
negative". |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
The message is confusing. |
133621 |
Date |
2009-09-02 |
|
Description |
You added a new host as the master host, and the new master lim core dumped after reconfiguration. |
|
Component |
lim |
|
Platform |
All |
|
Impact |
The cluster is not stable, failover does not work. |
140808 |
Date |
2009-12-15 |
|
Description |
Using bsub with the XOR resource requirement causes mbatchd to core dump. |
|
Component |
mbatchd |
|
Platform |
All |
|
Impact |
The mbatchd daemon core dumps and you cannot submit jobs. |
153609 |
Date |
2010-08-01 |
|
Description |
When jobs use CPU binding on a Xen virtual machine, sbatchd core dumps. |
|
Component |
sbatchd |
|
Platform |
All |
|
Impact |
The sbatchd daemon core dumps. |
support@platform.com
www.platform.com
Toll-free: 1-877-444-4573
Platform Support
Platform Computing
Corporation
© 1994 - 2011
Platform Computing Corporation
All Rights Reserved.
Although the information in
this document has been carefully reviewed, Platform Computing Corporation (“Platform”) does not warrant it to be
free of errors or omissions. Platform reserves the right to make corrections,
updates, revisions or changes to the information in this document.
UNLESS OTHERWISE EXPRESSLY
STATED BY PLATFORM, THE PROGRAM DESCRIBED IN THIS DOCUMENT IS PROVIDED
“AS IS”
Document redistribution policy : This document is protected by copyright and you may
not redistribute or translate it into another language, in part or in whole.
You may only redistribute this document internally within your organization
(for example, on an intranet).
LSF is a registered
trademark of Platform Computing Corporation in the
ACCELERATING INTELLIGENCE,
THE BOTTOM LINE IN DISTRIBUTED COMPUTING, PLATFORM COMPUTING, CLUSTERWARE, PLATFORM ACTIVECLUSTER,
IT INTELLIGENCE, SITEASSURE, PLATFORM SYMPHONY,
PLATFORM JOBSCHEDULER, PLATFORM INTELLIGENCE,
PLATFORM INFRASTRUCTURE INSIGHT, PLATFORM WORKLOAD INSIGHT, and the PLATFORM
and LSF logos are trademarks of Platform Computing Corporation in the United
States and in other jurisdictions.
UNIX is a registered
trademark of The Open Group in the
Microsoft is either a
registered trademark or a trademark of Microsoft Corporation in the
Windows is a registered
trademark of Microsoft Corporation in the
Other products or services
mentioned in this document are identified by the trademarks or service marks of
their respective owners.