Using Traps to Monitor Appliance Health

Recommendations for configuring alerts that monitor appliance health.

When monitoring the health of the Integration Appliance, you can use one or both of the following methods:
  • Poll - Actively monitor runtime resource usage, including appliance garbage collection cycles, memory usage, and disk usage.
  • Trap - Receive notifications about hardware conditions such as failed fans, high temperatures, or failed disks. For more information about hardware-related SNMP traps, see About the Platform Module.

For more information about creating and enabling notification alerts, see the WMC Online Help or the Cast Iron Web Management Console Guide in the IBM WebSphere Cast Iron Information Center.

Table 1 provides recommended thresholds for notifications regarding garbage collection, memory usage, and disk usage.

Table 1. Recommended Notification Thresholds
| Parameter to Monitor | Recommended Threshold | SNMP Name | OID |
|---|---|---|---|
| Garbage Collection | Create a notification that triggers an alert if this value increases by more than 6 counts within a 1-minute period. | CASTIRON-IA-MIB::ciIaResNbrGarbageCollects | .1.3.6.1.4.1.13336.2.2.2.1.1.2.1.0 |
| Memory Usage | Create a notification that triggers an alert if this value goes over 80% (raw value of 8000). | CASTIRON-IA-MIB::ciIaResPctMemoryUsed | .1.3.6.1.4.1.13336.2.2.2.1.1.2.2.0 |
| Disk Usage | Create a notification that triggers an alert if this value goes over 75% (raw value of 7500). | CASTIRON-IA-MIB::ciIaResPctWipFull | .1.3.6.1.4.1.13336.2.2.2.1.1.2.3.0 |
Note: The parameters listed in Table 1 apply to SNMP polling only.
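
The thresholds in Table 1 can be expressed as a small check that runs over two consecutive polls. The following Python code is a minimal sketch, not part of the product: the `Sample` class and `check_thresholds` function are hypothetical names, and the sketch assumes that the three values have already been read from the OIDs in Table 1 by an SNMP client of your choice. It also assumes that polls are taken roughly one minute apart and scales the garbage collection delta to a per-minute rate.

```python
from dataclasses import dataclass

# OIDs copied from Table 1 (shown for reference; fetching them is out of scope here).
OID_GC_COUNT = ".1.3.6.1.4.1.13336.2.2.2.1.1.2.1.0"     # ciIaResNbrGarbageCollects
OID_MEMORY_USED = ".1.3.6.1.4.1.13336.2.2.2.1.1.2.2.0"  # ciIaResPctMemoryUsed
OID_DISK_USED = ".1.3.6.1.4.1.13336.2.2.2.1.1.2.3.0"    # ciIaResPctWipFull

# Recommended thresholds from Table 1.
GC_MAX_PER_MINUTE = 6    # garbage collection counts per minute
MEMORY_MAX_RAW = 8000    # raw value for 80%
DISK_MAX_RAW = 7500      # raw value for 75%


@dataclass
class Sample:
    """One poll of the three values (hypothetical container)."""
    timestamp: float  # seconds
    gc_count: int     # raw ciIaResNbrGarbageCollects value
    memory_raw: int   # raw ciIaResPctMemoryUsed value
    disk_raw: int     # raw ciIaResPctWipFull value


def check_thresholds(previous: Sample, current: Sample) -> list:
    """Return a list of alert messages for thresholds exceeded in `current`."""
    alerts = []

    # Garbage collection: compare the change between polls, scaled to one minute.
    # Assumption: polls are about 60 seconds apart; the rate is skipped if
    # the timestamps do not advance.
    elapsed = current.timestamp - previous.timestamp
    if elapsed > 0:
        gc_per_minute = (current.gc_count - previous.gc_count) * 60.0 / elapsed
        if gc_per_minute > GC_MAX_PER_MINUTE:
            alerts.append(f"garbage collection: {gc_per_minute:.1f} counts per minute")

    # Memory and disk: Table 1 pairs 80% with raw 8000, so raw / 100 gives percent.
    if current.memory_raw > MEMORY_MAX_RAW:
        alerts.append(f"memory usage: {current.memory_raw / 100:.2f}%")
    if current.disk_raw > DISK_MAX_RAW:
        alerts.append(f"disk usage: {current.disk_raw / 100:.2f}%")

    return alerts
```

For example, comparing `Sample(0.0, 100, 5000, 5000)` with `Sample(60.0, 107, 8100, 7500)` would report the garbage collection and memory conditions but not disk usage, because a raw value of exactly 7500 does not go over the threshold.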






Last updated: Thursday, 2 June 2016


http://pic.dhe.ibm.com/infocenter/wci/v7r0m0/topic/com.ibm.wci.notifications_reference.doc/using_traps_to_monitor_appliance_health.html