- Overview
- Configuring Hardware Monitoring
- Changes
- Known Problems
- Monitors Provided
- Monitor Dependencies
- Product Documentation
- SD Product Structure
- Reporting Defects
Overview
Included with the OnlineDiag bundle of support tools are the EMS Hardware Monitors, an important set of tools for maintaining system availability. The EMS Hardware Monitors allow you to monitor the operation of a wide variety of hardware resources and be alerted immediately if any failure or other unusual event occurs.
Hardware event monitoring provides a high level of protection against system hardware failure. By using hardware event monitoring, you can eliminate most of the undetected hardware failures that could interrupt system operation or cause data loss.
The following monitors are introduced in this release on PA-RISC and Itanium-based systems:
- DS2500 Enclosure Monitor (gazemon)
- SCSI Tape Devices (dm_stape)
The following monitor is introduced in this release on Itanium-based systems:
- Emulex PCI-e Fibre Channel Mass Storage Adapter (dm_fclp_adapter)
Configuring Hardware Monitoring
The EMS Hardware Monitors are installed at the same time as the Support Tools Manager (STM). Once the monitoring software is installed, monitoring is automatically enabled.
By default, event messages generated by the monitors with a severity level of minor warning, major warning, serious, or critical are conveyed in the following ways:
- Written to /var/adm/syslog/syslog.log
- Sent to email address root
All events generated by all the monitors, regardless of severity, are stored in the /var/opt/resmon/log/event.log file.
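The default routing described above can be sketched as a small shell function. This is an illustration only, not part of the monitoring software: the function name `default_destinations` and the `email:root` label are hypothetical, and the severity strings are assumed to be spelled in lowercase as they appear in these notes.

```shell
# default_destinations SEVERITY
# Hypothetical helper that prints, one per line, where an event of the
# given severity ends up under the default configuration described above.
default_destinations() {
    # Every event is stored in the event log.
    echo "/var/opt/resmon/log/event.log"
    case "$1" in
        "critical"|"serious"|"major warning"|"minor warning")
            # These severities are also written to syslog and mailed to root.
            echo "/var/adm/syslog/syslog.log"
            echo "email:root"
            ;;
    esac
}
```

Calling `default_destinations "serious"` lists all three destinations, while any severity outside the four named above (for example a purely informational one, which is an assumption here) lists only the event log. To change the actual behavior, use monconfig as described below.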
To configure, enable, or disable hardware event monitoring, run the /etc/opt/resmon/lbin/monconfig monitoring request manager.
The Peripheral Status Monitor (PSM) and the Kernel Resource Monitor (krmond) are configured differently. They use the Event Monitoring Service (EMS) Graphical User Interface (GUI). For more information, see:
http://docs.hp.com/en/diag/ems/ems_gui.htm
Changes
Changes made to the EMS Hardware Monitors for the June 2007 release include the following:
- Changes to Monitors
This section describes the changes made to individual monitors. Monitors are listed in alphabetical order.
- Chassis Code Monitor (dm_chassis)
The following change applies only to PA-RISC-based systems:
- JAGag28140
The dm_chassis monitor reports old events when the process table is full. This problem is fixed.
- CMC Monitor (cmc_em)
- Not applicable
- Core Hardware (dm_core_hw)
The following change applies only to PA-RISC-based systems:
- JAGag28139
The dm_core_hw monitor now supports the following systems:
- rp7440
- rp8440
- 9000/800/SD16
- 9000/800/SD32
- 9000/800/SD64
- Core Hardware for Itanium (ia64_corehw)
The following change applies to PA-RISC and Itanium-based systems:
- JAGag07766
The ia64_corehw monitor does not generate events when the SENSOR_RESCAN_INTERVAL condition is met. This problem is fixed. The monitor now generates reminder events.
- Core Hardware Monitor - Asama (ipfcorehw_asama)
- Not applicable
- Core Hardware Monitor - Hitachi (ipfcorehw_hitachi)
- Not applicable
- CPE Monitor (cpe_em)
The following change applies only to Itanium-based systems:
- JAGag19632
The cpe_em monitor incorrectly generates Event 100299 when a memory error corresponding to double chip sparing is encountered. This problem is fixed.
- JAGag24589
The descriptions of events 100220 and 100221 indicate that the cell is faulty. This description is incorrect. The event description is modified to fix the problem.
- CPU Monitor (lpmc_em)
- Not applicable
- CPU Monitor - Hitachi (cmc_em_hitachi)
- Not applicable
- Disk Array FC60 Monitor (fc60mon)
- Not applicable
- Disk Monitor (disk_em)
- Not applicable
- DS2500 Enclosure Monitor (gazemon)
The following change applies to PA-RISC and Itanium-based systems:
- The DS2500 monitor is supported in this release on PA-RISC and Itanium-based systems.
- JAGag30664
The gazemon monitor generates duplicate disk_em asynchronous events. This problem is fixed.
- Fibre Channel Adapter (dm_ql_adapter)
- Not applicable
- Fibre Channel Adapter Model A5158 Monitor (dm_TL_adapter)
- Not applicable
- Fibre Channel Switch (dm_fc_sw)
- Not applicable
- Forward Progress Log (FPL) Monitor (fpl_em)
The following changes apply to PA-RISC and Itanium-based systems:
- JAGag15659
The Time Stamp of Event 676 is incorrect. This problem is fixed.
- JAGag02230
The fpl_em monitor does not generate pre-boot events. A new value, PRE_BOOT_FLAG, is added in the global config file (fpl_em.cfg) to fix this problem.
- High Availability Disk Array Monitor (ha_disk_array)
- Not applicable
- High Availability Storage System (dm_ses_enclosure)
The following change applies to PA-RISC and Itanium-based systems:
- JAGag35642
Event 601 and Event 603 are reported incorrectly. This problem is fixed.
- iSCSI Driver Subsystem Monitor (dm_iscsi_adapter)
- Not applicable
- Kernel Resource Monitor (krmond)
- Not applicable
- Memory (dm_memory)
The following change applies only to PA-RISC-based systems:
- The dm_memory monitor now supports the following systems:
- rp7440
- rp8440
- 9000/800/SD16
- 9000/800/SD32
- 9000/800/SD64
- Memory IA64 (memory_ia64)
- JAGaf23487
Event 1400 is generated when the PDT is 100% full. This event is new.
- JAGag28127
DRAM erasure events are generated when an erasure occurs. Double Chip spare events such as events 5000, 5100, 5200, 5300, 5400 are disabled to fix this problem.
- JAGag28128
Event 6100 is generated when 24 errors occur in 24 hours on the entire system. The event is now generated when 24 errors occur in 24 hours only on a single extender and not on the entire system.
- Memory Monitor -- Hitachi (ipfmemory_hitachi)
- Not applicable
- MSA1000/MSA30 Storage Disk Array Monitor (msamon)
- JAGag24840
The EMS GUI cannot identify the /storage/status/disk_arrays/MSA1000 path. Therefore, the PSM does not monitor the StorageWorks Modular SAN Array 1000/30 (MSA1000/MSA30). This problem is fixed.
- Peripheral Status Monitor (psmmon)
- Not applicable
- RAID Adapter (dm_raid_adapter)
- Not applicable
- Remote Monitor (RemoteMonitor)
- Not applicable
- SCSI Disk Monitor (scsi_disk)
- Not applicable
- SCSI Tape Devices (dm_stape)
- The dm_stape monitor is supported in this release on PA-RISC and Itanium-based systems.
- System Status Monitor (sysstat_em)
- Not applicable
- UPS Monitor (dm_ups)
- Not applicable
- Changes to Platform and Interface
- Not applicable
- Customer-Viewable Interface Changes
- Not applicable
Known Problems
- If the System Fault Management product is installed on your system, you must complete the following steps before and after updating the OnlineDiag product:
- Type the following command to shut down the System Fault Management subsystem:
# /sbin/init.d/cimserver stop
- Perform the OnlineDiag update.
- Type the following command to restart the System Fault Management subsystem after the update is completed:
# /sbin/init.d/cimserver start
- If the maxssiz_64bit kernel parameter is set below the default value of 0x800000, it can cause the lpmc_em monitor to abort.
- Memory Page Deallocation (MPD), which runs on most current HP-UX systems, does not work on rx4610 systems. The activity log for memlogd includes a message that reads "unsupported device".
MPD cannot be implemented on the rx4610 system because the system's design does not allow the memlogd daemon to run on it.
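Two of the items above lend themselves to a short script: the stop, update, and restart sequence for System Fault Management, and the check of maxssiz_64bit against its default. The following is a minimal sketch, not a supported tool. The function names are hypothetical, the update command is not specified in these notes and is therefore passed in by the caller, and the shell is assumed to accept 0x-prefixed constants in arithmetic expansion.

```shell
#!/bin/sh
# Path to the System Fault Management control script named in the steps
# above. Overridable so the wrapper can be tried on a system without SFM.
CIMSERVER=${CIMSERVER:-/sbin/init.d/cimserver}

# Default value of maxssiz_64bit, as stated above.
DEFAULT_MAXSSIZ_64BIT=0x800000

# update_with_sfm_stopped COMMAND [ARGS...]
# Hypothetical helper: stops SFM, runs the caller-supplied update command,
# then restarts SFM. Returns the exit status of the update command.
update_with_sfm_stopped() {
    "$CIMSERVER" stop || return 1
    "$@"
    update_status=$?
    # Restart SFM even if the update failed (a design choice of this sketch).
    "$CIMSERVER" start || return 1
    return "$update_status"
}

# maxssiz_status VALUE
# Prints "below" if VALUE (decimal or 0x-prefixed hex) is lower than the
# default, in which case lpmc_em may abort; otherwise prints "ok".
maxssiz_status() {
    if [ "$(( $1 ))" -lt "$(( $DEFAULT_MAXSSIZ_64BIT ))" ]; then
        echo "below"
    else
        echo "ok"
    fi
}
```

The value passed to `maxssiz_status` would come from whatever kernel tunable query tool is used on the system; how to extract it from that tool's output is outside the scope of this sketch.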
Monitors Provided
For the June 2007 release of HP-UX 11i v2 (11.23), the following monitors are provided:
- CMC Monitor (cmc_em)
- Core Hardware (dm_core_hw)
- Core Hardware for Itanium (ia64_corehw)
- Core Hardware Monitor -- Asama (ipfcorehw_asama)
- Core Hardware Monitor -- Hitachi (ipfcorehw_hitachi)
- CPE Monitor (cpe_em)
- CPU Monitor (lpmc_em)
- CPU Monitor -- Hitachi (cmc_em_hitachi)
- Disk (disk_em)
- Disk Array FC60 (fc60mon)
- DS2500 Enclosure Monitor (gazemon)
- Emulex PCI-e Fibre Channel Mass Storage Adapter (dm_fclp_adapter)
- Fibre Channel Adapter (dm_ql_adapter)
- Fibre Channel Adapter Model A5158 (dm_TL_adapter)
- Forward Progress Log (FPL) Monitor (fpl_em)
- High Availability Disk Array (ha_disk_array)
- High Availability Storage System (dm_ses_enclosure)
- iSCSI Driver Subsystem Monitor (dm_iscsi_adapter)
- Kernel Resource Monitor (krmond)
- Memory (dm_memory)
- Memory IA64 (memory_ia64)
- Memory Monitor -- Hitachi (ipfmemory_hitachi)
- MSA1000/MSA30 Storage Disk Array Monitor (msamon)
- Peripheral Status Monitor (psmmon)
- RAID Adapter (dm_raid_adapter)
- Remote Monitor (RemoteMonitor)
- Serial-Attached SCSI (SAS) Mass Storage Adapter monitor (dm_sas_adapter)
- SCSI Disk Monitor (scsi_disk)
- SCSI Tape Devices (dm_stape)
- System Status (sysstat_em)
- UPS (dm_ups)
The following monitors are NOT provided:
- dm_FCMS_adapter
- Fibre Channel SCSI Multiplexer (dm_fc_scsi_mux)
- fw_disk_array: hardware not supported on the system
- scsi123_em: hardware not supported on the system
Monitor Dependencies
For detailed information about the products and the monitors supporting them, and additional dependencies, see the Online Diagnostics documentation at:
http://docs.hp.com/en/diag/
For a list of the current required patches, see the DIAGNOSTIC.readme file for this release.
Current monitor requirements are described in the Requirements and Supported Products file. Requirements are also listed in Chapter 2 of the EMS Hardware Monitors User's Guide.
Product Documentation
For the latest and most complete information on EMS Hardware Monitors and the Support Tools Manager (STM), see the Online Diagnostics documentation at:
http://docs.hp.com/en/diag/
At this site, you will find Overviews, Tutorials, Quick Reference Cards, Frequently Asked Questions (FAQs), and other material.
For complete information on installing and using EMS Hardware Monitors, as well as a list of supported hardware, see the "EMS Hardware Monitors User's Guide" available at the mentioned site.
For the most current information on HP-UX 11i v2 (11.23) diagnostics, see the following Web pages at the Diagnostics site:
- "DIAGNOSTICS.readme for HP-UX 11i v2 (11.23) (June 2007)"
- "Release Notes for STM on HP-UX 11i v2 (11.23) (June 2007)"
- "Release Notes for EMS Hardware Monitors on HP-UX 11i v2 (11.23) (June 2007)"
SD Product Structure
The EMS Hardware Monitors are installed as part of the OnlineDiag bundle (product number B4708AA). In addition, they require the EMS framework, product number B7609BA.
For information on the STM product, see the STM release notes file: /usr/sbin/stm/Rel_NOTES.STM.
SD Bundle: OnlineDiag
Description: On-line Diagnostic System (Series 800/700)
  SD PRODUCT: Sup-Tool-Mgr
  Description: Support Tools Manager for HP-UX Systems
    SD SUB-PRODUCT: Manuals
    Description: Support Tools Manager Manual Pages
      FILESET: RELEASE_NOTES
      Description: HPUX STM Release Notes
      FILESET: STM-MAN
      Description: HPUX STM Manual Pages
    SD SUB-PRODUCT: Runtime
    Description: STM Manual Runtime
      FILESET: STM-CATALOGS
      Description: HPUX STM Shared Libraries
      FILESET: STM-SHLIBS
      Description: HPUX STM Shared Libraries
      FILESET: STM-UI-RUN
      Description: HPUX STM User Interface
      FILESET: STM-UUT-RUN
      Description: HPUX STM Unit Under Test Runtime
  SD PRODUCT: EMS-Config
  Description: EMS Config
      FILESET: EMS-GUI
      Description: Event Monitoring Service Graphical User Interface
  SD PRODUCT: EMS-Core
  Description: EMS Core Product
      FILESET: EMS-CORE
      Description: Event Monitoring Service Core Files
Reporting Defects
You can report defects by filing a request on CHART. The name of the project is diag.SR. If you do not have access to CHART, contact your local HP representative to file a defect on your behalf.