- Online Diagnostics
- Required and Recommended Patches
- Loading Patches
- EMS Hardware Monitors Overview
- Configuring Hardware Monitoring
- Changes
- Known Problems
- Monitors Provided
- Product Documentation
- SD Product Structure
- Reporting Defects
The Online Diagnostics software is a collection of tools that enables you to monitor and test server hardware. It comprises the Event Monitoring Service (EMS) framework, the EMS Hardware Monitors, and the Support Tools Manager (STM).
Note: On the HP-UX 11i v3 operating system, Online Diagnostics does not support tape drives. Although some of the Support Tools Manager (STM) tools may work with tape drives, they are not supported. The diagnostic tools and utilities that support these devices are HP StorageWorks Library and Tape Tools (L and TT). These tools are available at:
http://www.hp.com/support/tapetoolsThe Online Diagnostics tools are all contained in a Software Depot (SD) bundle, called the OnlineDiag bundle. This bundle is distributed in the following ways:
- OE media
- HP Software Depot Web site
The following generic command lists various levels of content of the depot:
swlist -l <level> -s /<mount_point>/DIAGNOSTICS/<os_release> where <level> = bundle Bundle level listing = product Product level listing = subproduct Sub-product level listing = fileset File set listing <mount_point> Location where the CD file system is mounted. <os_release> The OS contents of the depot. = B.11.00For OnlineDiag on HP-UX 11i v2 March 2008 release, the following apply:
- All diagnostic defect repairs and enhancements as of November 2007, are included. Any future patches dated after November 2007 must be loaded after this version of OnlineDiag.
- The new hardware resources in all active releases as of December 1, 2007 are supported.
You can install the patches in one of the following ways:
Method 1:
Install the entire Hardware Enablement Bundle (HWE) or Quality Pack (QPK) patch bundle for your system. This is a simple and tested process. However, the volume of the patch bundle can be big.Method 2:
Install only the individual patches required from the patch bundle available on the OE media. This method is advantageous because the size of the patches is small. However, you must select and install patches manually.Method 3:
Notes:
Install individual patches from the HP IT Resource Center at:
http://us.itrc.hp.com/
- After installing individual patches from the HP IT Resource Center, you must restart your system.
- You must install the patches mentioned before installing Online Diagnostics.
EMS Hardware Monitors Overview
EMS Hardware Monitors are a set of tools for maintaining system availability. They enable you to monitor the operation of a wide variety of hardware resources and be alerted immediately if any failure or other unusual event occurs.
Hardware event monitoring provides a high level of protection against system hardware failure. By using hardware event monitoring, you can eliminate the most undetected hardware failures that could interrupt system operation or cause data loss.
Configuring Hardware Monitoring
The EMS Hardware Monitors are installed along with the Support Tools Manager (STM). Once the monitoring software is installed, monitoring is enabled automatically.
By default, monitors report events with severity levels, MAJOR_WARNING, MINOR_WARNING, SERIOUS, CRITICAL, or INFORMATION in the following ways:
Events are stored in the /var/opt/resmon/log/event.log file.
- Written to /var/adm/syslog/syslog.log file
- Sent to the root E-mail address
To configure, enable, or disable hardware event monitoring, run the monitoring request manager.
The Peripheral Status Monitor (PSM) is configured using the Event Monitoring Service (EMS). For more information on how to configure PSM, see Configuring Monitors with the EMS GUI at:
http://docs.hp.com/en/diag/ems/ems _gui.htmFollowing are the changes made to the EMS Hardware Monitors for the current release:
- Changes to Monitoring
- Changes to all Monitors
- Changes to Individual Monitors
- Changes to Configuration Files
- Changes to Monitoring Request Manager
- Changes to Monitoring
Starting with the HP-UX 11i v3 March 2008 release, SFM is the default monitoring mode. However, it is possible to switch to the OnlineDiag monitoring mode. For information on how to switch to the OnlineDiag mode, see the SFM Administrator's Guide at:
http://docs.hp.com/en/diag- Changes to all Monitors
- Not applicable
- Changes to Individual Monitors
This section describes the changes made to individual monitors. Monitors are listed in alphabetical order.
- Chassis Code Monitor (dm_chassis)
- Not applicable
- CMC Monitor (cmc_em)
The following changes apply to HP Integrity systems only:
- QXCR1000584950
The Automatic Process Recovery (APR) functionality is modified. The cmc_em monitor generates Event 100662 when a Machine Check Abort (MCA) happens. If Event 100662 occurs again within a period of two months, the monitor generates Event 100661, and the Dynamic Processor Resilience (DPR) action is initiated on the faulty processor.- QXCR1000752144
On the BL870c HP Server Blade, the cmc_em monitor exits with a message that states that the processor is not supported on the system. This problem is fixed.- Core Hardware (dm_core_hw)
- Not applicable
- Core Hardware for HP 9000 and Itanium-based Intelligent Platform Management Interface (IPMI) systems (ia64_corehw)
The following change applies to PA-RISC-based and HP Integrity systems:
- JAGag35925
The ia64_corehw monitor may not generate events during the system boot, because the Get Sensor Reading IPMI command fails with the completion code Oxc8 (Requested Data field Limit). This problem is fixed.- Core Hardware Monitor - Asama (ipfcorehw_asama)
- Not applicable
- Core Hardware Monitor - Hitachi (ipfcorehw_hitachi)
- Not applicable
- CPE Monitor (cpe_em)
The following changes apply to HP Integrity systems only:
- The cpe_em monitor supports PCI Express interface events. The monitor generates Event 100107 to notify users about PCI Express interface-related errors.
- QXCR1000593459
The cpe_em monitor consumes excessive CPU on sx2000 chipset-based systems. This problem is fixed.- CPU Monitor (lpmc_em)
The following change applies to HP 9000 systems only:
- JAGag45237
While taking Dynamic Processor Resilience (DPR) action on a faulty processor, the lpmc_em monitor activates additional iCAP processors. This problem is fixed.- CPU Monitor - Hitachi (cmc_em_hitachi)
- Not applicable
- Disk Array FC60 Monitor (fc60mon)
- Not applicable
- Disk Monitor (disk_em)
- Not applicable
- DS2500 Enclosure Monitor (gazemon)
- Not applicable
- Fibre Channel Adapter Monitor for FCD Driver-based Adapters (dm_ql_adapter)
- Not applicable
- Fibre Channel Adapter Monitor for A6795A/A5158A Adapters (dm_TL_adapter)
- Not applicable
- Fibre Channel Switch (dm_fc_sw)
- Not applicable
- Forward Progress Log (FPL) Monitor (fpl_em)
- Not applicable
- High Availability Disk Array Monitor (ha_disk_array)
- Not applicable
- High Availability Storage System (dm_ses_enclosure)
The following change applies to HP 9000 and HP Integrity systems:
- QXCR1000472664
Even if the DS2405 device is working properly, the dm_ses_enclosure monitor generates Event 404 periodically. This problem is fixed.- iSCSI Subsystem (dm_iscsi_adapter)
The following applies to HP 9000 and HP Integrity systems:
- The dm_iscsi_adapter monitor is introduced in HP-UX 11i v3 March 2008 release on HP 9000 and HP Integrity systems.
- Memory (dm_memory)
- Not applicable
- Memory IA64 (memory_ia64)
- Not applicable
- Memory Monitor -- Hitachi (ipfmemory_hitachi)
- Not applicable
- MSA1000/MSA30 Storage Disk Array Monitor (msamon)
The following change applies to HP 9000 and HP Integrity systems:
- If either MSA1000, or MSA1000 and MSA1500 devices are connected to the system, the msamon monitor logs Parm Page Get Failure information in the api.log file. Over a period of time, this information may overwrite other important information in the api.log file. This problem is fixed. The msamon monitor is modified to prevent the logging of Parm Page Get Failure information in the api.log file.
- MSA 60 and MSA 70 SAS Enclosure Monitor (msamon_sas)
The following change applies to HP Integrity systems only:
- QXCR1000749824
The msamon monitor supports MSA60 and MSA70 Enclosures.- Peripheral Status Monitor (psmmon)
- Not applicable
- RAID Adapter Monitor (dm_raid_adapter)
- Not applicable
- Remote Monitor (RemoteMonitor)
- Not applicable
- Serial-Attached SCSI (SAS) Mass Storage Adapter monitor (dm_sas_adapter)
- Not applicable
- SCSI Disk Monitor (scsi_disk)
- Not applicable
- System Status Monitor (sysstat_em)
- Not applicable
- UPS Monitor (dm_ups)
- Not applicable
- Changes to Configuration Files
- Not applicable
- Changes to Monitoring Request Manager
- Not applicable
- If the maxssiz_64bit kernel parameter is set below the default value of 0x800000, it can cause the lpmc_em monitor to abort.
- The Memory Page Deallocation (MPD), which runs on most current HP-UX systems, does not work on rx4610 systems. The activity log for memlogd includes a message that reads unsupported device.
MPD cannot be implemented on the rx4610 system, because the system's design does not allow the memlogd daemon to run on it.- Upgrading OnlineDiag to the 0803 version without upgrading SFM to the 0803 version, logs an error.
- Symptom
If SFM is the monitoring mode, and you upgrade OnlineDiag to the 0803 version without upgrading SFM to the 0803 version, the following error is logged in the /var/opt/sfm/log/install.log file:
No such file or directory - /opt/sfm/bin/trigger_switch_to_sfm.sh Monitoring is still not up completely. The operation will be retried after 3 minutes.
After the upgrade is completed, OnlineDiag is the monitoring mode.
- Workaround
To switch to the SFM mode, enter the following command at the HP-UX prompt:
# /opt/sfm/bin/sfmconfig -w -s
- Prevention
Upgrade both OnlineDiag and SFM to the corresponding 0803 versions simultaneously.
For the March 2008 release of HP-UX 11i v3 (11.31), the following monitors are provided:The following monitors are NOT provided:
- CMC Monitor (cmc_em)
- Core Hardware (dm_core_hw)
- Core Hardware for HP 9000 and Itanium-based Intelligent Platform Managment Interface (IPMI) systems (ia64_corehw)
- Core Hardware Monitor -- Asama (ipfcorehw_asama)
- Core Hardware Monitor -- Hitachi (ipfcorehw_hitachi)
- CPE Monitor (cpe_em)
- CPU Monitor (lpmc_em)
- CPU Monitor -- Hitachi (cmc_em_hitachi)
- Disk (disk_em)
- Disk Array FC60 (fc60mon)
- DS2500 Enclosure Monitor (gazemon)
- Fibre Channel Adapter Monitor for FCD Driver-based Adapters (dm_ql_adapter)
- Fibre Channel Adapter Monitor for A6795A/A5158A Adapters (dm_TL_adapter)
- Forward Progress Log (FPL) Monitor (fpl_em)
- High Availability Disk Array (ha_disk_array)
- High Availability Storage System (dm_ses_enclosure)
- iSCSI Subsystem (dm_iscsi_adapter)
- Memory (dm_memory)
- Memory IA64 (memory_ia64)
- Memory Monitor -- Hitachi (ipfmemory_hitachi)
- MSA1000/MSA30 Storage Disk Array Monitor (msamon)
- MSA 60 and MSA 70 SAS Enclosure Monitor (msamon_sas)
- Peripheral Status Monitor (psmmon)
- RAID Adapter Monitor (dm_raid_adapter)
- Remote Monitor (RemoteMonitor)
- Serial-Attached SCSI (SAS) Mass Storage Adapter monitor (dm_sas_adapter)
- SCSI Disk Monitor (scsi_disk)
- System Status (sysstat_em)
- UPS (dm_ups)
- dm_FCMS_adapter
- SCSI Tape Devices (dm_stape)
- Fibre Channel SCSI Multiplexer (dm_fc_scsi_mux)
- fw_disk_array: hardware not supported on the system
- scsi123_em: hardware not supported on the system
For detailed information about the products and the monitors supporting them, and additional dependencies, see the documentation on Online Diagnosticcs at:
http://docs.hp.com/en/diag/For a list of the current required patches, see the DIAGNOSTIC.readme file at:
http://docs.hp.com/en/diag/st/st_read.ht mCurrent monitor requirements are described in the Online Diagnostics Administrator's and User's Guide at:
http://docs.hp.com/en/diag .Following are the documents related to EMS Hardware Monitors available at:
http://docs.hp.com/en/diag/
- Data Sheets
- Online Diagnostics Administrator's and User's Guide
- EMS HW Monitors for Hitachi Systems Running HP-UX
- Event Descriptions
- Release Notes
The EMS Hardware Monitors are installed as part of the OnlineDiag bundle (product number B4708AA). In addition, they require the EMS framework (product number B7609BA).
For information on the STM product, see the STM release notes file at:
/usr/sbin/stm/REL_NOTES.STMFollowing is the information about the bundle, product, sub-product, and fileset of the OnlineDiag depot:
SD Bundle: OnlineDiag Description: On-line Diagnostic System (Series 800/700) SD PRODUCT: Sup-Tool-Mgr Description: Support Tools Manager for HP-UX Systems SD SUB-PRODUCT: Manuals Description: Support Tools Manager Manual Pages FILESET: RELEASE_NOTES Description: HPUX STM Release Notes FILESET: STM-MAN Description: HPUX STM Manual Pages SD SUB-PRODUCT: Runtime Description: STM Manual Runtime FILESET: STM-CATALOGS Description: HPUX STM Shared Libraries FILESET: STM-SHLIBS Description: HPUX STM Shared Libraries FILESET: STM-UI-RUN Description: HPUX STM User Interface FILESET: STM-UUT-RUN Description: HPUX STM Unit Under Test Runtime SD PRODUCT: EMS-Config Description: EMS Config FILESET: EMS-GUI Description: Event Monitoring Service Graphical User Interface SD PRODUCT: EMS-Core Description: EMS Core Product FILESET: EMS-CORE Description: Event Monitoring Service Core FilesYou can report defects related to EMS Hardware Monitors by filing a request on QUIX. If you do not have access to QUIX, contact your local HP representative to file a defect on your behalf.