These release notes cover the April 1999 (IPR 9904) release of the Diagnostic/IPR Media for HP-UX 11.00/10.20 running on S800/S700 systems.
- Overview
- Enabling Hardware Monitoring
- Documentation
- Changes
- Known Problems
- Monitors Provided
- Monitor Dependencies
- Defect Reporting
- SD Product Structure
Included on the Diagnostic/IPR Media CD-ROM are the EMS Hardware Monitors - an important tool for maintaining system availability. The EMS hardware monitors allow you to monitor the operation of a wide variety of hardware products and be alerted immediately if any failure or other unusual event occurs. Hardware event monitoring is available to users running HP-UX 10.20 or 11.X (IPR 9902 and later).
Hardware event monitoring provides a high level of protection against system hardware failure. By using hardware event monitoring, you can virtually eliminate undetected hardware failures that could interrupt system operation or cause data loss.
The EMS Hardware Monitors are installed with the Support Tools Manager. Once the monitoring software is installed you simply need to enable hardware monitoring and all supported hardware devices on your system will automatically be monitored.
To enable hardware event monitoring:
That's it. Event monitoring is now enabled.
- Run the monitoring request manager by typing: /etc/opt/resmon/lbin/monconfig
- From the main menu selection prompt, enter E(nable Monitoring)
The default monitoring requests will automatically provide the following notification methods for all monitors:
All events sent to text file /var/opt/resmon/log/event.log Serious, Criticial and Major Warning events sent to SYSLOG Serious, Criticial and Major Warning events sent to CONSOLE Serious, Criticial and Major Warning events sent to EMAIL address rootThe Hardware Monitoring Request Manager, /etc/opt/resmon/lbin/monconfig, can be used to customize the monitoring requests and add new ones.The Peripheral Status Monitor (PSM) is configured differently. They use the EMS GUI. See: http://docs.hp.com/hpux/onlinedocs/diag/ems/ems_gui.htm
For the latest and most complete information on EMS Hardware Monitors and the Support Tools Manager (STM), see the Web page "Diagnostics":
http://docs.hp.com/hpux/diag/At this site, you will find Overviews, Tutorials, Quick Reference Cards, Frequently Asked Questions (FAQs), and much other material.For complete information on installing and using EMS hardware monitors, as well as a list of supported hardware, refer to the "EMS Hardware Monitors User's Guide" available at the above site. An electronic copy of this book is also included on the Diagnostic/IPR Media CD-ROM in the /Documentation directory.
Changes in the EMS Hardware Monitors for IPR 9904 include:
- New monitor Fibre Channel Switch Monitor. This monitor supports the Gigabit Fibre Channel Switch (Model A5223A), and requires some initial configuration, similar to the Fibre Channel Hub Monitor. See the data sheet for this monitor on the Web site: http://docs.hp.com/hpux/diag/ or in the "EMS Hardware Monitors User's Manual."
- Events with Major Warnings are now sent to the default locations (console, syslog and email root). Previously only events with a severity of Serious and Critical were sent to the default locations. This change was implemented by modifying the *.sapcfg files. The rationale behind this change is that Major Warning is the recommended severity to be used when "A potential or impending problem has been detected that could eventually escalate to a serious problem. Normal use of the hardware is not likely to be impeded, and repair can be scheduled for a convenient time." In other words, events like the failure of a mirrored disk should have a severity of "Major Warning." It was felt that customers should be notified for events such as the failure of a mirrored disk.
- Added support for the SCSI Disk Enclosure in the High-Availability Storage System Monitor (dm_ses_enclosure).
- Added support for a new diagnostic event (#2035) in the Fibre Channel (FC) Mass Storage driver. If not running FC Mass Storage driver patch # PHSS_16319 or later, this event will be documented as "invalid event number".
- Fixed problem whereby the Fibre Channel Mass Storage Adapter monitor did not correctly support the interface to the ServiceGuard failover mechanism when Serious and Critical events occurred. This problem only occurred if customers configured ServiceGuard to fail-over to a redundant system if the FCMS Adapter failed.
- Fixed problems with the AutoRaid monitor:
- (SR GSY1605082): The request from EMS to monitor an AutoRAID array is rejected at boot up.
- (SR GSY1605069): The value of REPEAT_FREQUENCY in Global.cfg does not take effect on an AutoRaid device.
- (SR GSY1605285): When AutoRaid devices are present, ARMServer is not being checked to see if it is present.
- Fixed problem with PSM monitor, whereby the PSM monitor was failing to return status for some Fibre Channel Card Monitor hardware paths.
- Fixed problem with Fibre Channel Arbitrated Loop Hub Monitor (SR GSY1605161), whereby the status of Fibre Channel hubs could not be monitored (as opposed to events).
CAUTION: Monitoring Changes for disc30, sdisk and disk array devicesAs of IPR 9902 (Feb 99 release), there has been a change to the way that monitoring is done for disc30, sdisk and the HA Disk Array Models 10, 20, and 30FC.
Formerly, the "diaglogd exec" programs (pdisc30_exec, pharaymon_exec, and psdisk_exec) handled driver error entries for these devices.
As of IPR 9902, these programs have been deleted and their functionality is now provided by the EMS Hardware Monitors.
If you had customized the configuration files for the dialogd exec programs (disk30_exec.cfg, sdisk_exec.cfg, and haraymon_exec.cfg) you may wish to re-configure the EMS Hardware Monitors to achieve the same results.
CAUTION:Compatibility Problem with ServiceGuard and LockManagerFrom the February 1999 release (IPR 9902) onwards, the Support Tools (diagnostics) include EMS hardware monitors and EMS version A.03.00 on both HP-UX 10.20 and HP-UX 11.00.
This version of EMS is incompatible with ServiceGuard A.10.10, which includes version A.01.00 of EMS. It is also incompatible with ServiceGuard and LockManager versions A.11.01, A.11.02 and A.11.03, which include version A.02.00 of EMS.
If you run these releases of ServiceGuard or LockManager, you must upgrade them before installing the Support Tools on the February 1999 (IPR 9902) or newer releases.
On HP-UX 10.20 you should upgrade ServiceGuard to A.10.11 and on HP-UX 11.00 you should upgrade ServiceGuard or LockManager to release A.11.04 or newer.
If you do not upgrade, EMS will silently be upgraded to version A.03.00 when you install the diagnostics; ServiceGuard and LockManager will fail to work if you have any monitored resources. In this case, if you execute swverify or other SD-UX commands, you will see error messages like:
The corequisite "EMS-Core.EMS-CORE,r=A.01.00,a=HP-UX_B.10.20_800,v=HP" for fileset "Cluster-Monitor.CM-CORE,l=/,r=A.10.10" cannot be successfully resolved.If you have already loaded the diagnostics and therefore upgraded to EMS A.03.00 and are still running an incompatible release of ServiceGuard or LockManager, you should now upgrade to get your system into a supported and working state.There is no functional difference between ServiceGuard A.10.10 and ServiceGuard A.10.11, other than support for the new version of EMS and bug fixes. Functional differences for the 11.00 releases of ServiceGuard and LockManager can be found in the release notes.
Older versions of ServiceGuard and LockManager, for example A.10.06 and A.10.07.01, do not provide any support for EMS, and so are not affected by this issue.
Monitors are provided to support the following:
In addition, a Hardware status monitor is provided to monitor the current status of the products supported by the above list.
- HP Disk Arrays
- Fibre Channel Interconnect
- Fibre Channel Interface Cards
- High Availability Storage System Enclosures
- SCSI Tape Products
- HP SCSI Disk Products
- HP Fibre Channel Disk Products
- HP Fibre Channel Switch
- Memory
For detailed information concerning which products are supported by which monitors and additional dependencies, check the "Diagnostics" section of Hewlett-Packard's online documentation web site at http://docs.hp.com/hpux/diag/ .
Several of the monitors have special requirements, such as patches or certain versions of firmware. Current requirements are described in the "Supported Products" page under "EMS Hardware Monitors" at http://docs.hp.com/hpux/diag/ . Requirements are also listed in chapter 2 of the manual "EMS Hardware Monitors User's Guide".
Note: The Fibre Channel Arbitrated Loop Hub Monitor and the Fibre Channel Switch Monitor require special configuration which is described in their data sheets in the "EMS Hardware Monitors User's Guide" (chapter 6).
Use CHART to report defects in the EMS Hardware monitors. The project name is diag.hw_mon.hpux. If you don't have access to CHART, contact an HP representative to enter a defect for you.
The EMS hardware monitors are installed as part of the OnlineDiag bundle (product number B4708AA). In addition, they utilize the EMS framework, product number B7609BA.
Note: EMS Hardware Monitors are installed as part of the STM-UUT-RUN Fileset. However, the EMS Hardware Monitors are dependent on the EMS-Core and EMS-Config products and additional filesets in the Sup-Tool-Mgr Product.
For information on the STM product, refer to the STM release notes file /usr/sbin/stm/Rel_NOTES.STM.
SD Bundle: OnlineDiag Description: On-line Diagnostic System (Series 800/700) SD PRODUCT: Sup-Tool-Mgr Description: Support Tools Manager for HP-UX Systems SD SUB-PRODUCT: Manuals Description: Support Tools Manager Manual Pages FILESET: RELEASE_NOTES Description: HPUX STM Release Notes FILESET: STM-MAN Description: HPUX STM Manual Pages SD SUB-PRODUCT: Runtime Description: STM Manual Runtime FILESET: STM-CATALOGS Description: HPUX STM Shared Libraries FILESET: STM-SHLIBS Description: HPUX STM Shared Libraries FILESET: STM-UI-RUN Description: HPUX STM User Interface FILESET: STM-UUT-RUN Description: HPUX STM Unit Under Test Runtime SD PRODUCT: EMS-Config Description: EMS Config FILESET: EMS-GUI Description: Event Monitoring Service Graphical User Interface SD PRODUCT: EMS-Core Description: EMS Core Product FILESET: EMS-CORE Description: Event Monitoring Service Core Files