The PCI Error Recovery feature detects, isolates, and automatically
recovers from a PCI error, avoiding a system crash. It is included
with the HP-UX 11i v3 operating system and is enabled by default.
With the PCI Error Recovery feature enabled, if an error occurs
on a PCI bus containing an I/O card that supports PCI Error Recovery:
The PCI bus is quarantined to isolate the system from further I/O
and prevent the error from damaging the system.
The PCI Error Recovery feature attempts to recover from the error
and reinitialize the bus so I/O can resume.
If an error occurs during the automated error recovery process,
the bus and I/O card will remain quiesced.
If the bus contains a card that supports online addition,
replacement, or deletion (OL*) and the card is in a hot-pluggable
slot, you can use the olrad command (or the attention button) to manually recover
from the error by replacing the card.
If the PCI Error Recovery feature is disabled and an error
occurs on a PCI bus, a Machine Check Abort (MCA) or a High Priority
Machine Check (HPMC) occurs and the system crashes.
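The behavior described above can be summarized as a small decision model. The following Python sketch is illustrative only: the enum and function names are hypothetical, nothing here is part of HP-UX, and it assumes the card's driver supports PCI Error Recovery (the text does not describe the unsupported case).

```python
from enum import Enum


class Outcome(Enum):
    """Possible results of a PCI bus error, as described in the text."""
    IO_RESUMED = "bus reinitialized, I/O resumes"
    QUIESCED_OLRAD = "bus and card stay quiesced; replace card via olrad or attention button"
    QUIESCED = "bus and card stay quiesced"
    SYSTEM_CRASH = "MCA or HPMC, then system crash"


def pci_error_outcome(feature_enabled, recovery_succeeds,
                      card_supports_ol_star, slot_is_hot_pluggable):
    """Hypothetical model; assumes the card supports PCI Error Recovery."""
    if not feature_enabled:
        # With the feature disabled, the error leads to an MCA or HPMC.
        return Outcome.SYSTEM_CRASH
    # With the feature enabled, the bus is quarantined first and
    # automatic recovery is then attempted.
    if recovery_succeeds:
        return Outcome.IO_RESUMED
    # Recovery failed: manual recovery is possible only for an OL* card
    # in a hot-pluggable slot.
    if card_supports_ol_star and slot_is_hot_pluggable:
        return Outcome.QUIESCED_OLRAD
    return Outcome.QUIESCED
```

For example, `pci_error_outcome(True, False, True, True)` models a failed automatic recovery on a hot-pluggable OL* card, where replacing the card is the manual way forward.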
CAUTION: If you use Serviceguard, HP recommends that you enable
the PCI Error Recovery feature only if your storage devices are
configured with multiple paths and you have not disabled HP-UX native
multipathing. If PCI Error Recovery is enabled, but your storage devices
are configured with only a single path, Serviceguard may not
detect when connectivity is lost. Serviceguard will not cause
a failover unless it detects a loss of connectivity. See the “Tunable
Kernel Parameters” section in the PCI Error Recovery Product
Note for instructions on using the pci_eh_enable tunable to disable
PCI Error Recovery.
For information on OL* operations, see the Interface Card
OL* Support Guide, available at http://docs.hp.com.
To determine if OL* is supported, see the I/O card documentation
or support matrix, available at http://docs.hp.com.
Summary of Change
What’s New for Customers Migrating from HP-UX 11i v1 September 2005?
PCI Error Recovery was never released on HP-UX 11i v1. It
is new for customers migrating from HP-UX 11i v1.
What’s New for Customers Migrating from HP-UX 11i v2 June 2006?
PCI Error Recovery was never released on HP-UX 11i v2. A similar
feature, PCI Error Handling, was released as a Software Pack
on HP-UX 11i v2. The main difference is that PCI Error Recovery
attempts to recover from a PCI error automatically, whereas PCI
Error Handling requires user intervention to attempt recovery.
Impact
The PCI Error Recovery feature attempts to automatically avoid
a system crash when a PCI error occurs.
Compatibility
PCI Error Recovery is enabled by default. If you use Serviceguard,
HP recommends that you enable the PCI Error Recovery feature only if
your storage devices are configured with multiple paths and you
have not disabled HP-UX native multipathing. If PCI Error Recovery
is enabled, but your storage devices are configured with only a
single path, Serviceguard may not detect when connectivity is lost.
If Serviceguard does not detect a loss of connectivity, it does not
cause a failover. See the “Tunable Kernel Parameters” section
in the PCI Error Recovery Product Note for
instructions on using the pci_eh_enable tunable to disable PCI Error Recovery.
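The Serviceguard recommendation above amounts to a simple check. The Python sketch below is a hypothetical illustration of that rule, not an HP tool: the function and parameter names are assumptions, and it treats "no Serviceguard" as "keep the default (enabled)" because the recommendation only addresses Serviceguard users.

```python
def keep_pci_error_recovery_enabled(uses_serviceguard,
                                    storage_is_multipath,
                                    native_multipathing_enabled):
    """Return True if the recommendation allows leaving the feature enabled.

    Hypothetical helper; it only encodes the rule stated in the text.
    """
    if not uses_serviceguard:
        # The recommendation applies only to Serviceguard users, so the
        # default (enabled) is assumed to stand.
        return True
    # With Serviceguard, both conditions must hold; otherwise a lost
    # single path may go undetected and no failover would be triggered.
    return storage_is_multipath and native_multipathing_enabled
```

When the function returns False, the Product Note's instructions for the pci_eh_enable tunable describe how to disable the feature.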
Performance
There are no known performance issues.
Documentation
Manpages:
Use the output of ioscan -P error_recovery (see the HP-UX 11i v3 ioscan
manpage) to determine which drivers (and associated PCI cards) running
on an HP-UX system support PCI Error Recovery.
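As a rough sketch of how that check might be scripted, the Python wrapper below runs the command named above and returns its raw output. The function name is hypothetical, a Python 3.7 or later interpreter is assumed to be available, and the output format is deliberately not parsed because it is not described here; consult the ioscan manpage before relying on it.

```python
import shutil
import subprocess

# Command taken from the text; nothing about its output format is assumed.
IOSCAN_CMD = ["ioscan", "-P", "error_recovery"]


def query_error_recovery_support():
    """Return raw ioscan output, or None if ioscan is unavailable or fails."""
    if shutil.which(IOSCAN_CMD[0]) is None:
        # ioscan is not on PATH (for example, when not running on HP-UX).
        return None
    result = subprocess.run(IOSCAN_CMD, capture_output=True,
                            text=True, check=False)
    return result.stdout if result.returncode == 0 else None
```

Returning None instead of raising keeps the sketch safe to call on systems where ioscan is not installed.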
Obsolescence
Not applicable.