STM: Support Tools for HP Computers (logo)

IA64_IPF CPU Expert Tool (mstm Online Help)

The CPU Expert Tool provides sophisticated troubleshooting for problems associated with a specific processor in a multi-processor environment. The ability to deactivate, reactivate, and assign a process to a processor allows on-line control of processor functionality.

The CPU Expert Tool is fully interactive, with its own set of menu commands.

CAUTION: The CPU Expert Tool contains commands which will cause severe performance degradation as a result of processor deactivation. It is assumed that users are aware of the dangers of its use.

To use the CPU Expert Tool safely, you should be trained and knowledgeable about CPU modules.

Introduction

The CPU Expert Tool allows on-line control of processor functionality in a multi-processor environment by issuing processor-specific commands.

This product provides individual operations that enable the user to identify a malfunctioning processor. The goal of the Expert Tool is to provide the user with an interactive tool that allows processor deactivating or reactivating of individual processor modules without requiring reset of the SPU or restart of the operating system. The ability to assign a specific process to an individual processor is an another capability of this product.

This tool should allow the multi-processor system under test to be reduced in scope via programmatic elimination of one or more modules. Thus, problems caused by a CPU module or specific process can be isolated without off-line testing.

The CPU Expert Tool requires that you install a HP-Only license at the main STM window. Consult a Mesa Administrator for the password.

If you experience problems, consult the Troubleshooting Tool-Related Problems help topic.

Step-by-Step Instructions

Running the CPU Expert Tool

To run the CPU Expert Tool on a device:

  1. At the main STM window, select a CPU device. The CPU Expert Tool will access all CPU devices on the Unit Under Test.
  2. If desired, change the options for the CPU Expert Tool at the main STM window. Options include the level of logging for the tool. For more information, see online help for STM as a whole (use the Help menu in the STM main window).
  3. Run the CPU Expert Tool.
  4. To select the desired commands, use the pull-down menus in the main window of the tool. For more information, see:
  5. To run an option commands, pull down the Options Menu and make a selection.
  6. To run exercise commands, pull down the CPU-Exercise Menu and make a selection.
  7. If you run into problems, see the Troubleshooting Tool-Related Problems help topic.
  8. Exit the tool by selecting the Exit command from the File pulldown Menu.

Troubleshooting a Suspect processor

To troubleshoot a suspect processor:

  1. Try running a specific process (identified by a PID Number) on a processor to identify process-specific problems.
  2. Deactivate all processors that may not have any problems, and assign a PID to activated processor that is suspected to be malfunctioning.
  3. Continue Deactivating and Reactivating Processors untill a malfunctioning processor is identified.
  4. If you get confused about which processors are disabled/enabled run the Display command, which will indicate the status of all processors.

Common Tasks

If you run into problems, see the Troubleshooting Tool-Related Problems help topic.

Troubleshooting Tool-Related Problems

Cpu Exercise

The function Exercise applied on a specified CPU to run stress tests on data cache, data tlb and floating point unit in turn.

Data cache exerciser -- exercises the data cache by forcing misses, replacements and flushes.

Data TLB exerciser -- exercises the data TLB by forcing page faults which forces TLB replacements.

Floating point exerciser -- exercises the floating point unit by providing a series of known inputs and checks for corresponding outputs. The benchmark algorithm used here is "tomcatv".

Execution Time Limit

The parameter Execution Time Limit can be specified by the user as an option for limiting the CPU exercise execution time in minutes. The default value is 10 minutes.

Error Allowed Number

The parameter Error Allowed Number can be specified by the user as an option for limiting the allowed number of errors before CPU exercise test termination. The default value is 10.

Commands and Pulldown menu

The CPU Expert Tool offers the following commands and Menu:

Save As

The Save As command in the File pulldown menu saves the contents of tool's main window. ( Only the most recent 200 lines can be saved.)

Print

The Print command in the the File pulldown menu prints the contents of tool's main window. ( Only the most recent 200 lines can be printed. )

Exit

Use the Exit command to terminate the execution of the CPU Expert Tool and return to the main STM window. If the user tries to exit without reactivating deactivated processor(s), a warning message will appear. The message will indicate that all processors have not been reactivated, providing an opportunity for the user to Cancel and return to the Expert menu to activate the CPUs.

Activate

The Activate command will only return a processor that has been previously Deactivated to normal system use. A deactivated processor that is also marked for deconfiguration may not be reactivated until it is marked for reconfiguration. Will return error message if activation routine does not return Successful or wrong pdev # was selected. The Activate command will display in the main window the SPU, Slot Number, and LID number along with the current number of activated processors after execution.

Assign

Forces (and locks) a process to a specific processor module. The user is required to find an active PID number from a different application (top, ps -ef).

Deactivate

Deactivates a processor from system use. A pdev number argument is required. The Expert Tool will display a warning message if there is only one processor currently activated or on any attempt to deaotivate the Monarch. The Deactivate command will display in the main window the SPU, Slot Number, and LID number along with the current number of activated processors after execution.

Display

The Display command will display a listing of the current status and information for all system processors:

SPU Number: Slot Number: LID: Active: (Active/Inactive) Monarch: (Yes/No)

Run

The Run command will launch the CPU Exerciser for a selected CPU. On the popup dialog box, the user can modify parameters of the exerciser such as the Execution Time Limit and the Error Allowed Number. By design, only one CPU exerciser can run at any time.

To monitor the running processes, the user may use the Top utility to obtain execution information on the running process.

The running record can be found on CPU Expert activity log.

Termination

The Termination command terminates running the exerciser as the user desires to stop the exercise earlier than the Cpu Exercise. The exercise termination message will be indicated on main window of this Expert Tool.

General Help

The "General Help" command displays online help for the CPU Expert Tool (the help system you are currently viewing).

For online help on STM as a whole, use the Help function keys in the STM main window.

Version

The "Version" command displays version and copyright information for the CPU Expert Tool.

Abort Button

The Abort button allows you to abort the CPU Expert Tool, halting its execution and returning to the STM main window.

The tool will be requested to abort and the state of the tool execution in the system map will be set to ABORT_PENDING. When the tool actually exits, its state will be set to ABORTED.

Interrupt Button

The Interrupt button allows you to interrupt the tool. When the tool receives the interrupt, it will stop whatever it is doing and prompt the user whether they want to continue with the current operation. If the choice is "Continue", the tool will continue with the current operation. If the choice is "Abort", the tool will abort the current operation and return to the window for the CPU Expert Tool.

Control-C

Pressing Control-C (CNTL-C) allows you to interrupt or abort execution of tool, or to escape to the user interface. After you press Control-C, the following function keys are displayed:

ESCAPE TO UI

The ESCAPE TO UI function key allows you to disconnect from the currently executing tool and return to the main User Interface (UI) command level. To reconnect to the tool, select the Attach to Tool menu item in the Tools->Tool Management pulldown menu.

INTERRUPT

The INTERRUPT function key allows you to interrupt the tool.

When the tool receives the interrupt, it stops whatever it is doing and asks you if you want to continue with the operation. If the reply is "Continue", the tool resumes what it was doing at the time of the interrupt. If the reply is "Abort", it terminates the operation and returns to the CPU Expert Tool main menu.

The tool may not immediately receive the interrupt as it could be in the middle of an operation that should not or could not be interrupted. The interrupt is received as soon as the tool completes the operation.

The tool will receive and ignore user interrupts when it is already in the process of returning to the main function keys of the tool.

ABORT

The ABORT function key allows you to abort the tool.

The tool will be requested to abort and the state of the tool execution in the system map will be set to ABORT_PENDING. When the tool actually exits, its state will be set to ABORTED.

CONTINUE

The CONTINUE function key allows you to continue with tool execution. The tool will continue execution just as if the Control-C keys had not been pressed.


Top of Page

/ Diagnostics HOME


URL: http://docs.hp.com/hpux/onlinedocs/diag/stm/help/expert/cpu_ia64m.htm
Last updated: Mon Apr 29 18:17:49 PDT 2002