VMWARE VREALIZE OPERATIONS MANAGEMENT PACK FOR. HP Servers. User Guide

VMWARE VREALIZE OPERATIONS MANAGEMENT PACK FOR HP Servers User Guide TABLE OF CONTENTS 1. Purpose ....................................................
2 downloads 0 Views 2MB Size
VMWARE VREALIZE OPERATIONS MANAGEMENT PACK FOR

HP Servers User Guide

TABLE OF CONTENTS 1. Purpose ..................................................................................................................................................................... 3 2. Introduction to the Management Pack ...................................................................................... 3 2.1 How the Management Pack Collects Data ............................................................ 3 2.2 Data the Management Pack Collects .......................................................................... 3 2.3 Inventory Tree (Traversal Spec) .......................................................................................... 3 3. Dashboards ........................................................................................................................................................... 4 3.1 HP Overview ......................................................................................................................................... 4 3.2 HP Rack Overview .......................................................................................................................... 5 3.3 HP Chassis Overview .................................................................................................................. 6 3.4 HP Blade Overview ........................................................................................................................ 7 3.5 HP Health Investigation ............................................................................................................. 8 3.6 HP Hosted VMs ................................................................................................................................. 9 3.7 HP Hosted ESXi Hosts ............................................................................................................ 10 4. Tags ............................................................................................................................................................................ 10 5. Views......................................................................................................................................................................... 11 6. Reports ................................................................................................................................................................... 12 7. Alerts ......................................................................................................................................................................... 13 8. Analysis Badges ............................................................................................................................................ 13 9. Troubleshooting the Management Pack ............................................................................... 14 9.1 Troubleshooting an Adapter Instance ...................................................................... 14 9.2 Testing Connection Failures ............................................................................................... 14 9.3 Viewing System Log Files..................................................................................................... 14 10. Appendix I: Metrics ................................................................................................................................. 15 11. Appendix II: Alerts, Symptoms, & Recommendations ........................................ 25 12. Appendix III: Capacity Definitions .......................................................................................... 34

NOTE: This document supports the version of each product listed, as well as all subsequent versions, until a new edition replaces it.

You can find the most up-to-date technical documentation on the Blue Medora support site at: http://support.bluemedora.com. The Blue Medora website also provides the latest product updates. If you have comments about this documentation, submit your feedback to: [email protected]. 2

Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide

1. Purpose The Blue Medora VMware vRealize Operations (vROps) Management Pack for HP Servers User Guide describes the primary features of the Management Pack for HP Servers, including dashboards, views, reports, and alerts that allow users to optimize the monitoring and management of HP server resources from within vRealize Operations.

2. Introduction to the Management Pack The Management Pack for HP Servers is an embedded adapter for vRealize Operations (vROps) that monitors HP servers remotely. The Management Pack retrieves data regarding HP server resources by connecting to each supported HP ProLiant Server’s iLO REST API (via default port 443). This section includes the following topics: • How the Management Pack Collects Data • Data the Management Pack Collects 2.1 How the Management Pack Collects Data During each data collection cycle, the Management Pack opens an iLO REST API connection to the configured HP server(s) and queries it to retrieve metrics for HP server resources. NOTE: For a list of supported HP hardware, refer to the accompanying Blue Medora VMware vRealize Operations (vROps) Management Pack for HP Servers Installation & Configuration Guide. The collection interval for the adapter instance resource determines how often the Management Pack collects data. The default collection interval is five minutes.

Figure 1: Example Traversal Spec

The Management Pack supports Autodiscovery and manual discovery of resources. When you enable Autodiscovery for an adapter instance, the Management Pack creates resources in vRealize Operations and collects data after the main collection query runs. If a new resource belongs to a resource kind that does not exist in vROps, the Management Pack creates the resource kind. 2.2 Data the Management Pack Collects The Management Pack can collect performance data, relationships (associations), and events for the following HP Servers resources: 1. 2. 3. 4. 5. 6. 7. 8. 9.

Chassis Enclosure Blade Port (Optional) Rack Network Adapter Power Supply Fans Containers/Tags • VMware VMs on HP Servers* • VMware Hosts on HP Servers* • HP Servers • HP Tag * To allow for VMware relationships, you must enable IP addresses on the host systems.

2.3 Inventory Tree (Traversal Spec) The Inventory Tree (Traversal Spec) feature within vROps allows you to easily navigate your environment. The hierarchical structure implicitly shows relationships among resource kinds and enables quick drill-downs to root-cause issues.

3

Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide

3. Dashboards Dashboards are the primary user interface that allow users to monitor HP server resources from within vRealize Operations. The following dashboards are available in the Management Pack: • HP Overview • HP Rack Overview • HP Chassis Overview • HP Blade Overview • HP Health Investigation • HP Hosted VMs • HP Hosted ESXi Hosts NOTE: To filter by VMs and Hosts within the dashboards, refer to section “4. Tags” for instructions.

3.1 HP Overview The Overview dashboard provides at-a-glance heatmap views depicting the overall health of your HP Server resources (racks, chassis, power supplies, blades, fans, network adapters, etc.). Figure 2: HP Overview Dashboard

4

Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide

3.2 HP Rack Overview The HP Rack Overview dashboard allows you to select a rack to view its health, status, properties, power, child resources (fans, network adapters, ports, etc.), relationships, and alerts. Figure 3: HP Rack Overview Dashboard

5

Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide

3.3 HP Chassis Overview The HP Chassis Overview dashboard displays an overview of your HP Chassis and its child resources at-aglance. Figure 4: HP Chassis Overview Dashboard

6

Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide

3.4 HP Blade Overview The HP Blade Overview dashboard allows you to select a blade to view its health, status, properties, child resources (network adapters, ports, etc.), relationships, and alerts. Figure 5: HP Blade Overview Dashboard

7

Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide

3.5 HP Health Investigation The Health Investigation dashboard allows you to view health status, top alerts, and key performance Metrics (KPIs) for a selected resource in your HP Servers environment. Figure 6: HP Health Investigation Dashboard

8

Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide

3.6 HP Hosted VMs The Hosted VMs dashboard allows you to select a Virtual Machine to view its parent ESXi host, related ESXi hosts and HP hardware, as well as KPIs for the VM. Figure 7: HP Hosted VMs Dashboard

9

Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide

3.7 HP Hosted ESXi Hosts The HP Hosted ESXi Hosts dashboard allows you to select an ESXi Host to view its properties, relationships, and KPIs, as well as its host HP Server and related KPIs. Figure 8: HP Hosted ESXi Hosts Dashboard

4. Tags To further customize how your Management Pack dashboards are displayed, four tags are currently available for filtering: • • • •

VMware VMs on HP Servers VMware Hosts on HP Servers HP Servers HP Tag

To select a tag, perform the following steps: NOTE: Depending on the dashboard, the steps for selecting a tag may vary slightly. 1. 2. 3. 4. 5. 6.

10

Click on the Content navigation shortcut ( ). Click on the Dashboards view in the navigation pane. Select Edit ( ) on the HP Servers widget. Select Edit Widget ( ) for the widget you want to edit. Expand the HP Servers Container option. Select the desired tag and click Save.

Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide

5. Views The vROps Management Pack for HP Servers creates views that allow the user to view statistics of metrics for various HP Servers resources. The views help give a broad picture of the entire system, as opposed to a more in depth view. Table 1: Management Pack Views VIEW

TYPE

DESCRIPTION

Average HP Health Overview

List

Displays average health and availability over the past week.

HP Alerts

List

Displays alert counts by severity.

HP Capacity

List

Displays capacity (%) and time (days) remaining.

HP Chassis Average KPIs

List

Displays average chassis KPIs over the past week.

HP Fan Average KPIs

List

Displays average fan KPIs over the past week

HP Health Overview

List

Displays current health and availability.

HP Network Adapter Average KPIs

List

Displays average network adapter KPIs over the past week.

HP Port Average KPIs

List

Displays average port KPIs over the past week.

HP Power Supply Average KPIs

List

Displays average power supply KPIs over the past week.

HP Server Information

List

Displays KPIs and alert counts by severity for HP blade and rack servers.

To access the Management Pack views, go to Environment > All Objects > HP Servers and double-click on the desired Object (resource). Select the Details tab, then Views. The available views for that resource are listed and can be selected. Figure 9: Accessing Management Pack Views

11

Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide

6. Reports The vROps Management Pack for HP Servers contains custom reports, as listed below. The reports can be exported and easily shared with key stakeholders in either .pdf or .csv formats. • • • • • • • • •

HP Alerts HP Capacity HP Chassis Average KPIs HP Fan Average KPIs HP Health Overview HP Network Adapter Average KPIs HP Port Average KPIs HP Power Supply Average KPIs HP Server Information

To access the Management Pack reports, go to Environment > All Objects > HP Servers and double-click on the desired Object (resource). Select the Reports tab, then Report Templates. Figure 10: Accessing Management Pack Reports

To run the selected report, click the Run Template icon ( your preferred format.

12

), then click Generated Reports, to select the report in

Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide

7. Alerts The Management Pack for HP Servers pulls health status and events from HP server resources and displays them in vRealize Operations as alerts. Refer to section “10. Appendix II: Alerts, Symptoms, & Recommendations” for the full list of alerts, symptoms, and recommendations. Figure 11: Alert Example

8. Analysis Badges Using the predictive analytics capabilities of the vROps Analysis Badges through capacity definitions, the Management Pack for HP Servers populates Power Capacity for the Chassis and Rack resource kinds as well as Power Output Capacity for Power Supplies. For details, refer to section. “11. Appendix III: Capacity Definitions”. Figure 12: Capacity Badge Example

13

Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide

9. Troubleshooting the Management Pack Known troubleshooting information can help you diagnose and correct common problems with the Management Pack for HP Servers. This section includes the following topics: • Troubleshooting an Adapter Instance • Testing Connection Failures • Viewing System Log Files 9.1 Troubleshooting an Adapter Instance Perform these general troubleshooting steps to diagnose and correct problems with an adapter instance: • Edit the adapter instance and click Test Connection to verify the connection to vROps. Refer to section “8.2 Testing Connection Failures”. • View the collection status and collection state for the adapter instance resource on the Environment Overview page in vROps. • Check the adapter and collector logs for errors. Refer to section “8.3 Viewing System Log Files”, for details. 9.2 Testing Connection Failures When clicking Test Connection in the Manage Solution window when adding an adapter instance, the following connection errors are possible. 9.2.1 Missing Connection Information Ensure the following information was entered correctly: • Host name or IP address (single, or comma-separated, hostname(s) or IP addresses) • Port number (if other than default of 443) • iLO User Name and Password • Timeout setting (must be a positive integer value) NOTE: The iLO provides a security feature that disables logins after a number of failed attempts. This feature does not need to be changed in order for the Management Pack to work, but the lockout does also apply to connections made by the Management Pack. As a result, a streak of consecutive failed connection tests could cause the iLO to refuse future connections for a period of time. To check if this security feature is causing Test Connection to fail, open the iLO login page. If logins are disabled, there will be a message to alert you. The settings for delay time and number of failed login attempts needed to cause a delay can be adjusted in the iLO settings (Administration > Access Settings). 9.3 Viewing System Log Files You can view adapter errors in the adapter and collector log files. You can view the adapter and collector log files in the vROps user interface or in an external log viewer. The adapter log files are in the $ALIVE_BASE/user/log/adapters/hpcompute_adapter3/ folder. The collector log files are in the $ALIVE_BASE/user/log/ folder. The logging level is set to ERROR by default. To troubleshoot issues, set the logging level to INFO. To view detailed messages, including micro steps, queries, and returned results, set the logging level to DEBUG. You can set the base log level for the collector via Administrator > Support > Logs > Select COLLECTOR folder > Select Edit Properties icon > Edit Root logger level. NOTE: If you set the logging level to DEBUG, log files can become large very quickly. Set the logging level to DEBUG only for short periods of time. For complete information about viewing log files and modifying log levels, refer to the vROps online help.

14

Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide

10. Appendix I: Metrics The Management Pack for HP Servers collects the following metrics by HP Resource Kinds. Table 2: Management Pack Metrics Resource Kind

Resource Group

Resource Attribute

Attribute Type

Attribute Description

HP Servers Adapter Instance

License

Blades and Racks

double

Number of blade and rack licenses required for this adapter instance.

HP Servers Container

Relationships

Enclosure Children

string

All HP enclosures.

HP Servers Container

Relationships

Rack Children

string

All HP racks.

VMware VMs on HP Container

Relationships

Virtual Machine Children

string

The virtual machines running on HP servers.

VMware Hosts on HP Container

Relationships

Host System Children

string

The host systems running on HP servers.

HP Enclosure

Relationships

Servers Parent

string

Servers container.

HP Enclosure

Relationships

Chassis Children

string

All the chassis in this enclosure.

HP Enclosure

General

Name

string

The name of the enclosure.

HP Enclosure

General

UUID

string

The universal unique identifier for this enclosure.

HP Blade

Relationships

Network Adapter Child

string

The network adapter on the blade.

HP Blade

Relationships

Chassis Parent

string

The chassis this blade is in.

HP Blade

General

Serial Number

string

The blade serial number.

HP Blade

General

UUID

string

The universal unique identifier for this blade.

HP Blade

General

Manufacturer

string

The manufacturer or OEM of this blade.

HP Blade

General

Model

string

The model information that the manufacturer uses to refer to this blade.

HP Blade

General

SKU

string

SKU for the blade.

HP Blade

General

Asset Tag

string

A user-definable tag that is used to track this blade for inventory or other client purposes.

HP Blade

Firmware

BIOS Version

string

The version of the blade BIOS or primary blade firmware.

HP Blade

Firmware

Platform Definition Firmware Version

string

Platform Definition Table version.

15

Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide

Resource Kind

Resource Group

Resource Attribute

Attribute Type

Attribute Description

HP Blade

Firmware

Power Management Controller Firmware Version

string

The Power Management Microcontroller firmware version.

HP Blade

Firmware

Power Management Controller Bootloader Firmware Version

string

The Power Management Microcontroller firmware bootloader version.

HP Blade

Firmware

SAS PLD Firmware Version

string

SAS Programmable Logic Device version.

HP Blade

Firmware

SPS Firmware Version

string

SPS Firmware version

HP Blade

Firmware

System PLD Firmware Version

string

The firmware version of the CPLD.

HP Blade

Processor

Processor Count

string

The number of processors in the blade.

HP Blade

Processor

Processor Model

string

The processor model for the primary or majority of processors in this blade.

HP Blade

Processor

Processor Health

string

This represents the health state of the processors in the absence of its dependent resources.

HP Blade

Power

Power State

string

This is the current power state of the blade.

HP Blade

Power

Power Capacity

double

The total amount of power allocated to the blade.

HP Blade

Memory

Memory Health

string

This represents the health state of the memory in the absence of its dependent resources.

HP Blade

Memory

Total Memory

double

The total amount of memory in the blade.

HP Blade

Status

State

string

This indicates the known state of the blade, such as if it is enabled.

HP Blade

Status

Health

string

This represents the health state of this blade in the absence of its dependent resources.

HP Network Adapter

Relationships

Blade Parent

string

The blade this network adapter is on.

HP Network Adapter

Relationships

Rack Parent

string

The rack system this network adapter is on.

HP Network Adapter

Relationships

Port Child

string

Port that this network adapter has.

16

Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide

Resource Kind

Resource Group

Resource Attribute

Attribute Type

Attribute Description

HP Network Adapter

General

IP Addresses

string

The IP addresses of the ports on the network adapter.

HP Network Adapter

General

Serial Number

string

The device serial number.

HP Network Adapter

General

Part Number

string

The device part number.

HP Network Adapter

Firmware

Firmware Version

string

This string represents the version of the firmware image.

HP Network Adapter

Status

State

string

This indicates the known state of the port, such as if it is enabled.

HP Network Adapter

Status

Health

string

This represents the health state of this resource in the absence of its dependent resources.

HP Network Adapter

Performance

Bad Receives

double

A count of frames that were received by the adapter but which had an error. This counter is the sum of mib items cpqNicIfPhysAdapterAlignmentErrors, cpqNicIfPhysAdapterFCSErrors, cpqNicIfPhysAdapterFrameTooLongs, and cpqNicIfPhysAdapterInternalMacReceiveErrors. If this counter increments frequently, check the more detailed error statistics and take appropriate action.

HP Network Adapter

Performance

Bad Transmits

double

A count of frames that were not transmitted by the adapter because of an error. This counter is the sum of MIB items cpqNicIfPhysAdapterDeferredTransmissions, cpqNicIfPhysAdapterLateCollisions, cpqNicIfPhysAdapterExcessiveCollisions, cpqNicIfPhysAdapterCarrierSenseErrors, and cpqNicIfPhysAdapterInternalMacTransmitErrors. If this counter increments frequently, check the more detailed error statistics and take appropriate action.

HP Network Adapter

Performance

Good Receives

double

A count of frames successfully received by the physical adapter.

17

Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide

Resource Kind

Resource Group

Resource Attribute

Attribute Type

Attribute Description

HP Network Adapter

Performance

Good Transmits

double

A count of frames successfully transmitted by the physical adapter.

HP Port

Relationships

Network Adapter Parent

string

The network adapter this port is on.

HP Port

General

MAC Address

string

The port MAC address.

HP Port

General

Full Duplex

string

Full-duplex data transmission means that data can be transmitted in both directions on a signal carrier at the same time.

HP Port

General

IPv4 Address

string

This is the IPv4 Address.

HP Port

General

IPv6 Address

string

This is the IPv6 Address.

HP Port

Performance

Bad Receives

double

A count of frames that were received by the adapter but which had an error. This counter is the sum of mib items cpqNicIfPhysAdapterAlignmentErrors, cpqNicIfPhysAdapterFCSErrors, cpqNicIfPhysAdapterFrameTooLongs, and cpqNicIfPhysAdapterInternalMacReceiveErrors. If this counter increments frequently, check the more detailed error statistics and take appropriate action.

HP Port

Performance

Bad Transmits

double

A count of frames that were not transmitted by the adapter because of an error. This counter is the sum of MIB items cpqNicIfPhysAdapterDeferredTransmissions, cpqNicIfPhysAdapterLateCollisions, cpqNicIfPhysAdapterExcessiveCollisions, cpqNicIfPhysAdapterCarrierSenseErrors, and cpqNicIfPhysAdapterInternalMacTransmitErrors. If this counter increments frequently, check the more detailed error statistics and take appropriate action.

HP Port

Performance

Good Receives

double

A count of frames successfully received by the physical adapter.

HP Port

Performance

Good Transmits

double

A count of frames successfully transmitted by the physical adapter.

18

Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide

Resource Kind

Resource Group

Resource Attribute

Attribute Type

Attribute Description

HP Port

Performance

Speed

double

An estimate of the interface's current bandwidth. For interfaces which do not vary in bandwidth or for those where no accurate estimation can be made, this object should contain the nominal bandwidth.

HP Port

Status

State

string

This indicates the known state of the port, such as if it is enabled.

HP Port

Status

Health

string

This represents the health state of this resource in the absence of its dependent resources.

HP Chassis

Relationships

Enclosure Parent

string

Enclosure this chassis is on.

HP Chassis

Relationships

Power Supply Child

string

Power Supply this chassis has.

HP Chassis

Relationships

Blade Child

string

Blades this chassis has.

HP Chassis

Relationships

Fan Child

string

Fans this chassis has.

HP Chassis

General

Model

string

The chassis model number.

HP Chassis

General

Serial Number

string

The chassis serial number.

HP Chassis

General

Chassis Type

string

This property indicates the physical form factor type of this chassis.

HP Chassis

General

Manufacturer

string

The chassis manufacturer.

HP Chassis

General

UUID

string

The chassis UUID.

HP Chassis

General

Name

string

The chassis name.

HP Chassis

General

SKU

string

SKU for the chassis.

HP Chassis

General

Version

string

The chassis version.

HP Chassis

General

Asset Tag

string

The chassis user-assigned asset tag.

HP Chassis

General

Consumed Height (Bays)

string

The number of enclosure bays this chassis consumes in height.

HP Chassis

General

Consumed Width (Bays)

string

The number of enclosure bays this chassis consumes in width.

HP Chassis

General

Bay Number

string

The position of the chassis inside an enclosure.

HP Chassis

General

Height (U)

string

The chassis rack U height.

HP Chassis

General

U Location

string

The chassis rack U location.

19

Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide

Resource Kind

Resource Group

Resource Attribute

Attribute Type

Attribute Description

HP Chassis

General

U Position

string

The chassis U position in the rack.

HP Chassis

General

Rack Part Number

string

The chassis rack part number.

HP Chassis

Status

State

string

This indicates the known state of the chassis, such as if it is enabled.

HP Chassis

Status

Health

string

This represents the health state of this chassis in the absence of its dependent resources.

HP Chassis

Firmware

SAS PLD Firmware Version

string

The firmware version of the SAS controller.

HP Chassis

Firmware

SPS Firmware Version

string

The SPS FW Version number, aka ME FW Version, AAAA.BBBB. CCCC.DDDD.E

HP Chassis

Firmware

System PLD Firmware Version

string

The firmware version of the CPLD.

HP Chassis

Firmware

Power Management Controller Firmware Version

string

The firmware version of the Power Monitor.

HP Chassis

Firmware

Power Management Controller Boot Loader Firmware Version

string

The firmware version of the Power Monitor boot loader.

HP Chassis

Firmware

Platform Definition Firmware Version

string

The version of the Intelligent Platform Abstraction Data.

HP Chassis

Power

Power Management Hardware Family

string

The family type of the Power Monitor hardware.

HP Chassis

Power

Power Capacity

double

The total power (Watts) available to the chassis from all power supplies (adjusting for redundancy settings).

HP Chassis

Power

Power Metrics Interval

double

The interval between power metric evaluation in minutes.

HP Chassis

Power

Minimum Power Consumed

double

The minimum power consumed during the interval specified by IntervalInMin.

HP Chassis

Power

Maximum Power Consumed

double

The maximum power consumed during the interval specified by IntervalInMin.

20

Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide

Resource Kind

Resource Group

Resource Attribute

Attribute Type

Attribute Description

HP Chassis

Power

Average Power Consumed

double

The average power consumed during the interval specified by IntervalInMin.

HP Chassis

Power

Power Consumed

double

The latest observed power (Watts) being drawn by this chassis. The update interval may vary depending upon implementation but is usually measured in seconds.

HP Rack

Relationships

Servers Parent

string

Servers container.

HP Rack

Relationships

Enclosure Parent

string

Enclosure this rack is on.

HP Rack

Relationships

Power Supply Child

string

Power Supply this rack has.

HP Rack

Relationships

Network Adapter Child

string

Network adapter this rack has.

HP Rack

Relationships

Fan Child

string

Fan this rack has.

HP Rack

General

Serial Number

string

The rack serial number.

HP Rack

General

Chassis Type

string

This property indicates the physical form factor type of this rack.

HP Rack

General

Manufacturer

string

The rack manufacturer.

HP Rack

General

Model

string

The rack model number.

HP Rack

General

SKU

string

SKU for the rack.

HP Rack

General

Version

string

The rack version.

HP Rack

General

Asset Tag

string

The rack’s user-assigned asset tag.

HP Rack

General

UUID

string

The rack UUID provided by SMBIOS.

HP Rack

General

Consumed Height (Bays)

string

The number of enclosure bays this rack consumes in height.

HP Rack

General

Consumed Width (Bays)

string

The number of enclosure bays this rack consumes in width.

HP Rack

General

Bay Number

string

The position of the rack inside an enclosure.

HP Rack

General

Height (U)

string

The rack U height.

HP Rack

General

U Location

string

The rack U location.

HP Rack

General

U Position

string

The rack U position in the rack.

HP Rack

General

Enclosure

string

The name of the rack enclosure.

21

Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide

Resource Kind

Resource Group

Resource Attribute

Attribute Type

Attribute Description

HP Rack

Status

State

string

This indicates the known state of the rack, such as if it is enabled.

HP Rack

Status

Health

string

This represents the health state of this rack in the absence of its dependent resources.

HP Rack

Firmware

BIOS Version

string

The version of the blade BIOS or primary blade firmware.

HP Rack

Firmware

SAS PLD Firmware Version

string

The firmware version of the SAS controller.

HP Rack

Firmware

SPS Firmware Version

string

The SPS FW Version number, aka ME FW Version, AAAA.BBBB. CCCC.DDDD.E

HP Rack

Firmware

System PLD Firmware Version

string

The firmware version of the CPLD.

HP Rack

Firmware

Power Management Controller Firmware Version

string

The firmware version of the Power Monitor.

HP Rack

Firmware

Power Management Controller Boot Loader Firmware Version

string

The firmware version of the Power Monitor boot loader.

HP Rack

Firmware

Platform Definition Firmware Version

string

The version of the Intelligent Platform Abstraction Data.

HP Rack

Power

Power Management Hardware

string

The family type of the Power Monitor hardware.

HP Rack

Power

Power State

string

This is the current power state of the blade.

HP Rack

Power

Power Limit

double

The total amount of power allocated to the blade.

HP Rack

Power

Power Capacity

double

The total power (Watts) available to the chassis from all power supplies (adjusting for redundancy settings).

HP Rack

Power

Power Metrics Interval

double

The interval between power metric evaluation in minutes.

HP Rack

Power

Minimum Power Consumed

double

The minimum power consumed during the interval specified by IntervalInMin.

HP Rack

Power

Maximum Power Consumed

double

The maximum power consumed during the interval specified by IntervalInMin.

22

Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide

Resource Kind

Resource Group

Resource Attribute

Attribute Type

Attribute Description

HP Rack

Power

Average Power Consumed

double

The average power consumed during the interval specified by IntervalInMin.

HP Rack

Power

Power Consumed

double

The latest observed power (Watts) being drawn by this chassis. The update interval may vary depending upon implementation but is usually measured in seconds.

HP Rack

Processor

Processor Count

string

The number of processors in the rack system.

HP Rack

Processor

Processor Model

string

The processor model for the primary or majority of processors in this rack system.

HP Rack

Processor

Processor Health

string

This represents the health state of the processors in the absence of its dependent resources.

HP Rack

Memory

Memory Health

string

This represents the health state of the memory in the absence of its dependent resources.

HP Rack

Memory

Total Memory

double

The total amount of memory in the rack system.

HP Power Supply

Relationships

Chassis Parent

string

Chassis this power supply is on.

HP Power Supply

Relationships

Rack Parent

string

Rack this power supply is on.

HP Power Supply

General

Serial Number

string

The serial number for this Power Supply

HP Power Supply

General

Bay Number

string

The power supply bay number.

HP Power Supply

General

Spare Part Number

string

The part number for this Power Supply

HP Power Supply

General

Model

string

The model number for this Power Supply.

HP Power Supply

General

Type

string

The Power Supply type (AC or DC).

HP Power Supply

General

Hotplug Capable

string

If true, this power supply (and power supply bay) is capable of being hotplugged.

HP Power Supply

Firmware

Firmware Version

string

The firmware version for this Power Supply.

23

Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide

Resource Kind

Resource Group

Resource Attribute

Attribute Type

Attribute Description

HP Power Supply

Status

State

string

This indicates the known state of the power supply, such as if it is enabled.

HP Power Supply

Status

Health

string

This represents the health state of this power supply in the absence of its dependent resources.

HP Power Supply

Performance

Power Capacity

double

The maximum capacity of this Power Supply.

HP Power Supply

Performance

Average Power Output

double

The average power output of this Power Supply.

HP Fan

Relationships

Chassis Parent

string

Chassis this fan is on.

HP Fan

Relationships

Rack Parent

string

Rack this fan is on.

HP Fan

General

Chassis Serial Number

string

The serial number of the chassis this fan is on.

HP Fan

General

Name

string

The name of the fan sensor.

HP Fan

General

Fan Location

string

The area or device to which this fan is located.

HP Fan

Status

State

string

The state of the fan.

HP Fan

Status

Health

string

The health of the fan.

HP Fan

Performance

Current Utilization (%)

double

The current utilization (% of max speed) of the fan.

HP Fan

Performance

Current Speed (RPM) double

24

Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide

The current speed of the fan.

11. Appendix II: Alerts, Symptoms, & Recommendations The vROps Management Pack for HP Servers creates alerts and provides recommended actions based on various symptoms that it detects in the environment. See the table below for details regarding each alert. Table 3: Alerts, Symptoms, & Recommendations Name

Description

Symptom

Recommendation

Blade Health: Warning

Blade Health: Warning

Blade Health: Warning

Blade health is degraded. Review the health of this blade's related components to diagnose the problem.

Blade Health: Critical

Blade Health: Critical

Blade Health: Critical

Blade health is failed. Review the health of this blade's related components to diagnose the problem.

Blade Memory Health: Warning

Blade Memory Blade Memory Health: Health: Warning Warning

Blade memory health is degraded. Recommended actions: - Be sure the memory meets the blade requirements and is installed as required by the blade. - Some blades may require that memory banks be populated fully or that all memory within a memory bank must be the same size, type, and speed. To determine if the memory is installed properly, see the blade documentation. - Check any blade LEDs that correspond to memory slots. - If you are unsure which DIMM has failed, test each bank of DIMMs by removing all other DIMMs. Then isolate the failed DIMM by switching each DIMM in a bank with a known working DIMM. - Remove any third-party memory and run HP Insight Diagnostics.

Rack Memory Health: Warning

Rack Memory Health: Warning

Rack memory health is degraded. Recommended actions: - Be sure the memory meets the rack requirements and is installed as required by the rack. - Some racks may require that memory banks be populated fully or that all memory within a memory bank must be the same size, type, and speed. To determine if the memory is installed properly, see the rack documentation. - Check any rack LEDs that correspond to memory slots. - If you are unsure which DIMM has failed, test each bank of DIMMs by removing all other DIMMs. Then, isolate the failed DIMM by switching each DIMM in a bank with a known working DIMM. - Remove any third-party memory and run HP Insight Diagnostics.

Blade Memory Health: Critical

Blade Memory Blade Memory Health: Critical Health: Critical

25

Rack Memory Health: Warning

Blade memory health is failed. Recommended actions: Be sure the memory meets the blade requirements and is installed as required by the blade. - Some blades may require that memory banks be populated fully or that all memory within a memory bank must be the same size, type, and speed. To determine if the memory is installed properly, see the blade documentation. - Check any blade LEDs that correspond to memory slots. - If you are unsure which DIMM has failed, test each bank of DIMMs by removing all other DIMMs. Then, isolate the failed DIMM by switching each DIMM in a bank with a known working DIMM. - Remove any third-party memory and run HP Insight Diagnostics.

Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide

Name

Description

Rack Memory Health: Critical

Rack Memory Rack Memory Health: Critical Health: Critical

Rack memory health is failed. Recommended actions: Be sure the memory meets the rack requirements and is installed as required by the rack. - Some racks may require that memory banks be populated fully or that all memory within a memory bank must be the same size, type, and speed. To determine if the memory is installed properly, see the rack documentation. - Check any rack LEDs that correspond to memory slots. - If you are unsure which DIMM has failed, test each bank of DIMMs by removing all other DIMMs. Then, isolate the failed DIMM by switching each DIMM in a bank with a known working DIMM. - Remove any third-party memory and run HP Insight Diagnostics.

Blade Processor Health: Warning

Blade Processor Health: Warning

Blade processor health is degraded. Recommended actions: - Be sure each processor is supported by the blade and is installed as directed in the blade documentation. The processor socket requires very specific installation steps and only supported processors should be installed. For processor requirements, see the blade documentation. - Be sure the blade ROM is current. - Be sure you are not mixing processor stepping, core speeds, or cache sizes if this is not supported on the blade. For more information, see the blade documentation. CAUTION: Removal of some processors and heatsinks require special considerations for replacement, while other processors and heatsinks are integrated and cannot be reused once separated. For specific instructions for the blade you are troubleshooting, refer to processor information in the blade user guide. - If the blade has only one processor installed, reseat the processor. If the problem is resolved after you restart the blade, the processor was not installed properly. - If the blade has only one processor installed, replace it with a known functional processor. If the problem is resolved after you restart the blade, the original processor failed. - If the blade has multiple processors installed, test each processor: 1. Remove all but one processor from the blade. Replace each with a processor terminator board or blank, if applicable to the blade. 2. Replace the remaining processor with a known functional processor. If the problem is resolved after you restart the blade, a fault exists with one or more of the original processors. Install each processor one by one, restarting each time, to find the faulty processor or processors. At each step, be sure the blade supports the processor configurations.

26

Symptom

Blade Processor Health: Warning

Recommendation

Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide

Name

Description

Symptom

Recommendation

Rack Processor Health: Warning

Rack Processor Health: Warning

Rack Processor Health: Warning

Rack processor health is degraded. Recommended actions: - Be sure each processor is supported by the rack and is installed as directed in the rack documentation. The processor socket requires very specific installation steps and only supported processors should be installed. For processor requirements, see the rack documentation. - Be sure the rack ROM is current. - Be sure you are not mixing processor stepping, core speeds, or cache sizes if this is not supported on the rack. For more information, see the rack documentation. CAUTION: Removal of some processors and heatsinks require special considerations for replacement, while other processors and heatsinks are integrated and cannot be reused once separated. For specific instructions for the rack you are troubleshooting, refer to processor information in the rack user guide. If the rack has only one processor installed, reseat the processor. If the problem is resolved after you restart the rack, the processor was not installed properly. - If the rack has only one processor installed, replace it with a known functional processor. If the problem is resolved after you restart the rack, the original processor failed. If the rack has multiple processors installed, test each processor: 1. Remove all but one processor from the rack. Replace each with a processor terminator board or blank, if applicable to the rack. 2. Replace the remaining processor with a known functional processor. If the problem is resolved after you restart the rack, a fault exists with one or more of the original processors. Install each processor one by one, restarting each time, to find the faulty processor or processors. At each step, be sure the rack supports the processor configurations.

27

Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide

Name

Description

Blade Processor Health: Critical

Blade Blade Processor Processor Health: Critical Health: Critical

28

Symptom

Recommendation Blade processor health is failed. Recommended actions: - Be sure each processor is supported by the blade and is installed as directed in the blade documentation. The processor socket requires very specific installation steps and only supported processors should be installed. For processor requirements, see the blade documentation. Be sure the blade ROM is current. - Be sure you are not mixing processor stepping, core speeds, or cache sizes if this is not supported on the blade. For more information, see the blade documentation. CAUTION: Removal of some processors and heatsinks require special considerations for replacement, while other processors and heatsinks are integrated and cannot be reused once separated. For specific instructions for the blade you are troubleshooting, refer to processor information in the blade user guide. If the blade has only one processor installed, reseat the processor. If the problem is resolved after you restart the blade, the processor was not installed properly. If the blade has only one processor installed, replace it with a known functional processor. If the problem is resolved after you restart the blade, the original processor failed. - If the blade has multiple processors installed, test each processor: 1. Remove all but one processor from the blade. Replace each with a processor terminator board or blank, if applicable to the blade. 2. Replace the remaining processor with a known functional processor. If the problem is resolved after you restart the blade, a fault exists with one or more of the original processors. Install each processor one by one, restarting each time, to find the faulty processor or processors. At each step, be sure the blade supports the processor configurations.

Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide

Name

Description

Rack Processor Health: Critical

Rack Rack Processor Processor Health: Critical Health: Critical

Network Adapter Network Health: Warning Adapter Health: Warning

Symptom

Recommendation Rack processor health is failed. Recommended actions: - Be sure each processor is supported by the rack and is installed as directed in the rack documentation. The processor socket requires very specific installation steps and only supported processors should be installed. For processor requirements, see the rack documentation. Be sure the rack ROM is current. - Be sure you are not mixing processor stepping, core speeds, or cache sizes if this is not supported on the rack. For more information, see the rack documentation. CAUTION: Removal of some processors and heatsinks require special considerations for replacement, while other processors and heatsinks are integrated and cannot be reused once separated. For specific instructions for the rack you are troubleshooting, refer to processor information in the rack user guide. If the rack has only one processor installed, reseat the processor. If the problem is resolved after you restart the rack, the processor was not installed properly. - If the rack has only one processor installed, replace it with a known functional processor. If the problem is resolved after you restart the rack, the original processor failed. If the rack has multiple processors installed, test each processor: 1. Remove all but one processor from the rack. Replace each with a processor terminator board or blank, if applicable to the rack. 2. Replace the remaining processor with a known functional processor. If the problem is resolved after you restart the rack, a fault exists with one or more of the original processors. Install each processor one by one, restarting each time, to find the faulty processor or processors. At each step, be sure the rack supports the processor configurations.

Network Adapter Health: Warning

Network adapter health is degraded. Recommended action: - Reseat the network adapter and restart the server. - Review the signal backplane on the server or the midplane for damage. - Replace the adapter.

Network Adapter Network Network Adapter Health: Critical Adapter Health: Critical Health: Critical

Network adapter health is failed. Recommended action: - Reseat the network adapter and restart the server. Review the signal backplane on the server or the midplane for damage. - Replace the adapter.

Port Health: Warning

Port Health: Warning

Port Health: Warning

Port health is degraded. Recommended action: - Review the signal backplane on the server or the midplane for damage. - Replace the network adapter.

Port Health: Critical

Port Health: Critical

Port Health: Critical

Port health is failed. Recommended action: - Review the signal backplane on the server or the midplane for damage. - Replace the network adapter.

Chassis Health: Warning

Chassis Health: Warning

Chassis Health: Warning

Chassis health is degraded. Review the health of this chassis' related components to diagnose the problem.

Rack Chassis Health: Warning

Rack Chassis Health: Warning

Rack Health: Warning

Rack chassis health is degraded. Review the health of this rack's related components to diagnose the problem.

Rack System Health: Warning

Rack System Health: Warning

Rack System Health: Warning

Rack system health is degraded. Review the health of this rack's related components to diagnose the problem.

29

Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide

Name

Description

Chassis Health: Critical

Chassis Chassis Health: Health: Critical Critical

Chassis health is failed. Review the health of this chassis' related components to diagnose the problem.

Rack Chassis Health: Critical

Rack Chassis Rack Health: Health: Critical Critical

Rack chassis health is failed. Review the health of this rack's related components to diagnose the problem.

Rack System Health: Critical

Rack System Rack System Health: Critical Health: Critical

Rack system health is failed. Review the health of this rack's related components to diagnose the problem.

Power Supply Health: Warning

Power Supply Health: Warning

Power supply health is degraded. Recommended actions: - Be sure no loose connections exist. - Check the power source. If the power source is working properly, then replace the power supply. - Be sure the system has enough power, particularly if you recently added hardware, such as hard drives. Remove the newly added component and if the problem is no longer present, then additional power supplies are required. Check the system information from the IML. - If running a redundant configuration, be sure that all of the power supplies in the system have the same spare part number and are supported by the server.

Power Supply Health: Critical

Power Supply Power Supply Health: Critical Health: Critical

30

Symptom

Power Supply Health: Warning

Recommendation

Power supply health is failed. Recommended actions: Be sure no loose connections exist. - Check the power source. If the power source is working properly, then replace the power supply. - Be sure the system has enough power, particularly if you recently added hardware, such as hard drives. Remove the newly added component and if the problem is no longer present, then additional power supplies are required. Check the system information from the IML. - If running a redundant configuration, be sure that all of the power supplies in the system have the same spare part number and are supported by the server.

Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide

Name

Description

Symptom

Recommendation

Fan Health: Warning

Fan Health: Warning

Fan Health: Warning

Fan health is degraded. Recommended actions: - Be sure the fans are properly seated and working. Follow the procedures and warnings in the server documentation for removing the access panels and accessing and replacing fans. Unseat, and then reseat, each fan according to the proper procedures. Replace the access panels, and then attempt to restart the server. - Be sure the fan configuration meets the functional requirements of the server. See the server documentation. - Be sure no ventilation problems exist. If you have been operating the server for an extended period of time with the access panel removed, airflow may have been impeded, causing thermal damage to components. For further requirements, see the server documentation. - Be sure no POST error messages are displayed while booting the server that indicate temperature violation or fan failure information. For the temperature requirements for the server, see the server documentation. - Use iLO or an optional IML viewer to access the IML to see if any event list error messages relating to fans are listed. - In the iLO web interface, navigate to the Information > System Information page and verify the following information: a. Click the Fans tab and verify the fan status and fan speed. b. Click the Temperatures tab and verify the temperature readings for each location on the Temperatures tab. If a hot spot is located, then check the airflow path for blockage by cables and other material. - Replace any required nonfunctioning fans and restart the server. For specifications on fan requirements, see the server documentation. Be sure all fan slots have fans or blanks installed. For requirements, see the server documentation. - Verify the fan airflow path is not blocked by cables or other material.

31

Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide

Name

Description

Symptom

Recommendation

Fan Health: Critical

Fan Health: Critical

Fan Health: Critical

Fan health is failed. Recommended actions: - Be sure the fans are properly seated and working. Follow the procedures and warnings in the server documentation for removing the access panels and accessing and replacing fans. Unseat, and then reseat, each fan according to the proper procedures. Replace the access panels, and then attempt to restart the server. - Be sure the fan configuration meets the functional requirements of the server. See the server documentation. - Be sure no ventilation problems exist. If you have been operating the server for an extended period of time with the access panel removed, airflow may have been impeded, causing thermal damage to components. For further requirements, see the server documentation. - Be sure no POST error messages are displayed while booting the server that indicate temperature violation or fan failure information. For the temperature requirements for the server, see the server documentation. - Use iLO or an optional IML viewer to access the IML to see if any event list error messages relating to fans are listed. - In the iLO web interface, navigate to the Information > System Information page and verify the following information: a. Click the Fans tab and verify the fan status and fan speed. b. Click the Temperatures tab and verify the temperature readings for each location on the Temperatures tab. If a hot spot is located, then check the airflow path for blockage by cables and other material. - Replace any required nonfunctioning fans and restart the server. For specifications on fan requirements, see the server documentation. Be sure all fan slots have fans or blanks installed. For requirements, see the server documentation. - Verify the fan airflow path is not blocked by cables or other material.

Chassis Power Consumption: High

Chassis Power Consumption: High

Chassis Power Consumption: High

Chassis power consumption is high. Recommended actions: - Be sure the power supplies are properly seated and operational. - Be sure the system has enough power, particularly if you recently added hardware, such as hard drives. Remove the newly added component and if the problem is no longer present, then additional power supplies are required.

Rack Power Consumption: High

Rack Power Consumption: High

Rack Power Consumption: High

Rack power consumption is high. Recommended actions: - Be sure the power supplies are properly seated and operational. - Be sure the system has enough power, particularly if you recently added hardware, such as hard drives. Remove the newly added component and if the problem is no longer present, then additional power supplies are required.

Power Supply Average Output: High

Power Supply Average Output: High

Power Supply Average Output: High

Power supply average output is high. Recommended actions: - Be sure all the other power supplies are properly seated and operational. - Be sure the system has enough power, particularly if you recently added hardware, such as hard drives. Remove the newly added component and if the problem is no longer present, then additional power supplies are required.

32

Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide

Name

Description

Symptom

Recommendation

Temperature Sensor Reading Exceeded Non-Critical Threshold

Temperature Sensor Reading Exceeded Non-Critical Threshold

Temperature Sensor Reading Exceeded Non-Critical Threshold

Temperature exceeded the non-critical threshold. Recommended actions: - Check the airflow path for blockage by cables and other material. - Replace any required non-functioning fans and restart the server. For specifications on fan requirements, see the server documentation. - Be sure all fan slots have fans or blanks installed. For requirements, see the server documentation.

Temperature Sensor Reading Exceeded Non-Critical Threshold

Temperature Sensor Reading Exceeded Non-Critical Threshold

Temperature Sensor Reading Exceeded Non-Critical Threshold

Temperature exceeded the non-critical threshold. Recommended actions: - Check the airflow path for blockage by cables and other material. - Replace any required non-functioning fans and restart the server. For specifications on fan requirements, see the server documentation. - Be sure all fan slots have fans or blanks installed. For requirements, see the server documentation.

Temperature Sensor Reading Exceeded Critical Threshold

Temperature Sensor Reading Exceeded Critical Threshold

Temperature Sensor Reading Exceeded Critical Threshold

Temperature exceeded the critical threshold. Recommended actions: - Check the airflow path for blockage by cables and other material. - Replace any required non-functioning fans and restart the server. For specifications on fan requirements, see the server documentation. - Be sure all fan slots have fans or blanks installed. For requirements, see the server documentation.

Temperature Sensor Reading Exceeded Critical Threshold

Temperature Sensor Reading Exceeded Critical Threshold

Temperature Sensor Reading Exceeded Critical Threshold

Temperature exceeded the critical threshold. Recommended actions: - Check the airflow path for blockage by cables and other material. - Replace any required non-functioning fans and restart the server. For specifications on fan requirements, see the server documentation. - Be sure all fan slots have fans or blanks installed. For requirements, see the server documentation.

33

Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide

12. Appendix III: Capacity Definitions The Management Pack uses the following capacity definitions, which help determine the value of Analysis Badges (refer to section 7, Analysis Badges) within vRealize Operations. For more information on using Analysis Badges, refer to the VMware vRealize Operations online help. Table 4: Capacity Definitions CONTAINER

USE IN WORKLOAD

HP Chassis vRealize Calculated Power Capacity

yes

HP Rack vRealize Calculated Power Capacity

yes

HP Power Supply vRealize Calculated Power Output Capacity

34

yes

Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide

You can find the most up-to-date technical documentation on the Blue Medora support site at: http://support.bluemedora.com. The Blue Medora website also provides the latest product updates. If you have comments about this documentation, submit your feedback to: [email protected].

Copyright © 2016 Blue Medora Inc. All rights reserved. U.S. and international copyright and intellectual property laws protect this product. Blue Medora is a registered trademark or trademark of Blue Medora in the United States and/or other jurisdictions. The HP name (including HP ProLiant Servers) and logo are trademarks or registered trademarks of Hewlett Packard Enterprise (HPE) in the United States and/or other jurisdictions. All other marks and names mentioned herein may be trademarks of their respective companies. Blue Medora 3225 N Evergreen Dr. NE Suite 103 Grand Rapids, MI 49525 www.bluemedora.com