VMWARE VREALIZE OPERATIONS MANAGEMENT PACK FOR
HP Servers User Guide
TABLE OF CONTENTS 1. Purpose ..................................................................................................................................................................... 3 2. Introduction to the Management Pack ...................................................................................... 3 2.1 How the Management Pack Collects Data ............................................................ 3 2.2 Data the Management Pack Collects .......................................................................... 3 2.3 Inventory Tree (Traversal Spec) .......................................................................................... 3 3. Dashboards ........................................................................................................................................................... 4 3.1 HP Overview ......................................................................................................................................... 4 3.2 HP Rack Overview .......................................................................................................................... 5 3.3 HP Chassis Overview .................................................................................................................. 6 3.4 HP Blade Overview ........................................................................................................................ 7 3.5 HP Health Investigation ............................................................................................................. 8 3.6 HP Hosted VMs ................................................................................................................................. 9 3.7 HP Hosted ESXi Hosts ............................................................................................................ 10 4. Tags ............................................................................................................................................................................ 10 5. Views......................................................................................................................................................................... 11 6. Reports ................................................................................................................................................................... 12 7. Alerts ......................................................................................................................................................................... 13 8. Analysis Badges ............................................................................................................................................ 13 9. Troubleshooting the Management Pack ............................................................................... 14 9.1 Troubleshooting an Adapter Instance ...................................................................... 14 9.2 Testing Connection Failures ............................................................................................... 14 9.3 Viewing System Log Files..................................................................................................... 14 10. Appendix I: Metrics ................................................................................................................................. 15 11. Appendix II: Alerts, Symptoms, & Recommendations ........................................ 25 12. Appendix III: Capacity Definitions .......................................................................................... 34
NOTE: This document supports the version of each product listed, as well as all subsequent versions, until a new edition replaces it.
You can find the most up-to-date technical documentation on the Blue Medora support site at: http://support.bluemedora.com. The Blue Medora website also provides the latest product updates. If you have comments about this documentation, submit your feedback to:
[email protected]. 2
Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide
1. Purpose The Blue Medora VMware vRealize Operations (vROps) Management Pack for HP Servers User Guide describes the primary features of the Management Pack for HP Servers, including dashboards, views, reports, and alerts that allow users to optimize the monitoring and management of HP server resources from within vRealize Operations.
2. Introduction to the Management Pack The Management Pack for HP Servers is an embedded adapter for vRealize Operations (vROps) that monitors HP servers remotely. The Management Pack retrieves data regarding HP server resources by connecting to each supported HP ProLiant Server’s iLO REST API (via default port 443). This section includes the following topics: • How the Management Pack Collects Data • Data the Management Pack Collects 2.1 How the Management Pack Collects Data During each data collection cycle, the Management Pack opens an iLO REST API connection to the configured HP server(s) and queries it to retrieve metrics for HP server resources. NOTE: For a list of supported HP hardware, refer to the accompanying Blue Medora VMware vRealize Operations (vROps) Management Pack for HP Servers Installation & Configuration Guide. The collection interval for the adapter instance resource determines how often the Management Pack collects data. The default collection interval is five minutes.
Figure 1: Example Traversal Spec
The Management Pack supports Autodiscovery and manual discovery of resources. When you enable Autodiscovery for an adapter instance, the Management Pack creates resources in vRealize Operations and collects data after the main collection query runs. If a new resource belongs to a resource kind that does not exist in vROps, the Management Pack creates the resource kind. 2.2 Data the Management Pack Collects The Management Pack can collect performance data, relationships (associations), and events for the following HP Servers resources: 1. 2. 3. 4. 5. 6. 7. 8. 9.
Chassis Enclosure Blade Port (Optional) Rack Network Adapter Power Supply Fans Containers/Tags • VMware VMs on HP Servers* • VMware Hosts on HP Servers* • HP Servers • HP Tag * To allow for VMware relationships, you must enable IP addresses on the host systems.
2.3 Inventory Tree (Traversal Spec) The Inventory Tree (Traversal Spec) feature within vROps allows you to easily navigate your environment. The hierarchical structure implicitly shows relationships among resource kinds and enables quick drill-downs to root-cause issues.
3
Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide
3. Dashboards Dashboards are the primary user interface that allow users to monitor HP server resources from within vRealize Operations. The following dashboards are available in the Management Pack: • HP Overview • HP Rack Overview • HP Chassis Overview • HP Blade Overview • HP Health Investigation • HP Hosted VMs • HP Hosted ESXi Hosts NOTE: To filter by VMs and Hosts within the dashboards, refer to section “4. Tags” for instructions.
3.1 HP Overview The Overview dashboard provides at-a-glance heatmap views depicting the overall health of your HP Server resources (racks, chassis, power supplies, blades, fans, network adapters, etc.). Figure 2: HP Overview Dashboard
4
Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide
3.2 HP Rack Overview The HP Rack Overview dashboard allows you to select a rack to view its health, status, properties, power, child resources (fans, network adapters, ports, etc.), relationships, and alerts. Figure 3: HP Rack Overview Dashboard
5
Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide
3.3 HP Chassis Overview The HP Chassis Overview dashboard displays an overview of your HP Chassis and its child resources at-aglance. Figure 4: HP Chassis Overview Dashboard
6
Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide
3.4 HP Blade Overview The HP Blade Overview dashboard allows you to select a blade to view its health, status, properties, child resources (network adapters, ports, etc.), relationships, and alerts. Figure 5: HP Blade Overview Dashboard
7
Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide
3.5 HP Health Investigation The Health Investigation dashboard allows you to view health status, top alerts, and key performance Metrics (KPIs) for a selected resource in your HP Servers environment. Figure 6: HP Health Investigation Dashboard
8
Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide
3.6 HP Hosted VMs The Hosted VMs dashboard allows you to select a Virtual Machine to view its parent ESXi host, related ESXi hosts and HP hardware, as well as KPIs for the VM. Figure 7: HP Hosted VMs Dashboard
9
Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide
3.7 HP Hosted ESXi Hosts The HP Hosted ESXi Hosts dashboard allows you to select an ESXi Host to view its properties, relationships, and KPIs, as well as its host HP Server and related KPIs. Figure 8: HP Hosted ESXi Hosts Dashboard
4. Tags To further customize how your Management Pack dashboards are displayed, four tags are currently available for filtering: • • • •
VMware VMs on HP Servers VMware Hosts on HP Servers HP Servers HP Tag
To select a tag, perform the following steps: NOTE: Depending on the dashboard, the steps for selecting a tag may vary slightly. 1. 2. 3. 4. 5. 6.
10
Click on the Content navigation shortcut ( ). Click on the Dashboards view in the navigation pane. Select Edit ( ) on the HP Servers widget. Select Edit Widget ( ) for the widget you want to edit. Expand the HP Servers Container option. Select the desired tag and click Save.
Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide
5. Views The vROps Management Pack for HP Servers creates views that allow the user to view statistics of metrics for various HP Servers resources. The views help give a broad picture of the entire system, as opposed to a more in depth view. Table 1: Management Pack Views VIEW
TYPE
DESCRIPTION
Average HP Health Overview
List
Displays average health and availability over the past week.
HP Alerts
List
Displays alert counts by severity.
HP Capacity
List
Displays capacity (%) and time (days) remaining.
HP Chassis Average KPIs
List
Displays average chassis KPIs over the past week.
HP Fan Average KPIs
List
Displays average fan KPIs over the past week
HP Health Overview
List
Displays current health and availability.
HP Network Adapter Average KPIs
List
Displays average network adapter KPIs over the past week.
HP Port Average KPIs
List
Displays average port KPIs over the past week.
HP Power Supply Average KPIs
List
Displays average power supply KPIs over the past week.
HP Server Information
List
Displays KPIs and alert counts by severity for HP blade and rack servers.
To access the Management Pack views, go to Environment > All Objects > HP Servers and double-click on the desired Object (resource). Select the Details tab, then Views. The available views for that resource are listed and can be selected. Figure 9: Accessing Management Pack Views
11
Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide
6. Reports The vROps Management Pack for HP Servers contains custom reports, as listed below. The reports can be exported and easily shared with key stakeholders in either .pdf or .csv formats. • • • • • • • • •
HP Alerts HP Capacity HP Chassis Average KPIs HP Fan Average KPIs HP Health Overview HP Network Adapter Average KPIs HP Port Average KPIs HP Power Supply Average KPIs HP Server Information
To access the Management Pack reports, go to Environment > All Objects > HP Servers and double-click on the desired Object (resource). Select the Reports tab, then Report Templates. Figure 10: Accessing Management Pack Reports
To run the selected report, click the Run Template icon ( your preferred format.
12
), then click Generated Reports, to select the report in
Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide
7. Alerts The Management Pack for HP Servers pulls health status and events from HP server resources and displays them in vRealize Operations as alerts. Refer to section “10. Appendix II: Alerts, Symptoms, & Recommendations” for the full list of alerts, symptoms, and recommendations. Figure 11: Alert Example
8. Analysis Badges Using the predictive analytics capabilities of the vROps Analysis Badges through capacity definitions, the Management Pack for HP Servers populates Power Capacity for the Chassis and Rack resource kinds as well as Power Output Capacity for Power Supplies. For details, refer to section. “11. Appendix III: Capacity Definitions”. Figure 12: Capacity Badge Example
13
Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide
9. Troubleshooting the Management Pack Known troubleshooting information can help you diagnose and correct common problems with the Management Pack for HP Servers. This section includes the following topics: • Troubleshooting an Adapter Instance • Testing Connection Failures • Viewing System Log Files 9.1 Troubleshooting an Adapter Instance Perform these general troubleshooting steps to diagnose and correct problems with an adapter instance: • Edit the adapter instance and click Test Connection to verify the connection to vROps. Refer to section “8.2 Testing Connection Failures”. • View the collection status and collection state for the adapter instance resource on the Environment Overview page in vROps. • Check the adapter and collector logs for errors. Refer to section “8.3 Viewing System Log Files”, for details. 9.2 Testing Connection Failures When clicking Test Connection in the Manage Solution window when adding an adapter instance, the following connection errors are possible. 9.2.1 Missing Connection Information Ensure the following information was entered correctly: • Host name or IP address (single, or comma-separated, hostname(s) or IP addresses) • Port number (if other than default of 443) • iLO User Name and Password • Timeout setting (must be a positive integer value) NOTE: The iLO provides a security feature that disables logins after a number of failed attempts. This feature does not need to be changed in order for the Management Pack to work, but the lockout does also apply to connections made by the Management Pack. As a result, a streak of consecutive failed connection tests could cause the iLO to refuse future connections for a period of time. To check if this security feature is causing Test Connection to fail, open the iLO login page. If logins are disabled, there will be a message to alert you. The settings for delay time and number of failed login attempts needed to cause a delay can be adjusted in the iLO settings (Administration > Access Settings). 9.3 Viewing System Log Files You can view adapter errors in the adapter and collector log files. You can view the adapter and collector log files in the vROps user interface or in an external log viewer. The adapter log files are in the $ALIVE_BASE/user/log/adapters/hpcompute_adapter3/ folder. The collector log files are in the $ALIVE_BASE/user/log/ folder. The logging level is set to ERROR by default. To troubleshoot issues, set the logging level to INFO. To view detailed messages, including micro steps, queries, and returned results, set the logging level to DEBUG. You can set the base log level for the collector via Administrator > Support > Logs > Select COLLECTOR folder > Select Edit Properties icon > Edit Root logger level. NOTE: If you set the logging level to DEBUG, log files can become large very quickly. Set the logging level to DEBUG only for short periods of time. For complete information about viewing log files and modifying log levels, refer to the vROps online help.
14
Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide
10. Appendix I: Metrics The Management Pack for HP Servers collects the following metrics by HP Resource Kinds. Table 2: Management Pack Metrics Resource Kind
Resource Group
Resource Attribute
Attribute Type
Attribute Description
HP Servers Adapter Instance
License
Blades and Racks
double
Number of blade and rack licenses required for this adapter instance.
HP Servers Container
Relationships
Enclosure Children
string
All HP enclosures.
HP Servers Container
Relationships
Rack Children
string
All HP racks.
VMware VMs on HP Container
Relationships
Virtual Machine Children
string
The virtual machines running on HP servers.
VMware Hosts on HP Container
Relationships
Host System Children
string
The host systems running on HP servers.
HP Enclosure
Relationships
Servers Parent
string
Servers container.
HP Enclosure
Relationships
Chassis Children
string
All the chassis in this enclosure.
HP Enclosure
General
Name
string
The name of the enclosure.
HP Enclosure
General
UUID
string
The universal unique identifier for this enclosure.
HP Blade
Relationships
Network Adapter Child
string
The network adapter on the blade.
HP Blade
Relationships
Chassis Parent
string
The chassis this blade is in.
HP Blade
General
Serial Number
string
The blade serial number.
HP Blade
General
UUID
string
The universal unique identifier for this blade.
HP Blade
General
Manufacturer
string
The manufacturer or OEM of this blade.
HP Blade
General
Model
string
The model information that the manufacturer uses to refer to this blade.
HP Blade
General
SKU
string
SKU for the blade.
HP Blade
General
Asset Tag
string
A user-definable tag that is used to track this blade for inventory or other client purposes.
HP Blade
Firmware
BIOS Version
string
The version of the blade BIOS or primary blade firmware.
HP Blade
Firmware
Platform Definition Firmware Version
string
Platform Definition Table version.
15
Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide
Resource Kind
Resource Group
Resource Attribute
Attribute Type
Attribute Description
HP Blade
Firmware
Power Management Controller Firmware Version
string
The Power Management Microcontroller firmware version.
HP Blade
Firmware
Power Management Controller Bootloader Firmware Version
string
The Power Management Microcontroller firmware bootloader version.
HP Blade
Firmware
SAS PLD Firmware Version
string
SAS Programmable Logic Device version.
HP Blade
Firmware
SPS Firmware Version
string
SPS Firmware version
HP Blade
Firmware
System PLD Firmware Version
string
The firmware version of the CPLD.
HP Blade
Processor
Processor Count
string
The number of processors in the blade.
HP Blade
Processor
Processor Model
string
The processor model for the primary or majority of processors in this blade.
HP Blade
Processor
Processor Health
string
This represents the health state of the processors in the absence of its dependent resources.
HP Blade
Power
Power State
string
This is the current power state of the blade.
HP Blade
Power
Power Capacity
double
The total amount of power allocated to the blade.
HP Blade
Memory
Memory Health
string
This represents the health state of the memory in the absence of its dependent resources.
HP Blade
Memory
Total Memory
double
The total amount of memory in the blade.
HP Blade
Status
State
string
This indicates the known state of the blade, such as if it is enabled.
HP Blade
Status
Health
string
This represents the health state of this blade in the absence of its dependent resources.
HP Network Adapter
Relationships
Blade Parent
string
The blade this network adapter is on.
HP Network Adapter
Relationships
Rack Parent
string
The rack system this network adapter is on.
HP Network Adapter
Relationships
Port Child
string
Port that this network adapter has.
16
Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide
Resource Kind
Resource Group
Resource Attribute
Attribute Type
Attribute Description
HP Network Adapter
General
IP Addresses
string
The IP addresses of the ports on the network adapter.
HP Network Adapter
General
Serial Number
string
The device serial number.
HP Network Adapter
General
Part Number
string
The device part number.
HP Network Adapter
Firmware
Firmware Version
string
This string represents the version of the firmware image.
HP Network Adapter
Status
State
string
This indicates the known state of the port, such as if it is enabled.
HP Network Adapter
Status
Health
string
This represents the health state of this resource in the absence of its dependent resources.
HP Network Adapter
Performance
Bad Receives
double
A count of frames that were received by the adapter but which had an error. This counter is the sum of mib items cpqNicIfPhysAdapterAlignmentErrors, cpqNicIfPhysAdapterFCSErrors, cpqNicIfPhysAdapterFrameTooLongs, and cpqNicIfPhysAdapterInternalMacReceiveErrors. If this counter increments frequently, check the more detailed error statistics and take appropriate action.
HP Network Adapter
Performance
Bad Transmits
double
A count of frames that were not transmitted by the adapter because of an error. This counter is the sum of MIB items cpqNicIfPhysAdapterDeferredTransmissions, cpqNicIfPhysAdapterLateCollisions, cpqNicIfPhysAdapterExcessiveCollisions, cpqNicIfPhysAdapterCarrierSenseErrors, and cpqNicIfPhysAdapterInternalMacTransmitErrors. If this counter increments frequently, check the more detailed error statistics and take appropriate action.
HP Network Adapter
Performance
Good Receives
double
A count of frames successfully received by the physical adapter.
17
Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide
Resource Kind
Resource Group
Resource Attribute
Attribute Type
Attribute Description
HP Network Adapter
Performance
Good Transmits
double
A count of frames successfully transmitted by the physical adapter.
HP Port
Relationships
Network Adapter Parent
string
The network adapter this port is on.
HP Port
General
MAC Address
string
The port MAC address.
HP Port
General
Full Duplex
string
Full-duplex data transmission means that data can be transmitted in both directions on a signal carrier at the same time.
HP Port
General
IPv4 Address
string
This is the IPv4 Address.
HP Port
General
IPv6 Address
string
This is the IPv6 Address.
HP Port
Performance
Bad Receives
double
A count of frames that were received by the adapter but which had an error. This counter is the sum of mib items cpqNicIfPhysAdapterAlignmentErrors, cpqNicIfPhysAdapterFCSErrors, cpqNicIfPhysAdapterFrameTooLongs, and cpqNicIfPhysAdapterInternalMacReceiveErrors. If this counter increments frequently, check the more detailed error statistics and take appropriate action.
HP Port
Performance
Bad Transmits
double
A count of frames that were not transmitted by the adapter because of an error. This counter is the sum of MIB items cpqNicIfPhysAdapterDeferredTransmissions, cpqNicIfPhysAdapterLateCollisions, cpqNicIfPhysAdapterExcessiveCollisions, cpqNicIfPhysAdapterCarrierSenseErrors, and cpqNicIfPhysAdapterInternalMacTransmitErrors. If this counter increments frequently, check the more detailed error statistics and take appropriate action.
HP Port
Performance
Good Receives
double
A count of frames successfully received by the physical adapter.
HP Port
Performance
Good Transmits
double
A count of frames successfully transmitted by the physical adapter.
18
Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide
Resource Kind
Resource Group
Resource Attribute
Attribute Type
Attribute Description
HP Port
Performance
Speed
double
An estimate of the interface's current bandwidth. For interfaces which do not vary in bandwidth or for those where no accurate estimation can be made, this object should contain the nominal bandwidth.
HP Port
Status
State
string
This indicates the known state of the port, such as if it is enabled.
HP Port
Status
Health
string
This represents the health state of this resource in the absence of its dependent resources.
HP Chassis
Relationships
Enclosure Parent
string
Enclosure this chassis is on.
HP Chassis
Relationships
Power Supply Child
string
Power Supply this chassis has.
HP Chassis
Relationships
Blade Child
string
Blades this chassis has.
HP Chassis
Relationships
Fan Child
string
Fans this chassis has.
HP Chassis
General
Model
string
The chassis model number.
HP Chassis
General
Serial Number
string
The chassis serial number.
HP Chassis
General
Chassis Type
string
This property indicates the physical form factor type of this chassis.
HP Chassis
General
Manufacturer
string
The chassis manufacturer.
HP Chassis
General
UUID
string
The chassis UUID.
HP Chassis
General
Name
string
The chassis name.
HP Chassis
General
SKU
string
SKU for the chassis.
HP Chassis
General
Version
string
The chassis version.
HP Chassis
General
Asset Tag
string
The chassis user-assigned asset tag.
HP Chassis
General
Consumed Height (Bays)
string
The number of enclosure bays this chassis consumes in height.
HP Chassis
General
Consumed Width (Bays)
string
The number of enclosure bays this chassis consumes in width.
HP Chassis
General
Bay Number
string
The position of the chassis inside an enclosure.
HP Chassis
General
Height (U)
string
The chassis rack U height.
HP Chassis
General
U Location
string
The chassis rack U location.
19
Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide
Resource Kind
Resource Group
Resource Attribute
Attribute Type
Attribute Description
HP Chassis
General
U Position
string
The chassis U position in the rack.
HP Chassis
General
Rack Part Number
string
The chassis rack part number.
HP Chassis
Status
State
string
This indicates the known state of the chassis, such as if it is enabled.
HP Chassis
Status
Health
string
This represents the health state of this chassis in the absence of its dependent resources.
HP Chassis
Firmware
SAS PLD Firmware Version
string
The firmware version of the SAS controller.
HP Chassis
Firmware
SPS Firmware Version
string
The SPS FW Version number, aka ME FW Version, AAAA.BBBB. CCCC.DDDD.E
HP Chassis
Firmware
System PLD Firmware Version
string
The firmware version of the CPLD.
HP Chassis
Firmware
Power Management Controller Firmware Version
string
The firmware version of the Power Monitor.
HP Chassis
Firmware
Power Management Controller Boot Loader Firmware Version
string
The firmware version of the Power Monitor boot loader.
HP Chassis
Firmware
Platform Definition Firmware Version
string
The version of the Intelligent Platform Abstraction Data.
HP Chassis
Power
Power Management Hardware Family
string
The family type of the Power Monitor hardware.
HP Chassis
Power
Power Capacity
double
The total power (Watts) available to the chassis from all power supplies (adjusting for redundancy settings).
HP Chassis
Power
Power Metrics Interval
double
The interval between power metric evaluation in minutes.
HP Chassis
Power
Minimum Power Consumed
double
The minimum power consumed during the interval specified by IntervalInMin.
HP Chassis
Power
Maximum Power Consumed
double
The maximum power consumed during the interval specified by IntervalInMin.
20
Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide
Resource Kind
Resource Group
Resource Attribute
Attribute Type
Attribute Description
HP Chassis
Power
Average Power Consumed
double
The average power consumed during the interval specified by IntervalInMin.
HP Chassis
Power
Power Consumed
double
The latest observed power (Watts) being drawn by this chassis. The update interval may vary depending upon implementation but is usually measured in seconds.
HP Rack
Relationships
Servers Parent
string
Servers container.
HP Rack
Relationships
Enclosure Parent
string
Enclosure this rack is on.
HP Rack
Relationships
Power Supply Child
string
Power Supply this rack has.
HP Rack
Relationships
Network Adapter Child
string
Network adapter this rack has.
HP Rack
Relationships
Fan Child
string
Fan this rack has.
HP Rack
General
Serial Number
string
The rack serial number.
HP Rack
General
Chassis Type
string
This property indicates the physical form factor type of this rack.
HP Rack
General
Manufacturer
string
The rack manufacturer.
HP Rack
General
Model
string
The rack model number.
HP Rack
General
SKU
string
SKU for the rack.
HP Rack
General
Version
string
The rack version.
HP Rack
General
Asset Tag
string
The rack’s user-assigned asset tag.
HP Rack
General
UUID
string
The rack UUID provided by SMBIOS.
HP Rack
General
Consumed Height (Bays)
string
The number of enclosure bays this rack consumes in height.
HP Rack
General
Consumed Width (Bays)
string
The number of enclosure bays this rack consumes in width.
HP Rack
General
Bay Number
string
The position of the rack inside an enclosure.
HP Rack
General
Height (U)
string
The rack U height.
HP Rack
General
U Location
string
The rack U location.
HP Rack
General
U Position
string
The rack U position in the rack.
HP Rack
General
Enclosure
string
The name of the rack enclosure.
21
Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide
Resource Kind
Resource Group
Resource Attribute
Attribute Type
Attribute Description
HP Rack
Status
State
string
This indicates the known state of the rack, such as if it is enabled.
HP Rack
Status
Health
string
This represents the health state of this rack in the absence of its dependent resources.
HP Rack
Firmware
BIOS Version
string
The version of the blade BIOS or primary blade firmware.
HP Rack
Firmware
SAS PLD Firmware Version
string
The firmware version of the SAS controller.
HP Rack
Firmware
SPS Firmware Version
string
The SPS FW Version number, aka ME FW Version, AAAA.BBBB. CCCC.DDDD.E
HP Rack
Firmware
System PLD Firmware Version
string
The firmware version of the CPLD.
HP Rack
Firmware
Power Management Controller Firmware Version
string
The firmware version of the Power Monitor.
HP Rack
Firmware
Power Management Controller Boot Loader Firmware Version
string
The firmware version of the Power Monitor boot loader.
HP Rack
Firmware
Platform Definition Firmware Version
string
The version of the Intelligent Platform Abstraction Data.
HP Rack
Power
Power Management Hardware
string
The family type of the Power Monitor hardware.
HP Rack
Power
Power State
string
This is the current power state of the blade.
HP Rack
Power
Power Limit
double
The total amount of power allocated to the blade.
HP Rack
Power
Power Capacity
double
The total power (Watts) available to the chassis from all power supplies (adjusting for redundancy settings).
HP Rack
Power
Power Metrics Interval
double
The interval between power metric evaluation in minutes.
HP Rack
Power
Minimum Power Consumed
double
The minimum power consumed during the interval specified by IntervalInMin.
HP Rack
Power
Maximum Power Consumed
double
The maximum power consumed during the interval specified by IntervalInMin.
22
Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide
Resource Kind
Resource Group
Resource Attribute
Attribute Type
Attribute Description
HP Rack
Power
Average Power Consumed
double
The average power consumed during the interval specified by IntervalInMin.
HP Rack
Power
Power Consumed
double
The latest observed power (Watts) being drawn by this chassis. The update interval may vary depending upon implementation but is usually measured in seconds.
HP Rack
Processor
Processor Count
string
The number of processors in the rack system.
HP Rack
Processor
Processor Model
string
The processor model for the primary or majority of processors in this rack system.
HP Rack
Processor
Processor Health
string
This represents the health state of the processors in the absence of its dependent resources.
HP Rack
Memory
Memory Health
string
This represents the health state of the memory in the absence of its dependent resources.
HP Rack
Memory
Total Memory
double
The total amount of memory in the rack system.
HP Power Supply
Relationships
Chassis Parent
string
Chassis this power supply is on.
HP Power Supply
Relationships
Rack Parent
string
Rack this power supply is on.
HP Power Supply
General
Serial Number
string
The serial number for this Power Supply
HP Power Supply
General
Bay Number
string
The power supply bay number.
HP Power Supply
General
Spare Part Number
string
The part number for this Power Supply
HP Power Supply
General
Model
string
The model number for this Power Supply.
HP Power Supply
General
Type
string
The Power Supply type (AC or DC).
HP Power Supply
General
Hotplug Capable
string
If true, this power supply (and power supply bay) is capable of being hotplugged.
HP Power Supply
Firmware
Firmware Version
string
The firmware version for this Power Supply.
23
Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide
Resource Kind
Resource Group
Resource Attribute
Attribute Type
Attribute Description
HP Power Supply
Status
State
string
This indicates the known state of the power supply, such as if it is enabled.
HP Power Supply
Status
Health
string
This represents the health state of this power supply in the absence of its dependent resources.
HP Power Supply
Performance
Power Capacity
double
The maximum capacity of this Power Supply.
HP Power Supply
Performance
Average Power Output
double
The average power output of this Power Supply.
HP Fan
Relationships
Chassis Parent
string
Chassis this fan is on.
HP Fan
Relationships
Rack Parent
string
Rack this fan is on.
HP Fan
General
Chassis Serial Number
string
The serial number of the chassis this fan is on.
HP Fan
General
Name
string
The name of the fan sensor.
HP Fan
General
Fan Location
string
The area or device to which this fan is located.
HP Fan
Status
State
string
The state of the fan.
HP Fan
Status
Health
string
The health of the fan.
HP Fan
Performance
Current Utilization (%)
double
The current utilization (% of max speed) of the fan.
HP Fan
Performance
Current Speed (RPM) double
24
Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide
The current speed of the fan.
11. Appendix II: Alerts, Symptoms, & Recommendations The vROps Management Pack for HP Servers creates alerts and provides recommended actions based on various symptoms that it detects in the environment. See the table below for details regarding each alert. Table 3: Alerts, Symptoms, & Recommendations Name
Description
Symptom
Recommendation
Blade Health: Warning
Blade Health: Warning
Blade Health: Warning
Blade health is degraded. Review the health of this blade's related components to diagnose the problem.
Blade Health: Critical
Blade Health: Critical
Blade Health: Critical
Blade health is failed. Review the health of this blade's related components to diagnose the problem.
Blade Memory Health: Warning
Blade Memory Blade Memory Health: Health: Warning Warning
Blade memory health is degraded. Recommended actions: - Be sure the memory meets the blade requirements and is installed as required by the blade. - Some blades may require that memory banks be populated fully or that all memory within a memory bank must be the same size, type, and speed. To determine if the memory is installed properly, see the blade documentation. - Check any blade LEDs that correspond to memory slots. - If you are unsure which DIMM has failed, test each bank of DIMMs by removing all other DIMMs. Then isolate the failed DIMM by switching each DIMM in a bank with a known working DIMM. - Remove any third-party memory and run HP Insight Diagnostics.
Rack Memory Health: Warning
Rack Memory Health: Warning
Rack memory health is degraded. Recommended actions: - Be sure the memory meets the rack requirements and is installed as required by the rack. - Some racks may require that memory banks be populated fully or that all memory within a memory bank must be the same size, type, and speed. To determine if the memory is installed properly, see the rack documentation. - Check any rack LEDs that correspond to memory slots. - If you are unsure which DIMM has failed, test each bank of DIMMs by removing all other DIMMs. Then, isolate the failed DIMM by switching each DIMM in a bank with a known working DIMM. - Remove any third-party memory and run HP Insight Diagnostics.
Blade Memory Health: Critical
Blade Memory Blade Memory Health: Critical Health: Critical
25
Rack Memory Health: Warning
Blade memory health is failed. Recommended actions: Be sure the memory meets the blade requirements and is installed as required by the blade. - Some blades may require that memory banks be populated fully or that all memory within a memory bank must be the same size, type, and speed. To determine if the memory is installed properly, see the blade documentation. - Check any blade LEDs that correspond to memory slots. - If you are unsure which DIMM has failed, test each bank of DIMMs by removing all other DIMMs. Then, isolate the failed DIMM by switching each DIMM in a bank with a known working DIMM. - Remove any third-party memory and run HP Insight Diagnostics.
Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide
Name
Description
Rack Memory Health: Critical
Rack Memory Rack Memory Health: Critical Health: Critical
Rack memory health is failed. Recommended actions: Be sure the memory meets the rack requirements and is installed as required by the rack. - Some racks may require that memory banks be populated fully or that all memory within a memory bank must be the same size, type, and speed. To determine if the memory is installed properly, see the rack documentation. - Check any rack LEDs that correspond to memory slots. - If you are unsure which DIMM has failed, test each bank of DIMMs by removing all other DIMMs. Then, isolate the failed DIMM by switching each DIMM in a bank with a known working DIMM. - Remove any third-party memory and run HP Insight Diagnostics.
Blade Processor Health: Warning
Blade Processor Health: Warning
Blade processor health is degraded. Recommended actions: - Be sure each processor is supported by the blade and is installed as directed in the blade documentation. The processor socket requires very specific installation steps and only supported processors should be installed. For processor requirements, see the blade documentation. - Be sure the blade ROM is current. - Be sure you are not mixing processor stepping, core speeds, or cache sizes if this is not supported on the blade. For more information, see the blade documentation. CAUTION: Removal of some processors and heatsinks require special considerations for replacement, while other processors and heatsinks are integrated and cannot be reused once separated. For specific instructions for the blade you are troubleshooting, refer to processor information in the blade user guide. - If the blade has only one processor installed, reseat the processor. If the problem is resolved after you restart the blade, the processor was not installed properly. - If the blade has only one processor installed, replace it with a known functional processor. If the problem is resolved after you restart the blade, the original processor failed. - If the blade has multiple processors installed, test each processor: 1. Remove all but one processor from the blade. Replace each with a processor terminator board or blank, if applicable to the blade. 2. Replace the remaining processor with a known functional processor. If the problem is resolved after you restart the blade, a fault exists with one or more of the original processors. Install each processor one by one, restarting each time, to find the faulty processor or processors. At each step, be sure the blade supports the processor configurations.
26
Symptom
Blade Processor Health: Warning
Recommendation
Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide
Name
Description
Symptom
Recommendation
Rack Processor Health: Warning
Rack Processor Health: Warning
Rack Processor Health: Warning
Rack processor health is degraded. Recommended actions: - Be sure each processor is supported by the rack and is installed as directed in the rack documentation. The processor socket requires very specific installation steps and only supported processors should be installed. For processor requirements, see the rack documentation. - Be sure the rack ROM is current. - Be sure you are not mixing processor stepping, core speeds, or cache sizes if this is not supported on the rack. For more information, see the rack documentation. CAUTION: Removal of some processors and heatsinks require special considerations for replacement, while other processors and heatsinks are integrated and cannot be reused once separated. For specific instructions for the rack you are troubleshooting, refer to processor information in the rack user guide. If the rack has only one processor installed, reseat the processor. If the problem is resolved after you restart the rack, the processor was not installed properly. - If the rack has only one processor installed, replace it with a known functional processor. If the problem is resolved after you restart the rack, the original processor failed. If the rack has multiple processors installed, test each processor: 1. Remove all but one processor from the rack. Replace each with a processor terminator board or blank, if applicable to the rack. 2. Replace the remaining processor with a known functional processor. If the problem is resolved after you restart the rack, a fault exists with one or more of the original processors. Install each processor one by one, restarting each time, to find the faulty processor or processors. At each step, be sure the rack supports the processor configurations.
27
Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide
Name
Description
Blade Processor Health: Critical
Blade Blade Processor Processor Health: Critical Health: Critical
28
Symptom
Recommendation Blade processor health is failed. Recommended actions: - Be sure each processor is supported by the blade and is installed as directed in the blade documentation. The processor socket requires very specific installation steps and only supported processors should be installed. For processor requirements, see the blade documentation. Be sure the blade ROM is current. - Be sure you are not mixing processor stepping, core speeds, or cache sizes if this is not supported on the blade. For more information, see the blade documentation. CAUTION: Removal of some processors and heatsinks require special considerations for replacement, while other processors and heatsinks are integrated and cannot be reused once separated. For specific instructions for the blade you are troubleshooting, refer to processor information in the blade user guide. If the blade has only one processor installed, reseat the processor. If the problem is resolved after you restart the blade, the processor was not installed properly. If the blade has only one processor installed, replace it with a known functional processor. If the problem is resolved after you restart the blade, the original processor failed. - If the blade has multiple processors installed, test each processor: 1. Remove all but one processor from the blade. Replace each with a processor terminator board or blank, if applicable to the blade. 2. Replace the remaining processor with a known functional processor. If the problem is resolved after you restart the blade, a fault exists with one or more of the original processors. Install each processor one by one, restarting each time, to find the faulty processor or processors. At each step, be sure the blade supports the processor configurations.
Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide
Name
Description
Rack Processor Health: Critical
Rack Rack Processor Processor Health: Critical Health: Critical
Network Adapter Network Health: Warning Adapter Health: Warning
Symptom
Recommendation Rack processor health is failed. Recommended actions: - Be sure each processor is supported by the rack and is installed as directed in the rack documentation. The processor socket requires very specific installation steps and only supported processors should be installed. For processor requirements, see the rack documentation. Be sure the rack ROM is current. - Be sure you are not mixing processor stepping, core speeds, or cache sizes if this is not supported on the rack. For more information, see the rack documentation. CAUTION: Removal of some processors and heatsinks require special considerations for replacement, while other processors and heatsinks are integrated and cannot be reused once separated. For specific instructions for the rack you are troubleshooting, refer to processor information in the rack user guide. If the rack has only one processor installed, reseat the processor. If the problem is resolved after you restart the rack, the processor was not installed properly. - If the rack has only one processor installed, replace it with a known functional processor. If the problem is resolved after you restart the rack, the original processor failed. If the rack has multiple processors installed, test each processor: 1. Remove all but one processor from the rack. Replace each with a processor terminator board or blank, if applicable to the rack. 2. Replace the remaining processor with a known functional processor. If the problem is resolved after you restart the rack, a fault exists with one or more of the original processors. Install each processor one by one, restarting each time, to find the faulty processor or processors. At each step, be sure the rack supports the processor configurations.
Network Adapter Health: Warning
Network adapter health is degraded. Recommended action: - Reseat the network adapter and restart the server. - Review the signal backplane on the server or the midplane for damage. - Replace the adapter.
Network Adapter Network Network Adapter Health: Critical Adapter Health: Critical Health: Critical
Network adapter health is failed. Recommended action: - Reseat the network adapter and restart the server. Review the signal backplane on the server or the midplane for damage. - Replace the adapter.
Port Health: Warning
Port Health: Warning
Port Health: Warning
Port health is degraded. Recommended action: - Review the signal backplane on the server or the midplane for damage. - Replace the network adapter.
Port Health: Critical
Port Health: Critical
Port Health: Critical
Port health is failed. Recommended action: - Review the signal backplane on the server or the midplane for damage. - Replace the network adapter.
Chassis Health: Warning
Chassis Health: Warning
Chassis Health: Warning
Chassis health is degraded. Review the health of this chassis' related components to diagnose the problem.
Rack Chassis Health: Warning
Rack Chassis Health: Warning
Rack Health: Warning
Rack chassis health is degraded. Review the health of this rack's related components to diagnose the problem.
Rack System Health: Warning
Rack System Health: Warning
Rack System Health: Warning
Rack system health is degraded. Review the health of this rack's related components to diagnose the problem.
29
Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide
Name
Description
Chassis Health: Critical
Chassis Chassis Health: Health: Critical Critical
Chassis health is failed. Review the health of this chassis' related components to diagnose the problem.
Rack Chassis Health: Critical
Rack Chassis Rack Health: Health: Critical Critical
Rack chassis health is failed. Review the health of this rack's related components to diagnose the problem.
Rack System Health: Critical
Rack System Rack System Health: Critical Health: Critical
Rack system health is failed. Review the health of this rack's related components to diagnose the problem.
Power Supply Health: Warning
Power Supply Health: Warning
Power supply health is degraded. Recommended actions: - Be sure no loose connections exist. - Check the power source. If the power source is working properly, then replace the power supply. - Be sure the system has enough power, particularly if you recently added hardware, such as hard drives. Remove the newly added component and if the problem is no longer present, then additional power supplies are required. Check the system information from the IML. - If running a redundant configuration, be sure that all of the power supplies in the system have the same spare part number and are supported by the server.
Power Supply Health: Critical
Power Supply Power Supply Health: Critical Health: Critical
30
Symptom
Power Supply Health: Warning
Recommendation
Power supply health is failed. Recommended actions: Be sure no loose connections exist. - Check the power source. If the power source is working properly, then replace the power supply. - Be sure the system has enough power, particularly if you recently added hardware, such as hard drives. Remove the newly added component and if the problem is no longer present, then additional power supplies are required. Check the system information from the IML. - If running a redundant configuration, be sure that all of the power supplies in the system have the same spare part number and are supported by the server.
Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide
Name
Description
Symptom
Recommendation
Fan Health: Warning
Fan Health: Warning
Fan Health: Warning
Fan health is degraded. Recommended actions: - Be sure the fans are properly seated and working. Follow the procedures and warnings in the server documentation for removing the access panels and accessing and replacing fans. Unseat, and then reseat, each fan according to the proper procedures. Replace the access panels, and then attempt to restart the server. - Be sure the fan configuration meets the functional requirements of the server. See the server documentation. - Be sure no ventilation problems exist. If you have been operating the server for an extended period of time with the access panel removed, airflow may have been impeded, causing thermal damage to components. For further requirements, see the server documentation. - Be sure no POST error messages are displayed while booting the server that indicate temperature violation or fan failure information. For the temperature requirements for the server, see the server documentation. - Use iLO or an optional IML viewer to access the IML to see if any event list error messages relating to fans are listed. - In the iLO web interface, navigate to the Information > System Information page and verify the following information: a. Click the Fans tab and verify the fan status and fan speed. b. Click the Temperatures tab and verify the temperature readings for each location on the Temperatures tab. If a hot spot is located, then check the airflow path for blockage by cables and other material. - Replace any required nonfunctioning fans and restart the server. For specifications on fan requirements, see the server documentation. Be sure all fan slots have fans or blanks installed. For requirements, see the server documentation. - Verify the fan airflow path is not blocked by cables or other material.
31
Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide
Name
Description
Symptom
Recommendation
Fan Health: Critical
Fan Health: Critical
Fan Health: Critical
Fan health is failed. Recommended actions: - Be sure the fans are properly seated and working. Follow the procedures and warnings in the server documentation for removing the access panels and accessing and replacing fans. Unseat, and then reseat, each fan according to the proper procedures. Replace the access panels, and then attempt to restart the server. - Be sure the fan configuration meets the functional requirements of the server. See the server documentation. - Be sure no ventilation problems exist. If you have been operating the server for an extended period of time with the access panel removed, airflow may have been impeded, causing thermal damage to components. For further requirements, see the server documentation. - Be sure no POST error messages are displayed while booting the server that indicate temperature violation or fan failure information. For the temperature requirements for the server, see the server documentation. - Use iLO or an optional IML viewer to access the IML to see if any event list error messages relating to fans are listed. - In the iLO web interface, navigate to the Information > System Information page and verify the following information: a. Click the Fans tab and verify the fan status and fan speed. b. Click the Temperatures tab and verify the temperature readings for each location on the Temperatures tab. If a hot spot is located, then check the airflow path for blockage by cables and other material. - Replace any required nonfunctioning fans and restart the server. For specifications on fan requirements, see the server documentation. Be sure all fan slots have fans or blanks installed. For requirements, see the server documentation. - Verify the fan airflow path is not blocked by cables or other material.
Chassis Power Consumption: High
Chassis Power Consumption: High
Chassis Power Consumption: High
Chassis power consumption is high. Recommended actions: - Be sure the power supplies are properly seated and operational. - Be sure the system has enough power, particularly if you recently added hardware, such as hard drives. Remove the newly added component and if the problem is no longer present, then additional power supplies are required.
Rack Power Consumption: High
Rack Power Consumption: High
Rack Power Consumption: High
Rack power consumption is high. Recommended actions: - Be sure the power supplies are properly seated and operational. - Be sure the system has enough power, particularly if you recently added hardware, such as hard drives. Remove the newly added component and if the problem is no longer present, then additional power supplies are required.
Power Supply Average Output: High
Power Supply Average Output: High
Power Supply Average Output: High
Power supply average output is high. Recommended actions: - Be sure all the other power supplies are properly seated and operational. - Be sure the system has enough power, particularly if you recently added hardware, such as hard drives. Remove the newly added component and if the problem is no longer present, then additional power supplies are required.
32
Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide
Name
Description
Symptom
Recommendation
Temperature Sensor Reading Exceeded Non-Critical Threshold
Temperature Sensor Reading Exceeded Non-Critical Threshold
Temperature Sensor Reading Exceeded Non-Critical Threshold
Temperature exceeded the non-critical threshold. Recommended actions: - Check the airflow path for blockage by cables and other material. - Replace any required non-functioning fans and restart the server. For specifications on fan requirements, see the server documentation. - Be sure all fan slots have fans or blanks installed. For requirements, see the server documentation.
Temperature Sensor Reading Exceeded Non-Critical Threshold
Temperature Sensor Reading Exceeded Non-Critical Threshold
Temperature Sensor Reading Exceeded Non-Critical Threshold
Temperature exceeded the non-critical threshold. Recommended actions: - Check the airflow path for blockage by cables and other material. - Replace any required non-functioning fans and restart the server. For specifications on fan requirements, see the server documentation. - Be sure all fan slots have fans or blanks installed. For requirements, see the server documentation.
Temperature Sensor Reading Exceeded Critical Threshold
Temperature Sensor Reading Exceeded Critical Threshold
Temperature Sensor Reading Exceeded Critical Threshold
Temperature exceeded the critical threshold. Recommended actions: - Check the airflow path for blockage by cables and other material. - Replace any required non-functioning fans and restart the server. For specifications on fan requirements, see the server documentation. - Be sure all fan slots have fans or blanks installed. For requirements, see the server documentation.
Temperature Sensor Reading Exceeded Critical Threshold
Temperature Sensor Reading Exceeded Critical Threshold
Temperature Sensor Reading Exceeded Critical Threshold
Temperature exceeded the critical threshold. Recommended actions: - Check the airflow path for blockage by cables and other material. - Replace any required non-functioning fans and restart the server. For specifications on fan requirements, see the server documentation. - Be sure all fan slots have fans or blanks installed. For requirements, see the server documentation.
33
Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide
12. Appendix III: Capacity Definitions The Management Pack uses the following capacity definitions, which help determine the value of Analysis Badges (refer to section 7, Analysis Badges) within vRealize Operations. For more information on using Analysis Badges, refer to the VMware vRealize Operations online help. Table 4: Capacity Definitions CONTAINER
USE IN WORKLOAD
HP Chassis vRealize Calculated Power Capacity
yes
HP Rack vRealize Calculated Power Capacity
yes
HP Power Supply vRealize Calculated Power Output Capacity
34
yes
Blue Medora VMware vRealize Operations Management Pack for HP Servers User Guide
You can find the most up-to-date technical documentation on the Blue Medora support site at: http://support.bluemedora.com. The Blue Medora website also provides the latest product updates. If you have comments about this documentation, submit your feedback to:
[email protected].
Copyright © 2016 Blue Medora Inc. All rights reserved. U.S. and international copyright and intellectual property laws protect this product. Blue Medora is a registered trademark or trademark of Blue Medora in the United States and/or other jurisdictions. The HP name (including HP ProLiant Servers) and logo are trademarks or registered trademarks of Hewlett Packard Enterprise (HPE) in the United States and/or other jurisdictions. All other marks and names mentioned herein may be trademarks of their respective companies. Blue Medora 3225 N Evergreen Dr. NE Suite 103 Grand Rapids, MI 49525 www.bluemedora.com