Back to the list of connectors

Oracle/Sun Solaris - Fault Manager - Memory and CPU

Description

This connector parses fmadm faulty command looking for faulty memory modules

Connector ID: SunFmadm

This connector is superseded by:

Target

Typical platform: Oracle/Sun

Operating system: Oracle Solaris

Prerequisites

Leverages: Sun Solaris system commands (fmadm)

Technology and protocols: System Commands

This connector requires advanced privileges on the managed host for the command below:

  • /usr/sbin/fmadm

This connector therefore needs to run as root or you need to configure a privilege-escalation mechanism like sudo on the managed host to allow the monitoring account to run the command listed above.

Sample of /etc/sudoers to allow the above command to be run as root by the hwsagent account:

hwsagent ALL=(root) NOPASSWD: /usr/sbin/fmadm

Examples

CLI

hws HOSTNAME -t solaris -f SunFmadm --ssh -u USER --sudo-command-list /usr/sbin/fmadm

hws-config.yaml

hosts:
- host:
    hostname: <HOSTNAME> # Change with actual host name
    type: solaris
  selectedConnectors: [ SunFmadm ] # Optional, to load only this connector
  ssh:
    username: <USERNAME> # Change with actual credentials
    password: <PASSWORD> # Encrypted using hws-encrypt
    useSudo: true
    useSudoCommands: [ "/usr/sbin/fmadm" ]

Connector Activation Criteria

The Oracle/Sun Solaris - Fault Manager - Memory and CPU connector will be automatically activated, and its status will be reported as OK if all the below criteria are met:

  • Operating System is Oracle Solaris
  • The command below succeeds on the monitored host
    • Command: /bin/uname -r
    • Output contains: 5\.1[0-9] (regex)
  • The command below succeeds on the monitored host
    • Command: /usr/sbin/fmadm faulty;/usr/bin/echo errorlevel $?
    • Output contains: ^errorlevel 0$ (regex)
  • The command below succeeds on the monitored host
    • Command: /usr/sbin/fmadm config | grep cpumem
    • Output contains: active (regex)

Metrics

Type Collected Metrics Specific Attributes (Labels)
Memory Module
  • hw.memory.limit
  • hw.status{hw.type="memory",state="ok|degraded|failed"}
  • hw.status{hw.type="memory",state="present"}
    Other Device
    • hw.status{hw.type="other_device",state="ok|degraded|failed"}
    • hw.status{hw.type="other_device",state="present"}
    • device_type
    No results.