+ - Framework for infrastructure services to raise and persist alarm and event data.
+
+ - Set, clear and query customer alarms
+ - Generate customer logs for significant events
+
+ - Maintains an Active Alarm List
+ - Provides a REST API to query alarms and events; alarms and events are also available through SNMP traps
+ - Supports alarm suppression
+ - Operator alarms
+
+ - On platform nodes and resources
+ - On hosted virtual resources
+
+ - Operator logs - Event List
+
+ - Logging of alarm sets and clears
+ - Related to platform nodes and resources
+ - Related to hosted virtual resources
+
+2. Configuration Management
+
+ - Manages Installation and Commissioning
+
+ - Auto-discovery of new nodes
+ - Full infrastructure management
+ - Manage installation parameters (e.g. console, root disks)
+
+ - Nodal Configuration
+
+ - Node role, role profiles
+ - Core, memory (including huge page) assignments
+ - Network Interfaces and storage assignments
+
+ - Hardware Discovery
+
+ - CPU/cores, SMT, processors, memory, huge pages
+ - Storage, ports
+ - GPUs, crypto/compression H/W
+
+3. Software Management
+
+ - Manages Installation and Commissioning
+
+ - Auto-discovery of new nodes
+ - Full infrastructure management
+ - Manage installation parameters (e.g. console, root disks)
+
+ - Nodal Configuration
+
+ - Node role, role profiles
+ - Core, memory (including huge page) assignments
+ - Network Interfaces and storage assignments
+
+ - Hardware Discovery
+
+ - CPU/cores, SMT, processors, memory, huge pages
+ - Storage, ports
+ - GPUs, crypto/compression H/W
+
+4. Host Management
+
+ - Full life-cycle and availability management of the physical hosts
+ - Detects and automatically handles host failures and initiates recovery
+ - Monitoring and fault reporting for:
+
+ - Cluster connectivity
+ - Critical process failures
+ - Resource utilization thresholds, interface states
+ - H/W faults and sensors, host watchdog
+ - Activity progress reporting
+
+ - Interfaces with the board management controller (BMC)