Use
Case: Executive, Executive Package: Exec_CheckSystem
When the ALMA Observing System is active it will continously check
if all required resources are available. If malfunctions are
detected the Operator is alerted. S/he can then try to solve
problems e.g. by shutting down or re-initializing the relevant
systems.
Goal: Check the system status/alert operator of
problems.
Contact Author: P. Grosbol
Role(s)/Actor(s):
Primary:
Secondary:
Priority:
Critical
Performance:
On the order of seconds
Frequency of Use:
A few times per week
Preconditions:
- The ALMA software system is running.
- Main operator GUI is available.
Basic
Course:
- The operator notice that a component/subsystem is in an error
state as indicated by the operator GUI.
- Looking on the error tree provided is determines which subsystems
are effected.
- The relevant components/subsystems are shutdown which may include
suspension of observations.
- Other maintenance actions are initiated (e.g. change of hardware)
- The relevant subsystems are started again
- The GUI indicates that all systems are operational.
- Normal operations is resumed including start of observations
- Alternative Course:
- Standard re-initiation of components/subsystems does not solve
the failure.
- Full system is shutdown
- Detailed error logs are analyzed
- Appropriate patched are provided and system restarted.
Postconditions:
- Failing subsystem/component is working
- Executive continues to monitor system performance
Issues
to be Determined or Resolved:
- none at this time
Last modified: 31oct03