Scientific paper
2003-05-30
Computer Science
Distributed, Parallel, and Cluster Computing
Talk from the 2003 Computing in High Energy and Nuclear Physics conference (CHEP03), La Jolla, CA, USA, March 2003, 5 pages
The BaBar online data acquisition (DAQ) system includes approximately fifty Unix systems that collectively implement the level-three trigger. All of these systems run the same code, but each has its own state, which is expected to change in response to changes in the overall DAQ system. A specialized subsystem has been developed to initiate processing on this collection of systems and to monitor them, both for error conditions and to ensure that they all follow the same state trajectory within a specifiable period of time. This subsystem receives start commands from the main DAQ run control system and reports major coherent state changes, as well as error conditions, back to the run control system. The state monitoring subsystem has the novel feature that it knows nothing about the state machines it is monitoring, and hence does not introduce any fundamentally new state machine into the overall system. This feature makes it trivially applicable to other multi-node subsystems. Indeed, it has already found a second application within the BaBar experiment, beyond the level-three trigger.
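The monitoring idea in the abstract can be illustrated with a minimal sketch. This is not the BaBar implementation: the class name `CoherenceMonitor`, its methods, the event tuples, and the state labels in the usage note are all hypothetical. The sketch treats states as opaque labels and only compares them for equality, so it needs no knowledge of the monitored state machine. It assumes the first node to report a new state defines the expected next step of the trajectory, and that timestamps are supplied by the caller.

```python
from typing import Dict, Hashable, List, Tuple

# Hypothetical event shape: (kind, state label, nodes involved).
Event = Tuple[str, Hashable, Tuple[str, ...]]


class CoherenceMonitor:
    """Checks that all nodes follow the same sequence of opaque states."""

    def __init__(self, nodes: List[str], timeout: float) -> None:
        self.nodes = list(nodes)
        self.timeout = timeout
        # Number of trajectory steps each node has completed so far.
        self.steps_seen: Dict[str, int] = {n: 0 for n in self.nodes}
        # Shared trajectory: (state label, time it was first reported).
        self.trajectory: List[Tuple[Hashable, float]] = []
        # Number of steps already reported as coherent.
        self.coherent_steps = 0

    def report(self, node: str, state: Hashable, now: float) -> List[Event]:
        """Record a state change from one node; return any resulting events."""
        i = self.steps_seen[node]
        if i == len(self.trajectory):
            # First node to reach a new step defines the expected state.
            self.trajectory.append((state, now))
        elif self.trajectory[i][0] != state:
            # The node left the common trajectory; do not count the step.
            return [("diverged", state, (node,))]
        self.steps_seen[node] = i + 1

        events: List[Event] = []
        while (self.coherent_steps < len(self.trajectory)
               and all(c > self.coherent_steps
                       for c in self.steps_seen.values())):
            events.append(
                ("coherent", self.trajectory[self.coherent_steps][0], ()))
            self.coherent_steps += 1
        return events

    def check(self, now: float) -> List[Event]:
        """Report nodes that have not followed the oldest pending step in time."""
        k = self.coherent_steps
        if k >= len(self.trajectory):
            return []
        state, first_seen = self.trajectory[k]
        if now - first_seen <= self.timeout:
            return []
        late = tuple(n for n in self.nodes if self.steps_seen[n] <= k)
        return [("timeout", state, late)]
```

In this sketch a caller would feed `report()` with each node's state changes and poll `check()` periodically, forwarding "coherent", "diverged", and "timeout" events to whatever plays the role of run control. Labels such as "configured" or "running" are only examples; any hashable value works because the monitor never interprets them.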
Bartoldus, Rainer
Dubois-Felsmann, Gregory P.
Hamilton, James A.
Title: A Generic Multi-node State Monitoring Subsystem