Designing a high-availability cluster for the Subaru Telescope second generation observation control system

Astronomy and Astrophysics – Astronomy

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Scientific paper

Subaru Telescope is commissioning a second-generation Observation Control System (OCS), building upon a 10 hear history of using the first generation OCS. One of the primary lessons learned about maintaining a distributed OCS system is that the idea of individual computer nodes specialized for specific functions greatly complicates troubleshooting and failover, even with a dedicated "hot spare" for each specialized node. In contrast, the Generation 2 (Gen2) system was designed from the ground up around the principle of a High-Availability (HA) cluster, commonly used for high-traffic, mission-critical web sites. In such a cluster, nodes are not specialized, and any node can perform any function of the OCS. We describe the problems encountered in trying to troubleshoot and manage failure on the legacy OCS system and describe the architectural design of the HA cluster for the new system, including special characteristics designed for the high-altitude, remote environment of the summit of Mauna Kea, where there is a greatly increased probability of such failures. Although the focus is primarily on the hardware, we touch upon the software architecture written to take advantage of the features of the HA cluster design. Finally, we outline the advantages of the new system and show how the design greatly facilitates troubleshooting, robustness and ease of failure management. The results may be of interest to anyone designing a distributed system using COTS hardware and open-source software to withstand failure and improve manageability in a remote environment.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Designing a high-availability cluster for the Subaru Telescope second generation observation control system does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Designing a high-availability cluster for the Subaru Telescope second generation observation control system, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Designing a high-availability cluster for the Subaru Telescope second generation observation control system will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-1386622

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.