Tools and Techniques for Managing Clusters for SciDAC Lattice QCD at Fermilab

Computer Science – Distributed, Parallel, and Cluster Computing

Scientific paper

Details

Talk from the 2003 Computing in High Energy and Nuclear Physics conference (CHEP03), La Jolla, CA, USA, March 2003. 4 pages, PDF. PSN TUI

Fermilab operates several clusters for lattice gauge computing, with minimal manpower available to manage them. To cope with this task, we have written a number of tools and developed supporting techniques. We describe tools that use the IPMI facilities of our systems for hardware management tasks such as remote power control, remote system resets, and health monitoring. We discuss techniques that use network booting to install and upgrade the operating system on these computers and to reload BIOS and other firmware. Finally, we discuss our tools for parallel command processing and their use in monitoring and administering the PBS batch queue system used on our clusters.
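
The abstract does not show the tools themselves, so the following is only a minimal sketch of what "parallel command processing" across cluster nodes can look like, not the authors' implementation. It assumes nodes are reachable through a command-line launcher such as `ssh`; the function names `run_on_node` and `run_on_nodes`, the `launcher` parameter, and the node names are hypothetical and introduced here for illustration.

```python
import subprocess
from concurrent.futures import ThreadPoolExecutor

# Assumption: ssh with key-based login is the transport. The paper's
# abstract does not say which remote-execution mechanism its tools use.
DEFAULT_LAUNCHER = ("ssh", "-o", "BatchMode=yes")


def run_on_node(node, command, launcher=DEFAULT_LAUNCHER, timeout=30.0):
    """Run one command on one node; return (node, returncode, output)."""
    argv = [*launcher, node, command]
    try:
        proc = subprocess.run(
            argv, capture_output=True, text=True, timeout=timeout
        )
        return node, proc.returncode, proc.stdout.strip()
    except subprocess.TimeoutExpired:
        # A hung node must not stall the whole sweep.
        return node, None, "timed out"
    except OSError as exc:
        # The launcher itself could not be started.
        return node, None, str(exc)


def run_on_nodes(nodes, command, launcher=DEFAULT_LAUNCHER,
                 max_workers=32, timeout=30.0):
    """Run the same command on every node concurrently.

    Returns a dict mapping node name to (returncode, output), where
    returncode is None if the command timed out or could not start.
    """
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = [
            pool.submit(run_on_node, node, command, launcher, timeout)
            for node in nodes
        ]
        results = [future.result() for future in futures]
    return {node: (rc, out) for node, rc, out in results}
```

A call such as `run_on_nodes(["node01", "node02"], "uptime")` (hypothetical host names) would collect one result per node. Threads are a reasonable fit here because each worker spends its time waiting on a remote process, and the per-node timeout keeps a single unresponsive machine from blocking a monitoring pass over the cluster.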
