Computer Science – Performance
Scientific paper
2003-06-14
Talk from the 2003 Computing in High Energy and Nuclear Physics (CHEP03), La Jolla, CA, USA, March 2003, 6 pages, LaTeX, 7 eps
The Grid Datafarm architecture is designed for global petascale data-intensive computing. It provides a global parallel filesystem with online petascale storage, scalable I/O bandwidth, and scalable parallel processing, and it can exploit local I/O in a grid of clusters with tens of thousands of nodes. One of its features is that it manages file replicas in filesystem metadata for fault tolerance and load balancing. This paper discusses and evaluates several techniques to support fast file replication over long distances. The Grid Datafarm manages a ranked group of files as a Gfarm file; each file in the group, called a Gfarm file fragment, is stored on one filesystem node or replicated on several. Each Gfarm file fragment is replicated independently and in parallel using rate-controlled HighSpeed TCP with network striping. On a US-Japan testbed spanning 10,000 km, we achieve 419 Mbps using 2 nodes on each side, and 741 Mbps using 4 nodes, out of 893 Mbps over two transpacific networks.
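The idea of replicating each fragment independently, in parallel, and under a rate limit can be illustrated with a minimal sketch. This is not the Gfarm implementation or its API: the function names `paced_copy` and `replicate_fragments` are hypothetical, fragments are plain in-memory byte strings, and the rate control is simple application-level pacing rather than HighSpeed TCP with network striping.

```python
import time
from concurrent.futures import ThreadPoolExecutor
from typing import Dict


def paced_copy(src: bytes, rate_bps: float, chunk: int = 64 * 1024) -> bytes:
    """Copy src chunk by chunk, sleeping so the average rate stays at or
    below rate_bps (bits per second). Hypothetical helper, not a Gfarm call."""
    out = bytearray()
    start = time.monotonic()
    sent = 0
    for off in range(0, len(src), chunk):
        piece = src[off:off + chunk]
        out += piece
        sent += len(piece)
        # Earliest time at which `sent` bytes may have been transferred.
        due = start + sent * 8 / rate_bps
        delay = due - time.monotonic()
        if delay > 0:
            time.sleep(delay)
    return bytes(out)


def replicate_fragments(fragments: Dict[int, bytes],
                        total_rate_bps: float,
                        streams: int) -> Dict[int, bytes]:
    """Replicate each fragment independently on its own paced stream.

    At most `streams` copies run at once, each limited to an equal share
    of total_rate_bps, so the aggregate rate stays within the total.
    """
    per_stream = total_rate_bps / streams
    with ThreadPoolExecutor(max_workers=streams) as pool:
        futures = {idx: pool.submit(paced_copy, data, per_stream)
                   for idx, data in fragments.items()}
        return {idx: fut.result() for idx, fut in futures.items()}


if __name__ == "__main__":
    frags = {i: bytes([i]) * 100_000 for i in range(4)}
    replicas = replicate_fragments(frags, total_rate_bps=50e6, streams=4)
    print(all(replicas[i] == frags[i] for i in frags))
```

Splitting the total rate evenly across streams is only one possible policy, chosen here for simplicity; a real deployment would set per-stream rates according to the capacity of each network path.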
Matsuoka Satoshi
Morita Youhei
Sekiguchi Satoshi
Soda Noriyuki
Tatebe Osamu
Paper title: Worldwide Fast File Replication on Grid Datafarm