Yu sdm 2006 ft

Published on November 21, 2007

Author: Justine

Source: authorstream.com

Content

Testing Efficiency of Parallel I/O Software
Weikuan Yu, Jeffrey Vetter
December 6, 2006

Testing Parallel IO at ORNL:
  Earlier analysis of scientific codes running at ORNL/LCF
    Most users have limited I/O capability in their applications because they have not had access to portable, widespread PIO
    Seldom direct use of MPI-IO
    Little use of high-level IO middleware: PnetCDF or HDF5
    Large variance in performance of parallel IO using different software stacks (as demonstrated in VH1 experiments)
  Ongoing work
    Collect application IO access patterns and Lustre server IO traces with tau, craypat, mpiP, etc.
    Testing other parallel IO components over Lustre
    Analysis, benchmarking and optimization of data-intensive scientific codes

Parallel IO Optimization at ORNL:
  Parallel IO over Lustre
    A new file system that still relies on a generic ADIO implementation
    Generations of platforms at ORNL demand efficient parallel IO
  Performance with Jaguar
    Good read/write bandwidth for a large shared single file
    Not scalable for small reads/writes and parallel IO management operations (metadata)
  Approaches for Optimizations
    Providing a specific ADIO implementation well-tuned for Lustre
    Investigating parameters for adjusting striping pattern
    Exploited Lustre file joining
      Regular files can be joined in place
      Split writing and hierarchical striping
    Developed a prototype on an 80-node Linux cluster
    Paper submitted to CCGrid 2006, available if interested

Some Characteristics of Lustre IO Performance:
  Performance can be significantly affected by stripe width
  Need to introduce flexibility in striping pattern
  Exploit file joining for growing stripe width with increasing file size

Explore Lustre File Joining:
  Split writing
    Create/write a shared file as multiple small files, aka subfiles
    Temporary structure to hold file attributes
    Subfiles joined at closing time
  [Diagram of Split Writing: open, read/write, close; file attributes; subfiles; joined file]

Hierarchical Striping:
  Hierarchical striping
    Create another level of striping pattern for subfiles
    Allow maximum coverage of Lustre storage targets
    Mitigate the impact of striping overhead
  [Diagram of Hierarchical Striping (HS): stripes 0 to S-1 in subfile 0, S to 2S-1 in subfile 1, nS onward in subfile n; HS width: N+1; HS size: S*w; each subfile has stripe width 2 and stripe size w; subfile 0 on ost0 and ost1, subfile 1 on ost2 and ost3, subfile n on ost2n and ost2n+1]

Evaluation:
  Table 1: Scalability of Management Operations
    Scalability of file open and file resize improved dramatically
  Table 2: Performance of Collective Read/Write
    Write/read performance improved dramatically for new files
    Read/write of an existing join file performs poorly due to a non-optimized IO path for join files in Lustre

Results on Scientific Benchmarks – MPI-Tile-IO and BT/IO:
  The IO pattern represented by BT-IO can be improved if the number of iterations is small. It may help if an arbitrary number of files can be joined.
  Write performance in MPI-Tile-IO can be improved dramatically
  Read performance in MPI-Tile-IO cannot be improved by file joining because reading an existing join file does not perform well

Conclusions:
  Parallel IO over Lustre
    Split writing can improve metadata management operations
    Stripe overhead can be mitigated with careful augmentations of stripe width
  Lustre file joining
    Race conditions when multiple processes join files
    Low read/write performance on an existing file
    Arbitrary hierarchical striping is not possible because only a limited number of files can be joined
    Needs improvement before production use for parallel IO
  Next Steps
    Continue optimization of parallel IO at ORNL
    Adapting the earlier techniques to liblustre on XT3/XT4
    Develop/exploit other features: group locks and dynamic stripe width
    Adapting parallel I/O and parallel FS to wide-area collaborative science with other IO protocols such as pNFS and Logistical Networking
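
The hierarchical striping layout from the Hierarchical Striping slide can be made concrete with a small address calculation. The sketch below is only an illustration and rests on assumptions read off the diagram labels rather than stated by the slides: stripes 0 to S-1 go to subfile 0, S to 2S-1 to subfile 1, and so on, wrapping around after the last subfile; each subfile is striped round-robin over its own pair of OSTs (stripe width 2, stripe size w). The names `locate`, `Location` and `num_subfiles` are hypothetical and do not come from Lustre or from the authors' prototype.

```python
from typing import NamedTuple


class Location(NamedTuple):
    subfile: int         # index of the subfile holding the byte
    ost: int             # global OST index (ost0, ost1, ...)
    subfile_offset: int  # byte offset inside that subfile


def locate(offset: int, w: int, S: int, num_subfiles: int,
           stripe_width: int = 2) -> Location:
    """Map a logical byte offset in the shared file to a subfile and OST.

    Assumed layout (inferred from the diagram, not confirmed by the slides):
    one hierarchical stripe is S consecutive stripes of w bytes (S*w bytes),
    hierarchical stripes are dealt round-robin to the subfiles, and subfile i
    is striped round-robin over OSTs stripe_width*i .. stripe_width*i + stripe_width - 1.
    """
    if offset < 0 or min(w, S, num_subfiles, stripe_width) <= 0:
        raise ValueError("offset must be >= 0 and all sizes must be positive")
    stripe = offset // w                      # global stripe number
    hs_unit = stripe // S                     # which S*w-byte hierarchical stripe
    subfile = hs_unit % num_subfiles          # subfile that receives this unit
    hs_round = hs_unit // num_subfiles        # units already stored in that subfile
    stripe_in_subfile = hs_round * S + stripe % S
    ost = subfile * stripe_width + stripe_in_subfile % stripe_width
    subfile_offset = stripe_in_subfile * w + offset % w
    return Location(subfile, ost, subfile_offset)


if __name__ == "__main__":
    # Toy parameters: 4-byte stripes, 3 stripes per hierarchical stripe, 2 subfiles.
    for off in (0, 5, 12, 24, 47):
        print(off, locate(off, w=4, S=3, num_subfiles=2))
```

With these toy parameters, offset 12 is the first byte of stripe S, so it lands at the start of subfile 1 on ost2, and offset 24 wraps back to subfile 0. Under this assumed layout a file with n+1 subfiles touches 2(n+1) OSTs while each subfile keeps a stripe width of only 2, which is one way to read the slide's claim of wide OST coverage with limited striping overhead.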
