|
| 1 | +HPCG-Benchmark |
| 2 | +version=3.0 |
| 3 | +Release date=November 11, 2015 |
| 4 | +Machine Summary= |
| 5 | +Machine Summary::Distributed Processes=2 |
| 6 | +Machine Summary::Threads per processes=1 |
| 7 | +Global Problem Dimensions= |
| 8 | +Global Problem Dimensions::Global nx=64 |
| 9 | +Global Problem Dimensions::Global ny=32 |
| 10 | +Global Problem Dimensions::Global nz=32 |
| 11 | +Processor Dimensions= |
| 12 | +Processor Dimensions::npx=2 |
| 13 | +Processor Dimensions::npy=1 |
| 14 | +Processor Dimensions::npz=1 |
| 15 | +Local Domain Dimensions= |
| 16 | +Local Domain Dimensions::nx=32 |
| 17 | +Local Domain Dimensions::ny=32 |
| 18 | +Local Domain Dimensions::Lower ipz=0 |
| 19 | +Local Domain Dimensions::Upper ipz=0 |
| 20 | +Local Domain Dimensions::nz=32 |
| 21 | +########## Problem Summary ##########= |
| 22 | +Setup Information= |
| 23 | +Setup Information::Setup Time=0.390614 |
| 24 | +Linear System Information= |
| 25 | +Linear System Information::Number of Equations=65536 |
| 26 | +Linear System Information::Number of Nonzero Terms=1678840 |
| 27 | +Multigrid Information= |
| 28 | +Multigrid Information::Number of coarse grid levels=3 |
| 29 | +Multigrid Information::Coarse Grids= |
| 30 | +Multigrid Information::Coarse Grids::Grid Level=1 |
| 31 | +Multigrid Information::Coarse Grids::Number of Equations=8192 |
| 32 | +Multigrid Information::Coarse Grids::Number of Nonzero Terms=198904 |
| 33 | +Multigrid Information::Coarse Grids::Number of Presmoother Steps=1 |
| 34 | +Multigrid Information::Coarse Grids::Number of Postsmoother Steps=1 |
| 35 | +Multigrid Information::Coarse Grids::Grid Level=2 |
| 36 | +Multigrid Information::Coarse Grids::Number of Equations=1024 |
| 37 | +Multigrid Information::Coarse Grids::Number of Nonzero Terms=22264 |
| 38 | +Multigrid Information::Coarse Grids::Number of Presmoother Steps=1 |
| 39 | +Multigrid Information::Coarse Grids::Number of Postsmoother Steps=1 |
| 40 | +Multigrid Information::Coarse Grids::Grid Level=3 |
| 41 | +Multigrid Information::Coarse Grids::Number of Equations=128 |
| 42 | +Multigrid Information::Coarse Grids::Number of Nonzero Terms=2200 |
| 43 | +Multigrid Information::Coarse Grids::Number of Presmoother Steps=1 |
| 44 | +Multigrid Information::Coarse Grids::Number of Postsmoother Steps=1 |
| 45 | +########## Memory Use Summary ##########= |
| 46 | +Memory Use Information= |
| 47 | +Memory Use Information::Total memory used for data (Gbytes)=0.0468871 |
| 48 | +Memory Use Information::Memory used for OptimizeProblem data (Gbytes)=0 |
| 49 | +Memory Use Information::Bytes per equation (Total memory / Number of Equations)=715.44 |
| 50 | +Memory Use Information::Memory used for linear system and CG (Gbytes)=0.041255 |
| 51 | +Memory Use Information::Coarse Grids= |
| 52 | +Memory Use Information::Coarse Grids::Grid Level=1 |
| 53 | +Memory Use Information::Coarse Grids::Memory used=0.00493516 |
| 54 | +Memory Use Information::Coarse Grids::Grid Level=2 |
| 55 | +Memory Use Information::Coarse Grids::Memory used=0.000618744 |
| 56 | +Memory Use Information::Coarse Grids::Grid Level=3 |
| 57 | +Memory Use Information::Coarse Grids::Memory used=7.8136e-05 |
| 58 | +########## V&V Testing Summary ##########= |
| 59 | +Spectral Convergence Tests= |
| 60 | +Spectral Convergence Tests::Result=PASSED |
| 61 | +Spectral Convergence Tests::Unpreconditioned= |
| 62 | +Spectral Convergence Tests::Unpreconditioned::Maximum iteration count=11 |
| 63 | +Spectral Convergence Tests::Unpreconditioned::Expected iteration count=12 |
| 64 | +Spectral Convergence Tests::Preconditioned= |
| 65 | +Spectral Convergence Tests::Preconditioned::Maximum iteration count=2 |
| 66 | +Spectral Convergence Tests::Preconditioned::Expected iteration count=2 |
| 67 | +Departure from Symmetry |x'Ay-y'Ax|/(2*||x||*||A||*||y||)/epsilon= |
| 68 | +Departure from Symmetry |x'Ay-y'Ax|/(2*||x||*||A||*||y||)/epsilon::Result=PASSED |
| 69 | +Departure from Symmetry |x'Ay-y'Ax|/(2*||x||*||A||*||y||)/epsilon::Departure for SpMV=2.47586e-06 |
| 70 | +Departure from Symmetry |x'Ay-y'Ax|/(2*||x||*||A||*||y||)/epsilon::Departure for MG=6.05509e-07 |
| 71 | +########## Iterations Summary ##########= |
| 72 | +Iteration Count Information= |
| 73 | +Iteration Count Information::Result=PASSED |
| 74 | +Iteration Count Information::Reference CG iterations per set=50 |
| 75 | +Iteration Count Information::Optimized CG iterations per set=50 |
| 76 | +Iteration Count Information::Total number of reference iterations=50 |
| 77 | +Iteration Count Information::Total number of optimized iterations=50 |
| 78 | +########## Reproducibility Summary ##########= |
| 79 | +Reproducibility Information= |
| 80 | +Reproducibility Information::Result=PASSED |
| 81 | +Reproducibility Information::Scaled residual mean=1.26595e-16 |
| 82 | +Reproducibility Information::Scaled residual variance=0 |
| 83 | +########## Performance Summary (times in sec) ##########= |
| 84 | +Benchmark Time Summary= |
| 85 | +Benchmark Time Summary::Optimization phase=2.38419e-07 |
| 86 | +Benchmark Time Summary::DDOT=0.0234666 |
| 87 | +Benchmark Time Summary::WAXPBY=0.0181344 |
| 88 | +Benchmark Time Summary::SpMV=0.101883 |
| 89 | +Benchmark Time Summary::MG=0.605319 |
| 90 | +Benchmark Time Summary::Total=0.748903 |
| 91 | +Floating Point Operations Summary= |
| 92 | +Floating Point Operations Summary::Raw DDOT=1.97919e+07 |
| 93 | +Floating Point Operations Summary::Raw WAXPBY=1.97919e+07 |
| 94 | +Floating Point Operations Summary::Raw SpMV=1.71242e+08 |
| 95 | +Floating Point Operations Summary::Raw MG=9.50444e+08 |
| 96 | +Floating Point Operations Summary::Total=1.16127e+09 |
| 97 | +Floating Point Operations Summary::Total with convergence overhead=1.16127e+09 |
| 98 | +GB/s Summary= |
| 99 | +GB/s Summary::Raw Read B/W=9.56503 |
| 100 | +GB/s Summary::Raw Write B/W=2.21106 |
| 101 | +GB/s Summary::Raw Total B/W=11.7761 |
| 102 | +GB/s Summary::Total with convergence and optimization phase overhead=11.1923 |
| 103 | +GFLOP/s Summary= |
| 104 | +GFLOP/s Summary::Raw DDOT=0.843406 |
| 105 | +GFLOP/s Summary::Raw WAXPBY=1.0914 |
| 106 | +GFLOP/s Summary::Raw SpMV=1.68077 |
| 107 | +GFLOP/s Summary::Raw MG=1.57015 |
| 108 | +GFLOP/s Summary::Raw Total=1.55063 |
| 109 | +GFLOP/s Summary::Total with convergence overhead=1.55063 |
| 110 | +GFLOP/s Summary::Total with convergence and optimization phase overhead=1.47376 |
| 111 | +User Optimization Overheads= |
| 112 | +User Optimization Overheads::Optimization phase time (sec)=2.38419e-07 |
| 113 | +User Optimization Overheads::Optimization phase time vs reference SpMV+MG time=1.85134e-05 |
| 114 | +DDOT Timing Variations= |
| 115 | +DDOT Timing Variations::Min DDOT MPI_Allreduce time=0.0137126 |
| 116 | +DDOT Timing Variations::Max DDOT MPI_Allreduce time=0.0247626 |
| 117 | +DDOT Timing Variations::Avg DDOT MPI_Allreduce time=0.0192376 |
| 118 | +Final Summary= |
| 119 | +Final Summary::HPCG result is VALID with a GFLOP/s rating of=1.47376 |
| 120 | +Final Summary::HPCG 2.4 rating for historical reasons is=1.55063 |
| 121 | +Final Summary::Reference version of ComputeDotProduct used=Performance results are most likely suboptimal |
| 122 | +Final Summary::Reference version of ComputeSPMV used=Performance results are most likely suboptimal |
| 123 | +Final Summary::Reference version of ComputeMG used=Performance results are most likely suboptimal |
| 124 | +Final Summary::Reference version of ComputeWAXPBY used=Performance results are most likely suboptimal |
| 125 | +Final Summary::Results are valid but execution time (sec) is=0.748903 |
| 126 | +Final Summary::You have selected the QuickPath option=Results are official for legacy installed systems with confirmation from the HPCG Benchmark leaders. |
| 127 | +Final Summary::After confirmation please upload results from the YAML file contents to=http://hpcg-benchmark.org |
0 commit comments