[UR][PERF] add L0, UR, and SYCL Sin Kernel Graph benchmark #17087
pbalcer commented Feb 20, 2025
1681da8 to dd85b9a
dd85b9a to 636fc2d
Compute Benchmarks level_zero run (with params: --filter "SinKernelGraph"):
Benchmarks level_zero run (--filter "SinKernelGraph"):
Summary (emphasized values are the best results)
Improved 2 (threshold 2.00%)

Performance change in benchmark groups

Compute Benchmarks
Relative perf in group SinKernelGraph 5 (6)
Relative perf in group SinKernelGraph 100 (6)
Relative perf in group SubmitKernel (7)
Relative perf in group Other (17)
Relative perf in group SinKernelGraph (2)
Relative perf in group SubmitGraph (3)
Relative perf in group ExecGraph (3)
Relative perf in group SubmitKernel CPU count (3)

Velocity Bench
Relative perf in group Other (5)

SYCL-Bench
Relative perf in group Other (53)
Benchmark details - environment, command...

graph_api_benchmark_sycl SinKernelGraph graphs:0, numKernels:5
Command: /home/test-user/llvm_bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_sycl --test=SinKernelGraph --csv --noHeaders --iterations=10000 --numKernels=5 --withGraphs=0 --withCopyOffload=1 --immediateAppendCmdList=0

graph_api_benchmark_sycl SinKernelGraph graphs:1, numKernels:5
Command: /home/test-user/llvm_bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_sycl --test=SinKernelGraph --csv --noHeaders --iterations=10000 --numKernels=5 --withGraphs=1 --withCopyOffload=1 --immediateAppendCmdList=0

graph_api_benchmark_sycl SinKernelGraph graphs:0, numKernels:100
Command: /home/test-user/llvm_bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_sycl --test=SinKernelGraph --csv --noHeaders --iterations=10000 --numKernels=100 --withGraphs=0 --withCopyOffload=1 --immediateAppendCmdList=0

graph_api_benchmark_sycl SinKernelGraph graphs:1, numKernels:100
Command: /home/test-user/llvm_bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_sycl --test=SinKernelGraph --csv --noHeaders --iterations=10000 --numKernels=100 --withGraphs=1 --withCopyOffload=1 --immediateAppendCmdList=0

graph_api_benchmark_l0 SinKernelGraph graphs:0, numKernels:5
Command: /home/test-user/llvm_bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_l0 --test=SinKernelGraph --csv --noHeaders --iterations=10000 --numKernels=5 --withGraphs=0 --withCopyOffload=1 --immediateAppendCmdList=0

graph_api_benchmark_l0 SinKernelGraph graphs:1, numKernels:5
Command: /home/test-user/llvm_bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_l0 --test=SinKernelGraph --csv --noHeaders --iterations=10000 --numKernels=5 --withGraphs=1 --withCopyOffload=1 --immediateAppendCmdList=0

graph_api_benchmark_l0 SinKernelGraph graphs:0, numKernels:100
Command: /home/test-user/llvm_bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_l0 --test=SinKernelGraph --csv --noHeaders --iterations=10000 --numKernels=100 --withGraphs=0 --withCopyOffload=1 --immediateAppendCmdList=0

graph_api_benchmark_l0 SinKernelGraph graphs:1, numKernels:100
Command: /home/test-user/llvm_bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_l0 --test=SinKernelGraph --csv --noHeaders --iterations=10000 --numKernels=100 --withGraphs=1 --withCopyOffload=1 --immediateAppendCmdList=0

graph_api_benchmark_ur SinKernelGraph graphs:0, numKernels:5
Command: /home/test-user/llvm_bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_ur --test=SinKernelGraph --csv --noHeaders --iterations=10000 --numKernels=5 --withGraphs=0 --withCopyOffload=1 --immediateAppendCmdList=0

graph_api_benchmark_ur SinKernelGraph graphs:1, numKernels:5
Command: /home/test-user/llvm_bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_ur --test=SinKernelGraph --csv --noHeaders --iterations=10000 --numKernels=5 --withGraphs=1 --withCopyOffload=1 --immediateAppendCmdList=0

graph_api_benchmark_ur SinKernelGraph graphs:0, numKernels:100
Command: /home/test-user/llvm_bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_ur --test=SinKernelGraph --csv --noHeaders --iterations=10000 --numKernels=100 --withGraphs=0 --withCopyOffload=1 --immediateAppendCmdList=0

graph_api_benchmark_ur SinKernelGraph graphs:1, numKernels:100
Command: /home/test-user/llvm_bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_ur --test=SinKernelGraph --csv --noHeaders --iterations=10000 --numKernels=100 --withGraphs=1 --withCopyOffload=1 --immediateAppendCmdList=0
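The level_zero commands listed above follow one pattern: three binaries (sycl, l0, ur) crossed with two kernel counts and two graph settings. As a rough sketch only, the same matrix could be generated like this. The helper names (`sin_kernel_graph_command`, `all_commands`) and the module layout are hypothetical and are not taken from the PR's scripts; only the binary names, path, and flags are copied from the commands shown in the run output.

```python
from itertools import product

# Path and flag values are copied from the benchmark run output above.
# Everything else (function names, constants) is illustrative, not the PR's code.
BIN_DIR = "/home/test-user/llvm_bench_workdir/compute-benchmarks-build/bin"
RUNTIMES = ("sycl", "l0", "ur")
NUM_KERNELS = (5, 100)
WITH_GRAPHS = (0, 1)


def sin_kernel_graph_command(runtime, num_kernels, with_graphs, iterations=10000):
    """Build the argv list for a single SinKernelGraph benchmark run."""
    return [
        f"{BIN_DIR}/graph_api_benchmark_{runtime}",
        "--test=SinKernelGraph",
        "--csv",
        "--noHeaders",
        f"--iterations={iterations}",
        f"--numKernels={num_kernels}",
        f"--withGraphs={with_graphs}",
        "--withCopyOffload=1",
        "--immediateAppendCmdList=0",
    ]


def all_commands():
    """Enumerate every runtime / kernel-count / graph-mode combination."""
    return [
        sin_kernel_graph_command(runtime, num_kernels, with_graphs)
        for runtime, num_kernels, with_graphs in product(
            RUNTIMES, NUM_KERNELS, WITH_GRAPHS
        )
    ]


if __name__ == "__main__":
    # Print one shell-style command line per combination.
    for argv in all_commands():
        print(" ".join(argv))
```

Passing each list to `subprocess.run` would execute the benchmark, assuming the binaries exist at that path on the machine in question.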
Compute Benchmarks level_zero_v2 run (with params: --filter "SinKernelGraph"):
Benchmarks level_zero_v2 run (--filter "SinKernelGraph"):
Summary (emphasized values are the best results)
Improved 2 (threshold 2.00%)

Performance change in benchmark groups

Compute Benchmarks
Relative perf in group SinKernelGraph 5 (5)
Relative perf in group SinKernelGraph 100 (5)
Relative perf in group SubmitKernel (7)
Relative perf in group Other (17)
Relative perf in group SinKernelGraph (2)
Relative perf in group SubmitGraph (3)
Relative perf in group ExecGraph (3)
Relative perf in group SubmitKernel CPU count (3)

Velocity Bench
Relative perf in group Other (5)

SYCL-Bench
Relative perf in group Other (53)
Benchmark details - environment, command...

graph_api_benchmark_sycl SinKernelGraph graphs:0, numKernels:5
Command: /home/test-user/llvm_bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_sycl --test=SinKernelGraph --csv --noHeaders --iterations=10000 --numKernels=5 --withGraphs=0 --withCopyOffload=1 --immediateAppendCmdList=0

graph_api_benchmark_sycl SinKernelGraph graphs:1, numKernels:5
Command: /home/test-user/llvm_bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_sycl --test=SinKernelGraph --csv --noHeaders --iterations=10000 --numKernels=5 --withGraphs=1 --withCopyOffload=1 --immediateAppendCmdList=0

graph_api_benchmark_sycl SinKernelGraph graphs:0, numKernels:100
Command: /home/test-user/llvm_bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_sycl --test=SinKernelGraph --csv --noHeaders --iterations=10000 --numKernels=100 --withGraphs=0 --withCopyOffload=1 --immediateAppendCmdList=0

graph_api_benchmark_sycl SinKernelGraph graphs:1, numKernels:100
Command: /home/test-user/llvm_bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_sycl --test=SinKernelGraph --csv --noHeaders --iterations=10000 --numKernels=100 --withGraphs=1 --withCopyOffload=1 --immediateAppendCmdList=0

graph_api_benchmark_l0 SinKernelGraph graphs:0, numKernels:5
Command: /home/test-user/llvm_bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_l0 --test=SinKernelGraph --csv --noHeaders --iterations=10000 --numKernels=5 --withGraphs=0 --withCopyOffload=1 --immediateAppendCmdList=0

graph_api_benchmark_l0 SinKernelGraph graphs:1, numKernels:5
Command: /home/test-user/llvm_bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_l0 --test=SinKernelGraph --csv --noHeaders --iterations=10000 --numKernels=5 --withGraphs=1 --withCopyOffload=1 --immediateAppendCmdList=0

graph_api_benchmark_l0 SinKernelGraph graphs:0, numKernels:100
Command: /home/test-user/llvm_bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_l0 --test=SinKernelGraph --csv --noHeaders --iterations=10000 --numKernels=100 --withGraphs=0 --withCopyOffload=1 --immediateAppendCmdList=0

graph_api_benchmark_l0 SinKernelGraph graphs:1, numKernels:100
Command: /home/test-user/llvm_bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_l0 --test=SinKernelGraph --csv --noHeaders --iterations=10000 --numKernels=100 --withGraphs=1 --withCopyOffload=1 --immediateAppendCmdList=0

graph_api_benchmark_ur SinKernelGraph graphs:0, numKernels:5
Command: /home/test-user/llvm_bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_ur --test=SinKernelGraph --csv --noHeaders --iterations=10000 --numKernels=5 --withGraphs=0 --withCopyOffload=1 --immediateAppendCmdList=0

graph_api_benchmark_ur SinKernelGraph graphs:0, numKernels:100
Command: /home/test-user/llvm_bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_ur --test=SinKernelGraph --csv --noHeaders --iterations=10000 --numKernels=100 --withGraphs=0 --withCopyOffload=1 --immediateAppendCmdList=0
@intel/unified-runtime-reviewers please review. The failures are unrelated, since this PR only adds new scripts. The new tests work, as evidenced by the benchmark runs above.
@intel/llvm-gatekeepers please merge. The failures in