Constant stack size #2688

Open · wants to merge 3 commits into master

Conversation

@timcassell (Collaborator) commented Jan 16, 2025

Fixes #1120.

  1. Refactored the engine stages so that the stack size is constant for each benchmark invocation.
  2. Applied AggressiveOptimization to engine methods to eliminate tiered JIT as a potential variable in the engine when running iterations (see also Apply AggressiveOptimization to clocks AndreyAkinshin/perfolizer#19). A sketch of the attribute usage follows this list.
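As a rough illustration (not the actual engine code; the method and parameter names here are made up), this is what applying AggressiveOptimization to an engine method looks like. The attribute tells the runtime to skip tiered compilation and fully optimize the method on first use, so tier-up recompilation can't show up mid-measurement:

```csharp
using System;
using System.Runtime.CompilerServices;

public class EngineSketch
{
    // Hypothetical stand-in for an engine iteration method.
    // AggressiveOptimization bypasses tiered JIT, so this method is compiled
    // with full optimizations the first time it runs instead of being
    // re-jitted after it becomes hot.
    [MethodImpl(MethodImplOptions.AggressiveOptimization)]
    public static void RunIteration(long invokeCount, Action workload)
    {
        for (long i = 0; i < invokeCount; i++)
            workload();
    }
}
```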

I tested this on a Ryzen 7 9800X3D and got the same results on both master and this PR; I could not reproduce the measurement spikes from the issue (the graph was smooth).

I tested on Apple M3 and got these results.

Master: (results screenshot)

PR: (results screenshot)

Observations to note

  • The results with this PR stay much less spiky for longer.
    • I can't say why the last part of the graph is also spiky. It could be that the MacBook Air I ran on throttled, but I can't prove it (the benchmark took a very long time to run, so I only ran it once). I also did not include the changes from Apply AggressiveOptimization to clocks AndreyAkinshin/perfolizer#19 in this test, which could also be a factor.
  • The results with this PR are larger, landing at the upper end of the spikes seen with master.
    • I suspect the benchmarks are sensitive to the alignment of the stack when they are invoked. I'm not sure whether BDN should take this into account by default, and if so, what it should do about it (we already have [MemoryRandomization], which randomizes the stack for each iteration; see the sketch after this list).
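For reference, a minimal sketch of how the existing attribute is applied (the benchmark class and method here are illustrative, not taken from BDN's samples):

```csharp
using BenchmarkDotNet.Attributes;

// [MemoryRandomization] makes BDN randomize the memory/stack layout before
// each iteration, so an alignment-sensitive benchmark shows its variance
// instead of locking onto one particular alignment for the whole run.
[MemoryRandomization]
public class AlignmentSensitiveBenchmarks
{
    private int _field;

    [Benchmark]
    public void Increment() => _field++;
}
```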

@timcassell (Collaborator, Author) commented Feb 13, 2025

Another curiosity I found while working on #2336: unrolling the calls affects performance strangely on ARM (Apple M3), while it does what you'd expect on x86-64. (A conceptual sketch of the two invocation strategies follows the numbers below.)

Benchmark of just _field++ (M3):

| Strategy   | Overhead | Workload | Diff   |
|------------|---------:|---------:|-------:|
| Unroll x16 |    0.911 |    0.635 | -0.275 |
| NoUnroll   |    0.900 |    1.240 |  0.339 |
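To make the comparison concrete, here is a hand-written illustration of the two invocation strategies (not the code the engine actually generates):

```csharp
// NoUnroll calls the workload once per loop iteration; Unroll x16 repeats
// the call 16 times in the loop body so the loop overhead is amortized
// across more workload invocations.
public class InvocationSketch
{
    private int _field;

    private void Workload() => _field++;

    public void NoUnroll(long invokeCount)
    {
        for (long i = 0; i < invokeCount; i++)
            Workload();
    }

    public void UnrollX16(long invokeCount)
    {
        for (long i = 0; i < invokeCount; i += 16)
        {
            Workload(); Workload(); Workload(); Workload();
            Workload(); Workload(); Workload(); Workload();
            Workload(); Workload(); Workload(); Workload();
            Workload(); Workload(); Workload(); Workload();
        }
    }
}
```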

timcassell and others added 3 commits February 15, 2025 15:09
Apply AggressiveOptimization to engine methods.

Successfully merging this pull request may close these issues: Large Spike from WorkloadWarmup to WorkloadActual (#1120)