Creating cpu flame graphs for Istio / Envoy

Flame graph shows how much time was spent in a particular function.

The width of a stack element (flame) indicates the relative time spent in the function.
Call stacks are plotted vertically.
Colors are arbitrary.
Function names are sorted left to right.

This document shows how to gather performance data from via the perf container.

Setup the perf container

Enable profilingMode in values.yaml. This will end up adding the perf container to the server and client pods, which both will be running on separate nodes.

Flame graphs and visualizations are created from data collected using linux perf_events by the perf and BCC tools, as well as Envoy's built-in profiler.

Obtaining flame graphs

Flame graphs can be produced via runner.py, and will be stored in flame/flameoutput.

A few sample command line arguments. {duration} will be replaced by whatever was passed for --duration to runner.py. {sidecar_pid} will be replaced by runner.py with the process id of the Envoy sidecar.

It is valid to omit {sidecar_pid} in --custom_profiling_command. This may be useful for machine-wide profiling or arbitrary processes.

runner/runner.py ... --custom_profiling_command="profile-bpfcc -df {duration} -p {sidecar_pid}" --custom_profiling_name="bcc-oncputime-sidecar"

runner/runner.py ... --custom_profiling_command="offcputime-bpfcc -df {duration} -p {sidecar_pid}" --custom_profiling_name="bcc-offcputime-sidecar"

runner/runner.py ... --custom_profiling_command="offwaketime-bpfcc -df {duration} -p {sidecar_pid}" --custom_profiling_name="bcc-offwaketime-sidecar"

runner/runner.py ... --custom_profiling_command="wakeuptime-bpfcc -f -p {sidecar_pid} {duration}" --custom_profiling_name="bcc-wakeuptime-sidecar"

runner/runner.py ... --custom_profiling_command="stackcount-bpfcc -p {sidecar_pid} *alloc* -fD {duration}" --custom_profiling_name="bcc-stackcount-alloc"

runner/runner.py ... --custom_profiling_command="perf record -F 99 -g -p {sidecar_pid} -- sleep {duration} && perf script | ~/FlameGraph/stackcollapse-perf.pl | c++filt -n"
--custom_profiling_name="perf-oncputime-sidecar"

runner/runner.py ... --custom_profiling_command="perf record -e page-faults -g -p {sidecar_pid} -- sleep {duration} && perf script | ~/FlameGraph/stackcollapse-perf.pl | c++filt -n" --custom_profiling_name="perf-pagefaults-sidecar"

Leveraging Istio's sidecar built-in profiling

Istio's sidecar proxy (Envoy) is usually build with tcmalloc, and as such traditional memory profiling methods may give unsatisfactory results. Fortunately, the proxy provides a built-in means to collect profiling data, and the benchmark tool is able to leverage that. Doing so, however, requires a writeable file system as well as priviliges to install new packages for the sidecar containers. The following command (re)configures istio to satisfy these requirements:

istioctl manifest apply --set "values.global.proxy.enableCoreDump=true" --set "values.global.proxy.privileged=true"

After doing so runner.py can be run with --envoy_profiler [heapprofiler|cpuprofiler]. This will start/stop the built-in profilers of the sidecars, obtain the collected profiles, and visualize them via pprof. The resulting output will end up in flame/flameoutput just like in the other flows:

runner/runner.py --envoy_profiler cpuprofiler|heapprofiler ...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Creating cpu flame graphs for Istio / Envoy

Setup the perf container

Obtaining flame graphs

Leveraging Istio's sidecar built-in profiling

Files

README.md

Latest commit

History

README.md

File metadata and controls

Creating cpu flame graphs for Istio / Envoy

Setup the perf container

Obtaining flame graphs

Leveraging Istio's sidecar built-in profiling