Some performance improvements #6

JCBrouwer · 2023-02-18T14:23:32Z

Some progress on #4

Refactor optimal_textures function into OptimalTextures module (this makes it easier to use PyTorch's JIT and compile APIs)
Enable batch support
Update defaults and add some performance related arguments to CLI:

  --no_tf32             Disable tf32 format (probably slower).
  --cudnn_benchmark     Enable CUDNN benchmarking (probably slower unless doing a high number of iterations).
  --compile             Use PyTorch 2.0 compile function to optimize the model.
  --script              Use PyTorch JIT script function to optimize the model.
  --device DEVICE       Which device to run on.
  --memory_format {contiguous,channels_last}
                        Which memory format to use for optimization.
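As a rough illustration of how flags like these could be wired up, here is a minimal sketch. The flag names, choices, and help strings come from the help output above. The defaults, the helper names `build_parser` and `apply_performance_args`, and the choice to treat `--script` and `--compile` as alternatives are assumptions for illustration, not the repository's actual code.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    # Flag names mirror the help output above; the defaults are assumptions.
    parser = argparse.ArgumentParser(description="Performance-related options (sketch)")
    parser.add_argument("--no_tf32", action="store_true", help="Disable tf32 format.")
    parser.add_argument("--cudnn_benchmark", action="store_true", help="Enable CUDNN benchmarking.")
    parser.add_argument("--compile", action="store_true", help="Use PyTorch 2.0 compile function.")
    parser.add_argument("--script", action="store_true", help="Use PyTorch JIT script function.")
    parser.add_argument("--device", default="cpu", help="Which device to run on.")
    parser.add_argument(
        "--memory_format",
        choices=["contiguous", "channels_last"],
        default="contiguous",
        help="Which memory format to use for optimization.",
    )
    return parser


def apply_performance_args(model, args):
    """Hypothetical helper: apply the parsed flags to a torch.nn.Module."""
    import torch  # imported lazily so the parser can be used without PyTorch installed

    # TF32 affects matmuls and cuDNN convolutions on GPUs that support it.
    torch.backends.cuda.matmul.allow_tf32 = not args.no_tf32
    torch.backends.cudnn.allow_tf32 = not args.no_tf32
    torch.backends.cudnn.benchmark = args.cudnn_benchmark

    model = model.to(args.device)
    if args.memory_format == "channels_last":
        model = model.to(memory_format=torch.channels_last)

    # Assumption: scripting and compiling are treated as alternatives here.
    if args.script:
        model = torch.jit.script(model)
    elif args.compile:
        model = torch.compile(model)
    return model
```

For example, `build_parser().parse_args(["--no_tf32", "--memory_format", "channels_last"])` yields a namespace that `apply_performance_args` can consume together with any `torch.nn.Module`.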

On my 1080 Ti this version is almost twice as fast for a batch size of 1. GPU utilization is definitely still low, but increasing the batch size helps a lot, and I think this is a good springboard for realizing more gains.
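One way to check how throughput scales with batch size is a small timing harness like the sketch below. The names `images_per_second` and `compare_batch_sizes` are hypothetical, and `step` stands in for whatever runs one optimization pass at a given batch size. When timing GPU work, `step` should end with `torch.cuda.synchronize()` so that asynchronous kernels are included in the measurement.

```python
import time
from typing import Callable, Dict, Iterable


def images_per_second(step: Callable[[int], None], batch_size: int, repeats: int = 3) -> float:
    """Call step(batch_size) `repeats` times and return images processed per second."""
    start = time.perf_counter()
    for _ in range(repeats):
        step(batch_size)
    elapsed = time.perf_counter() - start
    # Guard against a zero interval when step is trivially fast.
    return batch_size * repeats / max(elapsed, 1e-9)


def compare_batch_sizes(
    step: Callable[[int], None], batch_sizes: Iterable[int], repeats: int = 3
) -> Dict[int, float]:
    """Measure throughput for each batch size, in the order given."""
    return {b: images_per_second(step, b, repeats) for b in batch_sizes}
```

A warm-up call before timing is advisable when `--compile`, `--script`, or `--cudnn_benchmark` is in use, since the first iterations include one-off compilation or autotuning cost.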

JCBrouwer added 4 commits February 18, 2023 12:14

make cholesky histogram matching the default, script a few functions,…

addfdc7

… minor clean-ups

refactor function to torch.nn.Module, add batch support

450b931

ensure module is torch.jit.script'able

94ae342

minor changes to performance-related args

6f47947

JCBrouwer merged commit 693556f into main Feb 18, 2023
