Skip to content

b4735

Compare
Choose a tag to compare
@github-actions github-actions released this 17 Feb 13:49
73e2ed3
CUDA: use async data loading for FlashAttention (#11894)

* CUDA: use async data loading for FlashAttention

---------

Co-authored-by: Diego Devesa <[email protected]>