Add EnzymeRules #103

sethaxen · 2023-05-22T14:04:44Z

Will fix #99

sethaxen · 2023-05-22T14:16:17Z

For some reason, I can't seem to get the extension to work. Package precompilation fails with the error:

ERROR: The following 1 direct dependency failed to precompile:

AbstractFFTs [621f4979-c628-5d54-868e-fcf4e3e8185c]

Failed to precompile AbstractFFTs [621f4979-c628-5d54-868e-fcf4e3e8185c] to "/home/runner/.julia/compiled/v1.9/AbstractFFTs/jl_mYHZQL".
ERROR: LoadError: ArgumentError: Package AbstractFFTs does not have LinearAlgebra in its dependencies:
- You may have a partially installed environment. Try `Pkg.instantiate()`
  to ensure all packages in the environment are installed.
- Or, if you have AbstractFFTs checked out for development and have
  added LinearAlgebra as a dependency but haven't updated your primary
  environment's manifest file, try `Pkg.resolve()`.
- Otherwise you may need to report an issue with AbstractFFTs

although LinearAlgebra is clearly listed as both a dep and a weak dep.

Weirder still, if I activate the project, it now says it's empty, whereas if I remove this extension, it shows the dependencies:

julia> using Pkg; Pkg.activate(".")
  Activating project at `~/projects/AbstractFFTs.jl`

julia> Pkg.status()
Project AbstractFFTs v1.3.1
Status `~/projects/AbstractFFTs.jl/Project.toml` (empty project)

shell> head ./Project.toml
name = "AbstractFFTs"
uuid = "621f4979-c628-5d54-868e-fcf4e3e8185c"
version = "1.3.1"

[deps]
ChainRulesCore = "d360d2e6-b24c-11e9-a2a3-2a2ae2dbcce4"
LinearAlgebra = "37e2e46d-f89d-539d-b4ee-838fcccc9c8e"

[weakdeps]
ChainRulesCore = "d360d2e6-b24c-11e9-a2a3-2a2ae2dbcce4"

@KristofferC I've never had this problem with my extensions before. Do you know what could cause this?

sethaxen · 2023-05-22T22:27:20Z

Nevermind, it seems extensions cannot have weak deps that are also deps. In this case, the dep needs to be loaded within the extension from the main package, see e.g. JuliaStats/LogExpFunctions.jl#63

codecov · 2023-05-22T22:34:40Z

Codecov Report

Patch coverage has no change and project coverage change: -8.48 ⚠️

Comparison is base (a25656d) 87.08% compared to head (859abf0) 78.60%.

Additional details and impacted files

@@            Coverage Diff             @@
##           master     #103      +/-   ##
==========================================
- Coverage   87.08%   78.60%   -8.48%     
==========================================
  Files           3        4       +1     
  Lines         209      229      +20     
==========================================
- Hits          182      180       -2     
- Misses         27       49      +22

Impacted Files	Coverage Δ
ext/AbstractFFTsEnzymeCoreExt.jl	`0.00% <0.00%> (ø)`

... and 1 file with indirect coverage changes

☔ View full report in Codecov by Sentry.
📢 Do you have feedback about the report comment? Let us know in this issue.

sethaxen · 2023-06-30T10:34:42Z

If #67 is merged, we could add rules for *(::Plan, ::StridedArray), so long as the plan is Const (if it's non-Const, then we would need the rule to support it being an in-place plan, which we can't do).

GiggleLiu · 2023-07-14T02:32:07Z

ext/AbstractFFTsEnzymeCoreExt.jl

+    y::DuplicatedOrBatchDuplicated{<:StridedArray{T}},
+    p::Const{<:AbstractFFTs.Plan{T}},
+    x::DuplicatedOrBatchDuplicated{<:StridedArray{T}},
+) where {T}


I wish the type T can be restricted to a finite set, e.g. BLAS number types, otherwise, it may produce incorrect gradients for user defined extensions. Generally speaking, I feel "generic" AD is not a good practise.

The pushforward of a linear operator is always itself. And so far as I know, every definition of an FFT is a linear operator. So I can see no reasons why this rule should be problematic for forward-mode.

For example, I may want to extended FFT with tropical numbers, which is not a real number. It is linear, but does not have an inverse. Then your rule would give me incorrect gradients without throwing an error. I have seen too many incorrect gradients in previous AD frameworks such as Zygote when handling complex numbers.

I agree it is good to have a generic backward routine there, but please constraint the interfaces to concrete types when porting it to an AD engine. It should not be so difficult for users to extend the list of supported types in the future. Defining fft rules on BLAS types would be good enough to cover most using cases. For those non-BLAS types, honestly we can not make any assumption for them. Julia community needs an AD engine with provable correctness, I think it is also one of the goals of Enzyme.

I may want to extended FFT with tropical numbers

Is this really an FFT per se? I would consider a DFT generalized to some other ring to be a different transform.

I may want to extended FFT with tropical numbers

Is this really an FFT per se? I would consider a DFT generalized to some other ring to be a different transform.

Since Julia does not have a good trait system, I think it is in general impossible to restrict users to input what the functions are designed for. This is what I meant there lacks provable correctness.

It has been a big issue that none of the Julia libraries (except Enzyme) can provide reliable gradients. They claim too much on untested using cases, like complex numbers and tropical numbers. There has been a belief that "it is cool if the code works in cases that it is not expected to work". But no, untested rules are not reliable, they can break on any future change even it works now. Rules must be concrete and tested, they are easy to extend, but hard to debug.

By that argument, no AD rules should be defined here anyways, since downstream a user could define a custom Plan that doesn't do any kind of FFT at all. Then even with BLAS number types and strides arrays, any rule we write here would be wrong.

The counterargument is that if a user adds a method of a function whose properties are well-documented, other code should be able to assume and depend on those properties when calling the method for arbitrary inputs.

Taken to its logical conclusion, wouldn't your principle require that rules are never defined for abstract types, and further, that the type of every argument is concrete and known to the rule implementer?

wouldn't your principle require that rules are never defined for abstract types, and further, that the type of every argument is concrete and known to the rule implementer?

A big YES. I do not think many people need the backward rules for non-BLAS types. You may want to support e.g. double float that defined in DoubleFloat.jl. I would argue in these using cases, users can port the generic rule to the AD framework with little effort. The rule can be generic, but when porting it to the AD framework, it should be concrete.

We have to decide between support more data types and ensure the correctness. I really wish there can be a trait system that user can tell the compiler "this element type is a field", then users can use the rule with more confidence. Facts obvious to you, like "fft should work on field rather than other rings" may not be obvious to others.

The counterargument is that if a user adds a method of a function whose properties are well-documented, other code should be able to assume and depend on those properties when calling the method for arbitrary inputs.

To differentiate a long code, I will let the code fly and see where it falls. I will add new rules to the AD engine to keep it flying. It is not a problem for me if a rule does not exist. So when using a new element type, like complex number, symbolic type, finite field algebra or the Tropical number type as mentioned above, I will probably not check whether the property of each function is as documented.

Then even with BLAS number types and strides arrays, any rule we write here would be wrong.

A warning will be thrown when overloading an existing function. Also, pirating is not difficult to avoid.

In any case, if we have ChainRules I think we should have the corresponding EnzymeRules.

If users make the questionable choice of overriding fft to compute an unrelated function, then it is up to them to override the EnzymeRules/ChainRules as well.

sethaxen · 2023-08-26T19:42:36Z

I've paused work on this until EnzymeTestUtils (EnzymeAD/Enzyme.jl#782) is registered, which will make testing these rules reliably much more straightforward.

sethaxen · 2023-09-13T13:21:14Z

Coming back to this, I think Enzyme rules should only be defined here abstractly for cases where we know they will not be breaking downstream code that otherwise Enzyme would have handled fine. So I agree with the following restrictions:

Restrict eltypes to BLAS types
Restrict array types to StridedArrays
only have rules for fft, fft!, and other other variants. In general we cannot tell if a plan is in-place or not. If we can catch cases where it is Const (i.e. Enzyme has inferred it is not used to carry any derivative information) without breaking fall backs, then great, but otherwise we don't define the rule.

These rules are considerably stricter than the ChainRules and for good reason. ChainRules are by convention often defined to cover up indexing code and mutating code to help Zygote and Diffractor, but this comes at the cost of doing the wrong thing for lots of types, hence the ProjectTo mechanism. Enzyme, on the other hand, can in principle handle many more types well, so we want to avoid writing rules that do the wrong thing for any cases where with no rule Enzyme would have worked fine.

Rules for * with Plans can be define in packages like FFTW where the type informs the in-placeness of the plan.

danielwe · 2024-08-15T05:11:31Z

Any hope of having this PR revived? Enzyme has come a long way lately and FFT support would be another great step forward.

sethaxen · 2024-08-18T19:58:35Z

Any hope of having this PR revived? Enzyme has come a long way lately and FFT support would be another great step forward.

I'm afraid I don't have the bandwidth now to revive this. I still think #103 (comment) is the right way forward, and this PR is a good starting point for someone who wants to take it on.

sethaxen added 2 commits May 22, 2023 15:58

Add EnzymeCore extension

450bc49

Add Enzyme as test dependency

f4c7a7e

sethaxen added 2 commits May 22, 2023 22:05

List LinearAlgebra only as dependency

06bef3a

Add forward-mode rules

859abf0

devmotion mentioned this pull request Jul 4, 2023

Chain rules for FFT plans via AdjointPlans #67

Merged

GiggleLiu reviewed Jul 14, 2023

View reviewed changes

danielwe mentioned this pull request Aug 15, 2024

Missing rules for FFTW unsafe_execute EnzymeAD/Enzyme.jl#1717

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add EnzymeRules #103

Add EnzymeRules #103

sethaxen commented May 22, 2023

sethaxen commented May 22, 2023

sethaxen commented May 22, 2023

codecov bot commented May 22, 2023 •

edited

Loading

sethaxen commented Jun 30, 2023

GiggleLiu Jul 14, 2023 •

edited

Loading

sethaxen Jul 14, 2023

GiggleLiu Jul 14, 2023 •

edited

Loading

stevengj Jul 16, 2023

GiggleLiu Jul 16, 2023 •

edited

Loading

sethaxen Jul 16, 2023

GiggleLiu Jul 16, 2023 •

edited

Loading

stevengj Aug 23, 2023

sethaxen commented Aug 26, 2023 •

edited

Loading

sethaxen commented Sep 13, 2023

danielwe commented Aug 15, 2024

sethaxen commented Aug 18, 2024

Add EnzymeRules #103

Are you sure you want to change the base?

Add EnzymeRules #103

Conversation

sethaxen commented May 22, 2023

sethaxen commented May 22, 2023

sethaxen commented May 22, 2023

codecov bot commented May 22, 2023 • edited Loading

Codecov Report

sethaxen commented Jun 30, 2023

GiggleLiu Jul 14, 2023 • edited Loading

Choose a reason for hiding this comment

sethaxen Jul 14, 2023

Choose a reason for hiding this comment

GiggleLiu Jul 14, 2023 • edited Loading

Choose a reason for hiding this comment

stevengj Jul 16, 2023

Choose a reason for hiding this comment

GiggleLiu Jul 16, 2023 • edited Loading

Choose a reason for hiding this comment

sethaxen Jul 16, 2023

Choose a reason for hiding this comment

GiggleLiu Jul 16, 2023 • edited Loading

Choose a reason for hiding this comment

stevengj Aug 23, 2023

Choose a reason for hiding this comment

sethaxen commented Aug 26, 2023 • edited Loading

sethaxen commented Sep 13, 2023

danielwe commented Aug 15, 2024

sethaxen commented Aug 18, 2024

codecov bot commented May 22, 2023 •

edited

Loading

GiggleLiu Jul 14, 2023 •

edited

Loading

GiggleLiu Jul 14, 2023 •

edited

Loading

GiggleLiu Jul 16, 2023 •

edited

Loading

GiggleLiu Jul 16, 2023 •

edited

Loading

sethaxen commented Aug 26, 2023 •

edited

Loading