-
Notifications
You must be signed in to change notification settings - Fork 51
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Clarify definitions of "default device" and "current device" #835
Comments
Actually maybe |
Yes, creation functions indeed. The
Correct indeed. May also be global state rather than a context manager (e.g., PyTorch has both - the global one is
Yes agreed, creation functions will use the current device. Both may be useful to inspect indeed. I believe we didn't consider this in detail because the standard has no way to set the current device, however I think that shouldn't stop us from adding an inspection function.
It's not necessary indeed, it'd just be nicer syntax for |
It's also worth noting that the "default real floating-point dtype" (another notion from the same APIs) can change at runtime in PyTorch using the |
While the value of knowing the current device used when one passes FWIW, I just fell into this misunderstanding while writing scipy/scipy#22756. |
JAX has interpreted d = xp.__array_namespace_info__().default_device()
a = xp.asarray(0, device=d)
assert a.device == d This will also fail: assert d in xp.__array_namespace_info__().devices I think it's reasonable to say that a backend offers no guarantee of using the same device when one isn't explicitly pinned, but such a quirk should be spelled out by the Standard. |
We were doing our best to interpret https://data-apis.org/array-api/2023.12/design_topics/device_support.html in a way that was compatible with JAX's pre-existing array creation and device placement semantics, where the default Please let me know if there's something in the specification that we've misinterpreted. |
@jakevdp, when you say
I'm reading the JAX documentation and it gives me the impression that when you call a creation function ( |
This is true to an extent: when executing eagerly (i.e. outside JIT) the buffer behind the array has to be allocated somewhere, and if
Within JIT-compiled code, there is not necessarily any physical buffer associated with a python array variable, and so the device specification is meaningless and will be ignored. If a buffer is in fact needed, the compiler will allocate bytes on the device that makes sense in the context of the overall compiled computation. That could be the same backend as the default or different, depending on the nature of the compiled code, the parameters (e.g. sharding specifications) passed to the |
I should add: this is one reason that prior to the array API standard, we never implemented a |
The inspection API mentions "default device" in several places, for instance
However, as far as I can tell, this term is never actually defined anywhere. I thought it might be defined at https://data-apis.org/array-api/latest/design_topics/device_support.html#device-support (or at https://data-apis.org/array-api/latest/purpose_and_scope.html#terms-and-definitions), but it doesn't seem to be. Are there APIs where the default device should be used? I thought this would be the case for creation functions, but the phrase "default device" never appears (e.g., at https://data-apis.org/array-api/latest/API_specification/generated/array_api.empty.html#array_api.empty). Presumably this is what is meant by default device, but are there other instances where "default device" should be used.
Another related concept that appears in some places but is never really defined is "current device" (for example,
dtypes()
should use the "current device" whendevice=None
https://data-apis.org/array-api/latest/API_specification/generated/array_api.info.dtypes.html#array_api.info.dtypes). It would be helpful to clarify the difference between "current device" and "default device" and when one should be used and when the other should be used.Actually the creation functions never really state clearly what should happen when
device=None
, except forasarray
and the*_like
functions, which say the device should be inferred fromx
.My understanding is that for libraries with a context manager, the "current device" is the device set in the current context, whereas the "default device" is the device used when no context is set (the "current device" would be the same as the "default device" in this case). Is this correct? By this reasoning creation functions should actually use the "current device" when
device=None
, not the "default device". But is it also true thatdefault_device()
should always return the "default device" regardless of the current context? It seems like this would be less useful than "current device". Do we need a separatecurrent_device()
inspection API?Or is it actually the case that these terms were both meant to mean the same thing (i.e., both mean the current device used by default in the current context)? If that's the case, then the note at https://data-apis.org/array-api/latest/API_specification/generated/array_api.info.default_dtypes.html#array_api.info.default_dtypes doesn't make much sense to me. And if so, we should unify this terminology to avoid confusion (probably the term "default device" should be preferred since that is the name of the inspection function).
The text was updated successfully, but these errors were encountered: