More floating point codegen #1599

ltratt · 2025-02-10T15:11:12Z

We previously didn't handle select with floating point values -- which I've now seen in the wild! Please check the codegen properly for this: it made my head hurt.

Conditional moves for xmm registers confused me and since this is not a common thing we yet see (I've seen it once in real code), I'm not too worried about making it perfectly fast yet.

The first of these caught a genuine bug (that we didn't handle `select` and floats properly), so it's worth running them more often.

vext01 · 2025-02-10T16:51:46Z

ykrt/src/compile/jitc_yk/codegen/x64/mod.rs

+        match inst.trueval(self.m).bitw(self.m) {
+            32 => {
+                dynasm!(self.asm
+                    ;   bt Rd(cond_reg.code()), 0


Would the fcmov instruction help here? That would bring this in parity with the integer version.

You tell me. I ended up baffled by this; the documentation on this stuff is much sparser -- and frankly, much worse -- than for general purpose registers. Personally I'd be happy to get something we're confident is correct in and optimise it later.

Now I look at it, fcmov uses the old fashioned floating point stack and not the xmm registers, so we can't (and shouldn't) use it.

Your codegen looks correct to me. AI tells me there is no xmm equivalent to fcmov and gave me a technique similar to what you have done here.

Sounds like we can merge this then?

ltratt added 2 commits February 10, 2025 15:04

select can take ints and floats.

81764d1

Conditional moves for xmm registers confused me and since this is not a common thing we yet see (I've seen it once in real code), I'm not too worried about making it perfectly fast yet.

Convert two cheapish debug_asserts to assert.

6fbefcd

The first of these caught a genuine bug (that we didn't handle `select` and floats properly), so it's worth running them more often.

ltratt assigned vext01 Feb 10, 2025

vext01 reviewed Feb 10, 2025

View reviewed changes

vext01 added this pull request to the merge queue Feb 11, 2025

Merged via the queue into ykjit:master with commit 9ede83a Feb 11, 2025
2 checks passed

ltratt deleted the more_floating_point_codegen branch February 14, 2025 17:02

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

More floating point codegen #1599

More floating point codegen #1599

ltratt commented Feb 10, 2025

vext01 Feb 10, 2025

ltratt Feb 10, 2025

vext01 Feb 10, 2025

ltratt Feb 10, 2025

More floating point codegen #1599

More floating point codegen #1599

Conversation

ltratt commented Feb 10, 2025

vext01 Feb 10, 2025

Choose a reason for hiding this comment

ltratt Feb 10, 2025

Choose a reason for hiding this comment

vext01 Feb 10, 2025

Choose a reason for hiding this comment

ltratt Feb 10, 2025

Choose a reason for hiding this comment