Prefer equality in boolean comparisons #34166

ranma42 · 2024-07-05T10:31:55Z

In most databases, equality comparisons can take advantage of indexing and inequalities cannot.

roji · 2024-07-11T16:42:36Z

test/EFCore.Sqlite.FunctionalTests/Query/NullSemanticsQuerySqliteTest.cs

@@ -839,7 +839,7 @@ public override async Task Rewrite_compare_bool_with_bool(bool async)
            """
 SELECT "e"."Id"
 FROM "Entities1" AS "e"
-WHERE "e"."BoolA" <> "e"."NullableBoolB"
+WHERE "e"."BoolA" = (NOT ("e"."NullableBoolB"))


Is this a positive change? I doubt any database out there will use index with equality and NOT, more than it would for inequality, no? Should we make this change more targeted, so that it doesn't do this specific transformation?

Is this a positive change?

It is a positive change, at least as long as we do not analyze the provenance of the columns.
In this specific case, it is basically irrelevant: the whole table is going to be scanned linearly (just once).
If the left column and the right column came from different tables, it would be much more efficient.

I doubt any database out there will use index with equality and NOT, more than it would for inequality, no?

At least Sqlite does. This is basically an instance of #34048 (comment)

Should we make this change more targeted, so that it doesn't do this specific transformation?

Do you expect worse plans on some db?

I'll add some examples in the issue #34164.

Added Sqlite and Postgres examples 🚀

So between a <> b and a = NOT(b), the former certainly seems more natural, and what I'd expect a standard SQL query to look like. If these perform the same, I'd definitely prefer the first, at least for readability etc. (and we do generally care about that).

I could alsi imagine a database where the planner optimizes to use an index on b (effectively transforming the inequality into a not on a), where this change would cause a regression. of course, this is entirely speculative, and I have an actually looked into which databases basis to which optimizations.

I'd prefer us to do a bit more cross database research before merging a change like this, which at the very least makes our SQL less readable/standard/expected (and that does tend to have some correlation sometimes with performance). If this address is a very specific Sqlite behavior, where equality is always better, we always have the option of doing this change for sqlite only.

OK, I wrote the above comment before noticing you posted data on other databases… some remarks:

For the Sqlite case, have you confirmed that the second option, where an index is built, is actually faster than the first?

For the SQL Server case, the total subtree cost is actually higher with the second method.

I will perform some measurement, but I do not have real code/an actual database that is using this filter; I will try to do some synthetic examples (I'll try to cover some interesting cases, but a real-world case would definitely be more relevant).

Some (synthetic) benchmarks have been posted in #34164

ranma42 · 2024-07-12T12:14:33Z

Maybe I should have explained this in advance: this is not a new/different approach to translate boolean (in)equalities; it is just making the code more consistent in choosing = over <> (as per the issue).

Ideally there should be no duplication of this code, hence it should be "inevitably" consistent.

ranma42 · 2024-07-13T17:46:41Z

I cleaned up the code a little (the same equality conversion logic is now shared between OptimizeComparison and RewriteNullSemantics.

ranma42 · 2024-07-29T19:32:47Z

Rebased to resolve conflicts

ranma42 · 2024-12-23T12:30:20Z

Rebased to resolve conflicts

…Entities Problem was that in EF9 we moved some optimizations from sql nullability processor to SqlExpressionFactory (so that we optimize things early). One of the optimizations: ``` !(true == a) -> false == a !(false == a) -> true == a ``` is not safe to do when a is a constant or parameter null value, because it evaluates to true IS NULL (two value logic). Fix is to remove this optimization from SqlExpressionFactory and instead put it back into OptimizeNotExpression, once we've converted everything we could to IS NULL checks already. Fixes dotnet#35393

src/EFCore.Relational/Query/SqlNullabilityProcessor.cs

They can take advantage of indexing.

These are the baselines regenerated after a rebase on top of 35fc423.

ranma42 · 2025-01-06T10:22:08Z

This is currently marked as draft as it is based on (an old version of):

Fix to #35393 - GroupJoin in EF Core 9 Returns Null for Joined Entities #35395

maumar · 2025-01-07T01:28:28Z

/azp run

azure-pipelines · 2025-01-07T01:28:40Z

Azure Pipelines successfully started running 1 pipeline(s).

maumar · 2025-01-07T01:31:19Z

@ranma42 I think it makes sense to use this PR in favor of 35395 as the basis for the fix (if we iron out all the kinks), at least for main. We could consider scoping it for patch, but that can be a separate discussion

ranma42 · 2025-01-07T07:30:41Z

👍 @maumar your fix is still needed, at least the part that drops the invalid optimization from SqlExpressionFactory, but we can probably avoid re-introducing the code in SqlNullabilityProcessor.
I will work on cleaning this up a bit and experiment with showing & fixing the regression that occurs on value-converted types as mentioned in #35395 (comment)

maumar · 2025-01-14T09:11:09Z

@ranma42 also consider extending Rewrite_compare_bool_with_bool test to include constants. I did that as part of the patch fix and it indeed caught the error e12547f

ranma42 force-pushed the prefer-equal branch from 785b752 to 80a7450 Compare July 9, 2024 22:30

roji reviewed Jul 11, 2024

View reviewed changes

ranma42 mentioned this pull request Jul 12, 2024

Prefer equality when comparing values #34164

Open

ranma42 force-pushed the prefer-equal branch 2 times, most recently from 7f57ec3 to 32d05b3 Compare July 13, 2024 17:45

ranma42 force-pushed the prefer-equal branch from 32d05b3 to 3a4ff39 Compare July 13, 2024 17:55

ranma42 mentioned this pull request Jul 27, 2024

Avoid duplicating complex expression in comparisons #34172

Open

ranma42 force-pushed the prefer-equal branch from 3a4ff39 to a16fd4f Compare July 29, 2024 19:32

ranma42 force-pushed the prefer-equal branch from a16fd4f to b79642e Compare July 29, 2024 21:10

maumar assigned roji Sep 25, 2024

ranma42 force-pushed the prefer-equal branch from b79642e to 09f58e6 Compare December 23, 2024 12:29

ranma42 requested a review from a team as a code owner December 23, 2024 12:29

ranma42 mentioned this pull request Dec 23, 2024

Implement IS [NOT] DISTINCT FROM translation #34048

Draft

ranma42 mentioned this pull request Jan 1, 2025

Fix to #35393 - GroupJoin in EF Core 9 Returns Null for Joined Entities #35395

Closed

ranma42 commented Jan 1, 2025

View reviewed changes

src/EFCore.Relational/Query/SqlNullabilityProcessor.cs Show resolved Hide resolved

ranma42 marked this pull request as draft January 1, 2025 09:06

ranma42 added 3 commits January 1, 2025 10:11

Prefer equality in boolean comparisons

50078f2

They can take advantage of indexing.

Update baselines

aae5e34

Update baselines

0eb325a

These are the baselines regenerated after a rebase on top of 35fc423.

ranma42 force-pushed the prefer-equal branch from 09f58e6 to 0eb325a Compare January 1, 2025 11:18

This was referenced Jan 10, 2025

[release/9.0-staging] Fix to #35393 - GroupJoin in EF Core 9 Returns Null for Joined Entities #35448

Closed

GroupJoin in EF Core 9 Returns Null for Joined Entities #35393

Closed

AndriySvyryd added the community-contribution label Jan 28, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Prefer equality in boolean comparisons #34166

Prefer equality in boolean comparisons #34166

ranma42 commented Jul 5, 2024

roji Jul 11, 2024

ranma42 Jul 12, 2024 •

edited

Loading

ranma42 Jul 12, 2024

ranma42 Jul 12, 2024

roji Jul 12, 2024

roji Jul 12, 2024

ranma42 Jul 12, 2024

ranma42 Jan 1, 2025

ranma42 commented Jul 12, 2024

ranma42 commented Jul 13, 2024

ranma42 commented Jul 29, 2024

ranma42 commented Dec 23, 2024

ranma42 commented Jan 6, 2025 •

edited

Loading

maumar commented Jan 7, 2025

azure-pipelines bot commented Jan 7, 2025

maumar commented Jan 7, 2025 •

edited

Loading

ranma42 commented Jan 7, 2025

maumar commented Jan 14, 2025

Prefer equality in boolean comparisons #34166

Are you sure you want to change the base?

Prefer equality in boolean comparisons #34166

Conversation

ranma42 commented Jul 5, 2024

roji Jul 11, 2024

Choose a reason for hiding this comment

ranma42 Jul 12, 2024 • edited Loading

Choose a reason for hiding this comment

ranma42 Jul 12, 2024

Choose a reason for hiding this comment

ranma42 Jul 12, 2024

Choose a reason for hiding this comment

roji Jul 12, 2024

Choose a reason for hiding this comment

roji Jul 12, 2024

Choose a reason for hiding this comment

ranma42 Jul 12, 2024

Choose a reason for hiding this comment

ranma42 Jan 1, 2025

Choose a reason for hiding this comment

ranma42 commented Jul 12, 2024

ranma42 commented Jul 13, 2024

ranma42 commented Jul 29, 2024

ranma42 commented Dec 23, 2024

ranma42 commented Jan 6, 2025 • edited Loading

maumar commented Jan 7, 2025

azure-pipelines bot commented Jan 7, 2025

maumar commented Jan 7, 2025 • edited Loading

ranma42 commented Jan 7, 2025

maumar commented Jan 14, 2025

ranma42 Jul 12, 2024 •

edited

Loading

ranma42 commented Jan 6, 2025 •

edited

Loading

maumar commented Jan 7, 2025 •

edited

Loading