Enrich filter statistics predictions with estimated column boundaries #4518

Closed
isidentical opened this issue Dec 5, 2022 · 0 comments · Fixed by #4519
Labels
enhancement New feature or request

@isidentical (Contributor)

Is your feature request related to a problem or challenge? Please describe what you are trying to do.
The current implementation of the filter statistics hides all column statistics, which makes it hard for downstream cost estimators to work (e.g. if the parent node is a hash join, it needs the child's column boundaries to estimate its own result; otherwise it just gives up).

Describe the solution you'd like
There are certain cases where we can know a particular filter's effect on the resulting table (e.g. a > 25 on a table with a=[0, 100]; b=[50, 60] would mean a=[25, 100] (different), b=[50, 60] (same)). For simple (and relatively common) expressions like the above, we should be able to derive the new boundaries for the columns used in the predicate and pass them further along the statistics estimation chain.
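To make the example above concrete, here is a minimal sketch of the narrowing step in Python. It is not the project's implementation: `Bounds`, `apply_greater_than`, and `estimate_selectivity` are hypothetical names chosen for illustration, the bounds are treated as closed intervals (as in the example), and the selectivity estimate assumes values are uniformly distributed within the bounds.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass(frozen=True)
class Bounds:
    """Hypothetical closed interval [low, high] of values a column may take."""
    low: float
    high: float


def apply_greater_than(
    columns: Dict[str, Bounds], column: str, literal: float
) -> Dict[str, Bounds]:
    """Return the column bounds expected after a `column > literal` filter.

    Only the column used in the predicate is narrowed; every other column
    keeps its original bounds. If `literal` exceeds the old upper bound,
    the result has low > high, which stands for an empty range here.
    """
    narrowed = dict(columns)  # copy so the input statistics stay untouched
    old = columns[column]
    narrowed[column] = Bounds(max(old.low, literal), old.high)
    return narrowed


def estimate_selectivity(old: Bounds, new: Bounds) -> float:
    """Fraction of rows expected to survive, assuming a uniform distribution."""
    width = old.high - old.low
    if width <= 0:
        return 1.0  # degenerate range: no basis for narrowing the estimate
    return max(0.0, (new.high - new.low) / width)


before = {"a": Bounds(0, 100), "b": Bounds(50, 60)}
after = apply_greater_than(before, "a", 25)
```

With these inputs, `after["a"]` becomes `Bounds(25, 100)` while `after["b"]` is unchanged, so a parent node such as a hash join would still see boundaries for both columns instead of losing them at the filter.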

Describe alternatives you've considered
None

Additional context
Related to #3929
