Avoid double buffering direct IO index input slices with BufferedIndexInput #14103

Open. Wants to merge 5 commits into base: main
Conversation

@ChrisHegarty (Contributor) commented Jan 6, 2025

This commit avoids double buffering direct IO index input slices with BufferedIndexInput.

Currently BufferedIndexInput is used for slicing, since it handles the initial offset and length, but this adds an extra layer of buffering: the buffer in the buffered index input on top of the buffer in the direct IO index input. This change reworks the direct IO index input so that it can handle an offset and length itself, allowing it to serve as its own implementation for slices.
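The idea can be pictured with a simplified, self-contained sketch (all names here are hypothetical, not Lucene's actual classes): a slice is just another direct input with a shifted base offset and its own length, so reads go through a single buffer layer rather than two.

```java
// Simplified sketch of slicing without a second buffering layer.
// DirectInput and its methods are hypothetical illustrations, not Lucene's API.
public class DirectInput {
    private final byte[] backing; // stands in for the file read via direct IO
    private final long offset;    // absolute start of this view into the file
    private final long length;    // length of this view
    private long pos;             // read position, relative to the view

    public DirectInput(byte[] backing, long offset, long length) {
        this.backing = backing;
        this.offset = offset;
        this.length = length;
    }

    public byte readByte() {
        if (pos >= length) throw new IllegalStateException("read past EOF");
        return backing[(int) (offset + pos++)];
    }

    // A slice is simply another DirectInput with a composed offset:
    // no extra buffered wrapper, so only one layer of buffering.
    public DirectInput slice(long sliceOffset, long sliceLength) {
        if (sliceOffset < 0 || sliceLength < 0 || sliceOffset + sliceLength > length) {
            throw new IllegalArgumentException("slice out of bounds");
        }
        return new DirectInput(backing, offset + sliceOffset, sliceLength);
    }

    public static void main(String[] args) {
        byte[] data = new byte[] {0, 1, 2, 3, 4, 5, 6, 7, 8, 9};
        DirectInput in = new DirectInput(data, 0, data.length);
        DirectInput s = in.slice(2, 5);   // view over bytes 2..6
        DirectInput s2 = s.slice(1, 3);   // offsets compose: bytes 3..5
        System.out.println(s.readByte() + " " + s2.readByte());
    }
}
```

Note how slicing a slice composes the offsets, which is exactly the bookkeeping that BufferedIndexInput was previously providing at the cost of a second buffer.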

Existing tests covered this, but I found a case where a clone of a slice was not covered. I added a small change to the base directory test case which covers this.
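The clone-of-a-slice case can be pictured with java.nio.ByteBuffer, whose slice()/duplicate() semantics are loosely analogous (this is only an illustration, not the actual Lucene test): a clone of a slice must see the slice's bounds, not the whole file's, while keeping an independent read position.

```java
import java.nio.ByteBuffer;

public class CloneOfSliceSketch {
    public static void main(String[] args) {
        ByteBuffer file = ByteBuffer.wrap(new byte[] {0, 1, 2, 3, 4, 5, 6, 7});

        // Take a view over bytes 2..5, analogous to IndexInput.slice(2, 4).
        file.position(2);
        file.limit(6);
        ByteBuffer slice = file.slice();

        // duplicate() is the analogue of cloning the slice.
        ByteBuffer clone = slice.duplicate();

        // The clone is bounded by the slice, not the whole file...
        System.out.println(clone.capacity()); // 4

        // ...and advancing the clone does not move the slice.
        clone.get();
        System.out.println(slice.position() + " " + clone.position()); // 0 1
    }
}
```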

My motivation for doing this is that I've been investigating the possibility of using direct IO for random access reads of float32 vectors when rescoring an initial set of candidates retrieved from scalar quantized approximations.

@mikemccand (Member) left a comment

Thank you for tackling this @ChrisHegarty -- I don't feel qualified to review the code changes too closely (they are / this class is somewhat scary).

It's a neat idea to try this for KNN 2nd phase rescoring, I guess to prevent that (heavy) IO from polluting the OS's buffer cache for other hot index pages?

And with a fast enough device (SSD) maybe the uncached IO penalty of retrieving N full precision vectors for rescoring is acceptable latency in the query hot path ... if you can read these N full precision vectors using IO concurrency that might also be a big win, though I think @jpountz's recent cool IO hinting changes rely on OS buffer cache (?) ... but maybe many virtual threads each doing the blocking (O_DIRECT) IO could work somehow?

@ChrisHegarty (Contributor, Author) commented Jan 7, 2025

To help move this forward, I'm going to separate the changes out into several smaller, more targeted PRs, tracked by #14106. This PR now tracks only the changes required for slicing.

@ChrisHegarty changed the title from "Optimize DirectIOIndexInput" to "Avoids double buffering direct IO index input slices with BufferedIndexInput" on Jan 9, 2025
@ChrisHegarty mentioned this pull request on Jan 9, 2025
@ChrisHegarty changed the title to "Avoid double buffering direct IO index input slices with BufferedIndexInput" on Jan 9, 2025