Ensure context isn't exhausted via concurrent query as opposed to sentinel query #3334

gnpaone · 2025-04-05T13:56:19Z

Switch from sequential sentinel query to concurrent query to avoid context exhaustion and connect to new master redis after redis failover

gnpaone · 2025-04-05T13:57:09Z

Hey @ndyakov tried to fix your review points in #3174

ndyakov

Please add a test , it may be a setup of three sentinels, two of which are unreachable. Define the context in such a way, to get context.DeadlineExceeded error on GetMasterAddrByName. Such setup should fail prior to your PR and succeeded with the introduced change, correct?

sentinel.go

…onditions

kwenzh · 2025-04-10T12:06:31Z

Please add a test , it may be a setup of three sentinels, two of which are unreachable. Define the context in such a way, to get context.DeadlineExceeded error on GetMasterAddrByName. Such setup should fail prior to your PR and succeeded with the introduced change, correct?

The current problem is that the faulty sentinel node is in the front, which will cause context exhaustion. Even if the last sentinel can correctly provide the master address, don't we consider this situation?

kwenzh · 2025-04-11T03:01:40Z

I think that once any sentinel returns the master address correctly, it should be considered finished and other coroutines should be notified of the end. Even if the context timeout occurs afterwards, we should return the master address that has been queried, right?

kwenzh · 2025-04-11T03:26:50Z

I think that once any sentinel returns the master address correctly, it should be considered finished and other coroutines should be notified of the end. Even if the context timeout occurs afterwards, we should return the master address that has been queried, right?

with issue： #3172

gnpaone · 2025-04-11T04:10:35Z

I think that once any sentinel returns the master address correctly, it should be considered finished and other coroutines should be notified of the end. Even if the context timeout occurs afterwards, we should return the master address that has been queried, right?

with issue： #3172

Maybe should I use something like

ctx, cancel := context.WithCancel(ctx)
defer cancel()

...

cancel()

to stop goroutines early, once success is achieved ie sentinel returns the master address correctly?

kwenzh · 2025-04-11T08:06:09Z

once success is achieved ie sentinel returns the master address correctly?

yeah，once success returns the master address correctly，That's exactly what I meant！

gnpaone · 2025-04-13T12:03:35Z

Done

ndyakov · 2025-04-15T13:18:32Z

@gnpaone can you please add the issue in the description of this PR. I will review the latest changes later today and get back to you if there are any comments.

gnpaone · 2025-04-16T11:25:52Z

Updated the description @ndyakov

bug: ensure context isn't exhausted via concurrent query

6f22bf5

Update sentinel.go

6ec0cfd

ndyakov requested changes Apr 7, 2025

View reviewed changes

sentinel.go Outdated Show resolved Hide resolved

ndyakov and others added 3 commits April 7, 2025 13:46

Merge branch 'master' into patch-1

9b40ce4

Remove separate goroutine and just wait on the main WaitGroup

1601768

Need to immediately return in context canceled or deadline exceeded c…

84b2016

…onditions

Add test for proper sentinel resolution

d231fa0

Stop goroutines early after success is achieved

6d78214

gnpaone requested review from kwenzh and ndyakov April 13, 2025 12:03

gnpaone and others added 2 commits April 13, 2025 17:35

Update sentinel_test.go

bd27395

Merge branch 'master' into patch-1

7ab8412

Merge branch 'master' into patch-1

0f05272

Merge branch 'master' into patch-1

173d598

ndyakov approved these changes Apr 16, 2025

View reviewed changes

Merge branch 'master' into patch-1

67fe50d

ndyakov merged commit a4aea25 into redis:master Apr 16, 2025
16 checks passed

gnpaone deleted the patch-1 branch April 17, 2025 03:22

kwenzh mentioned this pull request Apr 17, 2025

Sentinel cluster settings 1 node network iface down, Probability unable to query the master node, MasterAddr error： context deadline exceeded #3172

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ensure context isn't exhausted via concurrent query as opposed to sentinel query #3334

Ensure context isn't exhausted via concurrent query as opposed to sentinel query #3334

gnpaone commented Apr 5, 2025 •

edited

Loading

gnpaone commented Apr 5, 2025

ndyakov left a comment

kwenzh commented Apr 10, 2025

kwenzh commented Apr 11, 2025

kwenzh commented Apr 11, 2025

gnpaone commented Apr 11, 2025

kwenzh commented Apr 11, 2025

gnpaone commented Apr 13, 2025

ndyakov commented Apr 15, 2025

gnpaone commented Apr 16, 2025

Ensure context isn't exhausted via concurrent query as opposed to sentinel query #3334

Ensure context isn't exhausted via concurrent query as opposed to sentinel query #3334

Conversation

gnpaone commented Apr 5, 2025 • edited Loading

gnpaone commented Apr 5, 2025

ndyakov left a comment

Choose a reason for hiding this comment

kwenzh commented Apr 10, 2025

kwenzh commented Apr 11, 2025

kwenzh commented Apr 11, 2025

gnpaone commented Apr 11, 2025

kwenzh commented Apr 11, 2025

gnpaone commented Apr 13, 2025

ndyakov commented Apr 15, 2025

gnpaone commented Apr 16, 2025

gnpaone commented Apr 5, 2025 •

edited

Loading