[Merged by Bors] - sync: prioritize peers with higher success rate and low latency #5143

dshulyak · 2023-10-11T10:29:32Z

closes: #5127 #5036

peers that are overwhelmed or generally will not be used for requests. there are two criteria used to select good peer:

request success rate . success rates within 0.1 (10%) of each other are treated as equal, and in such case we will use latency
latency. hs/1 protocol used to track latency, as it is the most used protocol and objects served in this protocol are of the same size with several exceptions (active sets, list of malfeasence proofs).

related: #4977

limits number of peers to request data for atxs. previously we were requesting data from all peers atleast once.

synced data 2 times in 90m, previous attempt on my computer was 1 week ago and took 12h

codecov · 2023-10-12T07:49:04Z

Codecov Report

Merging #5143 (dcca7de) into develop (6338651) will increase coverage by 77.7%.
Report is 1 commits behind head on develop.
The diff coverage is 90.0%.

@@            Coverage Diff             @@
##           develop   #5143      +/-   ##
==========================================
+ Coverage         0   77.7%   +77.7%     
==========================================
  Files            0     259     +259     
  Lines            0   30495   +30495     
==========================================
+ Hits             0   23704   +23704     
- Misses           0    5299    +5299     
- Partials         0    1492    +1492

Files	Coverage Δ
fetch/interface.go	`100.0% <100.0%> (ø)`
fetch/peers/peers.go	`100.0% <100.0%> (ø)`
syncer/data_fetch.go	`76.4% <100.0%> (ø)`
syncer/find_fork.go	`78.2% <100.0%> (ø)`
syncer/syncer.go	`92.8% <95.4%> (ø)`
fetch/fetch.go	`82.0% <91.7%> (ø)`
syncer/state_syncer.go	`73.9% <57.1%> (ø)`

... and 252 files with indirect coverage changes

dshulyak · 2023-10-12T08:37:05Z

submit change with formatting changes in touched files

poszu

Did you benchmark the SelectBest() method? I'm not sure how often it is called and what's the expected total number of peers, but it takes around 3ms to find 10 best peers from a set of 50.

fetch/fetch_test.go

fetch/peers/peers.go

syncer/syncer.go

dshulyak · 2023-10-12T15:33:38Z

bors try

bors · 2023-10-12T15:48:44Z

try

Build failed:

ci-status

dshulyak · 2023-10-12T16:45:20Z

@poszu i added benchmark. below is SelectBest(10) from 10000 total. how did you get 3 ms?

BenchmarkSelectBest-24 2854 401721 ns/op 240 B/op 2 allocs/op

dshulyak · 2023-10-13T03:51:15Z

bors merge

closes: #5127 #5036 peers that are overwhelmed or generally will not be used for requests. there are two criteria used to select good peer: - request success rate . success rates within 0.1 (10%) of each other are treated as equal, and in such case we will use latency - latency. hs/1 protocol used to track latency, as it is the most used protocol and objects served in this protocol are of the same size with several exceptions (active sets, list of malfeasence proofs). related: #4977 limits number of peers to request data for atxs. previously we were requesting data from all peers atleast once. synced data 2 times in 90m, previous attempt on my computer was 1 week ago and took 12h

bors · 2023-10-13T04:17:08Z

Build failed:

ci-status

dshulyak · 2023-10-13T04:42:33Z

bors merge

dshulyak · 2023-10-13T04:56:54Z

bors cancel

bors · 2023-10-13T04:56:57Z

Canceled.

dshulyak · 2023-10-13T04:58:25Z

bors merge

closes: #5127 #5036 peers that are overwhelmed or generally will not be used for requests. there are two criteria used to select good peer: - request success rate . success rates within 0.1 (10%) of each other are treated as equal, and in such case we will use latency - latency. hs/1 protocol used to track latency, as it is the most used protocol and objects served in this protocol are of the same size with several exceptions (active sets, list of malfeasence proofs). related: #4977 limits number of peers to request data for atxs. previously we were requesting data from all peers atleast once. synced data 2 times in 90m, previous attempt on my computer was 1 week ago and took 12h

bors · 2023-10-13T05:50:21Z

Pull request successfully merged into develop.

Build succeeded!

The publicly hosted instance of bors-ng is deprecated and will go away soon.

If you want to self-host your own instance, instructions are here.
For more help, visit the forum.

If you want to switch to GitHub's built-in merge queue, visit their help page.

ci-status
systest-status

poszu · 2023-10-13T07:01:56Z

@poszu i added benchmark. below is SelectBest(10) from 10000 total. how did you get 3 ms?

BenchmarkSelectBest-24 2854 401721 ns/op 240 B/op 2 allocs/op

I'm sorry, my bad - I can't do the math 🤦 it was 3µs.

…emeshos#5143) closes: spacemeshos#5127 spacemeshos#5036 peers that are overwhelmed or generally will not be used for requests. there are two criteria used to select good peer: - request success rate . success rates within 0.1 (10%) of each other are treated as equal, and in such case we will use latency - latency. hs/1 protocol used to track latency, as it is the most used protocol and objects served in this protocol are of the same size with several exceptions (active sets, list of malfeasence proofs). related: spacemeshos#4977 limits number of peers to request data for atxs. previously we were requesting data from all peers atleast once. synced data 2 times in 90m, previous attempt on my computer was 1 week ago and took 12h

dshulyak added 3 commits October 11, 2023 10:26

integrate peers tracker

57f8202

Merge branch 'develop' into sync/priotize-fast

3f5b776

add coverage

a102583

add doc

2ac58c8

dshulyak changed the title ~~sync: prioritize responsive peers with low latency~~ sync: prioritize peers with higher success rate and low latency Oct 12, 2023

Merge branch 'develop' into sync/priotize-fast

e0e3cfa

dshulyak marked this pull request as ready for review October 12, 2023 08:37

dshulyak requested review from countvonzero, fasmat and poszu as code owners October 12, 2023 08:37

poszu approved these changes Oct 12, 2023

View reviewed changes

bors bot added a commit that referenced this pull request Oct 12, 2023

Try #5143:

1b3ddf7

add benchmark to select 10 peers from 10000

379aee2

dshulyak added 4 commits October 13, 2023 05:41

review

138b8ec

store rate on struct

1c93431

add changelog

0fbcb58

Merge branch 'develop' into sync/priotize-fast

4cb8f33

only import

dcca7de

dshulyak force-pushed the sync/priotize-fast branch from b0c679d to dcca7de Compare October 13, 2023 04:58

bors bot changed the title ~~sync: prioritize peers with higher success rate and low latency~~ [Merged by Bors] - sync: prioritize peers with higher success rate and low latency Oct 13, 2023

bors bot closed this Oct 13, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Merged by Bors] - sync: prioritize peers with higher success rate and low latency #5143

[Merged by Bors] - sync: prioritize peers with higher success rate and low latency #5143

dshulyak commented Oct 11, 2023 •

edited

codecov bot commented Oct 12, 2023 •

edited

dshulyak commented Oct 12, 2023

poszu left a comment

dshulyak commented Oct 12, 2023

bors bot commented Oct 12, 2023

dshulyak commented Oct 12, 2023

dshulyak commented Oct 13, 2023

bors bot commented Oct 13, 2023

dshulyak commented Oct 13, 2023

dshulyak commented Oct 13, 2023

bors bot commented Oct 13, 2023

dshulyak commented Oct 13, 2023

bors bot commented Oct 13, 2023

poszu commented Oct 13, 2023

[Merged by Bors] - sync: prioritize peers with higher success rate and low latency #5143

[Merged by Bors] - sync: prioritize peers with higher success rate and low latency #5143

Conversation

dshulyak commented Oct 11, 2023 • edited

codecov bot commented Oct 12, 2023 • edited

Codecov Report

dshulyak commented Oct 12, 2023

poszu left a comment

Choose a reason for hiding this comment

dshulyak commented Oct 12, 2023

bors bot commented Oct 12, 2023

try

dshulyak commented Oct 12, 2023

dshulyak commented Oct 13, 2023

bors bot commented Oct 13, 2023

dshulyak commented Oct 13, 2023

dshulyak commented Oct 13, 2023

bors bot commented Oct 13, 2023

dshulyak commented Oct 13, 2023

bors bot commented Oct 13, 2023

poszu commented Oct 13, 2023

dshulyak commented Oct 11, 2023 •

edited

codecov bot commented Oct 12, 2023 •

edited