fix: correctly track CPLs of never refreshed buckets #71

Merged 2 commits into master on Apr 6, 2020

Conversation

Stebalien (Member):

To determine how many buckets we should refresh, we:

  1. Look at the max bucket.
  2. Look at the peer with the greatest common prefix in the max bucket.
  3. Return the "last refresh" times for all buckets between 0 and min(maxPeer, 15)

BREAKING: this returns a slice of times instead of CplRefresh objects because we want to refresh all buckets between 0 and the max CPL.
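
A minimal sketch of the tracking logic described above. The field and function names (cplRefreshedAt, trackedCplRefreshTimes, maxCplForRefresh) are illustrative assumptions, not the PR's actual code:

// Sketch only; names are illustrative and the package name is a placeholder.
package sketch

import "time"

// The tracked CPLs are capped at 15, per the description above.
const maxCplForRefresh = 15

type RoutingTable struct {
	// When each CPL was last refreshed; a missing entry means "never refreshed".
	cplRefreshedAt map[uint]time.Time
	// buckets, locks, etc. elided
}

// trackedCplRefreshTimes returns one "last refresh" time per CPL in
// [0, min(maxCpl, 15)], where maxCpl is the greatest common prefix length of
// any peer in the highest non-empty bucket (steps 1-2 above). A zero
// time.Time means that CPL has never been refreshed.
func (rt *RoutingTable) trackedCplRefreshTimes(maxCpl uint) []time.Time {
	if maxCpl > maxCplForRefresh {
		maxCpl = maxCplForRefresh
	}
	times := make([]time.Time, maxCpl+1)
	for cpl := uint(0); cpl <= maxCpl; cpl++ {
		times[cpl] = rt.cplRefreshedAt[cpl] // zero time if never refreshed
	}
	return times
}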

Stebalien requested review from aarshkshah1992 and aschmahmann and removed the review request for aarshkshah1992 (April 5, 2020 23:47)
defer rt.tabLock.RUnlock()

for i := len(rt.buckets) - 1; i >= 0; i-- {
	if rt.buckets[i].len() > 0 {
Stebalien (Member Author):

#72

Contributor:

While #72 is a valid point are we sure we actually want to return a smaller number of tracked buckets and ramp back up? Upsides: Scales nicely as the network grows without us touching anything. Downsides: Maybe we want some minimal number of buckets to ramp up our scale.

Feel free to ignore this comment or just say "meh, this is probably fine"

Stebalien (Member Author):

The number of buckets shouldn't matter in practice, right?

Contributor:

The number of buckets we have shouldn't matter. We might care about the number of refreshes, but as long as it's ~15-20 and not ~100 it's probably fine.

I'm not sure if there's any tweaking to be done over how we do initial bootstrapping so that we fill our buckets fairly quickly. I suspect this will be much less problematic in the real network than in the test network, though.

Stebalien (Member Author):

My point is that we:

  1. Start with one big bucket.
  2. Split the big bucket as we add peers.
  3. Never shrink back down to 1 bucket.

In terms of refreshing the routing table, the logic in this PR will refresh every CPL, regardless of how many buckets we actually have. In practice, the number of buckets we have shouldn't affect anything but memory usage, unless we're doing something wrong.
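
As a sketch of that point (a hypothetical helper, not the library's actual code): any CPL at or beyond the index of the last, still-unsplit bucket simply maps into that last bucket, so walking CPLs for refresh doesn't depend on how many physical buckets exist.

// Sketch: mapping a CPL to a physical bucket index. Hypothetical helper;
// package name is a placeholder.
package sketch

func bucketIndexForCpl(cpl, numBuckets int) int {
	// All CPLs >= numBuckets-1 fold into the last (unsplit) bucket, so
	// refreshing CPLs 0..maxCpl never needs more buckets than we have.
	if cpl >= numBuckets-1 {
		return numBuckets - 1
	}
	return cpl
}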

aschmahmann (Contributor) left a review:

LGTM aside from the comment I added about a potential off-by-one error.

table_refresh.go (comment outdated, resolved)

table_refresh.go (comment resolved)

// maxCommonPrefix returns the maximum common prefix length between any peer
// in the bucket and the target ID.
func (b *bucket) maxCommonPrefix(target ID) uint {
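
One possible shape for this helper, shown purely as a sketch: the bucket's internal layout and the bit-counting helper below are assumptions, and the PR's actual body may differ.

// Sketch only; package name and the bucket's internal layout are placeholders.
package sketch

type ID []byte

type bucket struct {
	ids []ID // IDs of the peers currently in this bucket (assumed layout)
}

// commonPrefixLen counts the number of leading bits shared by two IDs.
func commonPrefixLen(a, b ID) uint {
	var cpl uint
	for i := 0; i < len(a) && i < len(b); i++ {
		x := a[i] ^ b[i]
		if x == 0 {
			cpl += 8
			continue
		}
		// Count the leading zero bits of the first differing byte.
		for mask := byte(0x80); mask != 0 && x&mask == 0; mask >>= 1 {
			cpl++
		}
		return cpl
	}
	return cpl
}

// maxCommonPrefix returns the longest common prefix length between the target
// and any peer in the bucket (0 if the bucket is empty).
func (b *bucket) maxCommonPrefix(target ID) uint {
	var maxCpl uint
	for _, id := range b.ids {
		if cpl := commonPrefixLen(id, target); cpl > maxCpl {
			maxCpl = cpl
		}
	}
	return maxCpl
}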
Contributor:

This is interesting... it means that if we remove the maxCpl cap (because we fix the RPCs in the DHT to allow querying random KadIDs), then if there are 2^10 peers in the network and bucket number 10 has someone really close to us (e.g. 20 shared bits), I'm now going to be querying 20 buckets. Not sure if that's really what we want, is it?

Stebalien (Member Author):

We should probably query until we fail to fill a bucket.

Contributor:

You mean do a query, wait for it to finish and stop if the percentage of the bucket that is full after the query drops below x%? That seems reasonable.
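
A rough sketch of that stopping rule, purely illustrative: the interface, the helper names, and the 50% threshold are assumptions rather than an agreed design; only the 15-CPL cap comes from the PR description.

// Sketch of "refresh until a bucket fails to fill"; all names and the 50%
// threshold are placeholders, as is the package name.
package sketch

import "context"

const (
	maxCplForRefresh = 15  // existing cap from the PR description
	minFillRatio     = 0.5 // the "x%" from the discussion; placeholder value
)

// refresher abstracts the two operations the loop needs.
type refresher interface {
	refreshCpl(ctx context.Context, cpl uint) error // run a refresh query for one CPL
	bucketFillRatio(cpl uint) float64               // fraction of that CPL's bucket that is full
}

// refreshUntilBucketsStopFilling walks CPLs upward and stops once a refreshed
// bucket stays below the fill threshold (deeper CPLs are unlikely to have peers).
func refreshUntilBucketsStopFilling(ctx context.Context, r refresher) error {
	for cpl := uint(0); cpl <= maxCplForRefresh; cpl++ {
		if err := r.refreshCpl(ctx, cpl); err != nil {
			return err
		}
		if r.bucketFillRatio(cpl) < minFillRatio {
			break
		}
	}
	return nil
}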

Stebalien (Member Author):

Yeah, something like that. But I'm fine cutting an RC without that.

Contributor:

No problem at all since we max out at 15 anyway.

Contributor:

I'll file an issue to track it

Stebalien (Member Author):

LGTM from @aschmahmann over Slack.

Stebalien merged commit 7438bac into master on Apr 6, 2020
Stebalien (Member Author):

@aarshkshah1992 I'd like your thoughts on this post-merge. I'm merging now so I can cut an RC, but we may want to tweak the behavior here.

aarshkshah1992 (Contributor) commented Apr 6, 2020:

@Stebalien While this looks good to me for now, we should revisit/refactor this for 0.6. I've created a meta-issue to track it and will have more thoughts on how to improve it once I start working on the issue and dig deeper.

libp2p/go-libp2p-kad-dht#556
