DHT 2.0 #291

Stebalien · 2018-03-29T21:01:49Z

So, we've accumulated quite a few "if only"s in the DHT. We should start thinking about what we'd like in a DHT 2.0.

Request/message oriented. We should have the Network provide some form of (possibly unreliable) request + message system.
Only three requests: PUT_VALUES (batch), FIND_VALUES (batch), FIND_NODES (batch).
Explicitly stores signed PeerInfo records (possibly including the public key?).
Ideally, real IPRS.
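
A rough sketch of what the three batched requests could look like as message types. Everything here (the `Request` and `Record` shapes, the field names, the helper constructors) is an assumption for illustration; the thread only names the three request kinds and says they are batched.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import List


class RequestType(Enum):
    # The only three request kinds proposed in the comment above.
    PUT_VALUES = 1
    FIND_VALUES = 2
    FIND_NODES = 3


@dataclass
class Record:
    key: bytes
    value: bytes


@dataclass
class Request:
    # Hypothetical wire shape: one message carries a whole batch.
    type: RequestType
    keys: List[bytes] = field(default_factory=list)      # FIND_VALUES / FIND_NODES
    records: List[Record] = field(default_factory=list)  # PUT_VALUES


def put_values(records: List[Record]) -> Request:
    return Request(RequestType.PUT_VALUES, records=list(records))


def find_values(keys: List[bytes]) -> Request:
    return Request(RequestType.FIND_VALUES, keys=list(keys))


def find_nodes(keys: List[bytes]) -> Request:
    return Request(RequestType.FIND_NODES, keys=list(keys))
```

Because each request is a self-contained message, it could in principle be sent over an unreliable transport and retried independently.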

fahrradflucht · 2018-04-06T08:47:04Z

Related to but not a duplicate of #162

daviddias · 2018-05-30T10:15:44Z

While at it, let's make sure to consider how to make it private.

Stebalien · 2018-06-09T22:42:23Z

> While at it, let's make sure to consider how to make it private.

@diasdavid, @jbenet and I discussed this yesterday and he had several really nice ideas.

The core idea is that, instead of calling Put(key, value), we can call Put(Hash(key), Encrypt(DeriveSymmetricKey(key), value)). This obviously doesn't work for single-valued records like IPNS records but works really well for both provider records and PeerInfo records. It also assumes that key has enough entropy that it can't be guessed from Hash(key).
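
A minimal sketch of the Put(Hash(key), Encrypt(DeriveSymmetricKey(key), value)) idea, under several assumptions that are not from the thread: SHA-256 as the hash, HMAC-based derivation with made-up labels, a plain dict standing in for the DHT, and a stand-in cipher (an HMAC-SHA256 keystream plus an HMAC tag) used only because the Python standard library ships no AEAD. A real implementation would use a vetted AEAD such as AES-GCM.

```python
import hashlib
import hmac
import os


def lookup_key(key: bytes) -> bytes:
    """Hash(key): the only form of the key that DHT nodes see."""
    return hashlib.sha256(b"dht-lookup:" + key).digest()


def derive_symmetric_key(key: bytes) -> bytes:
    """DeriveSymmetricKey(key): must differ from lookup_key(key), otherwise
    any node storing the record could decrypt it."""
    return hmac.new(key, b"dht-record-encryption", hashlib.sha256).digest()


def _subkeys(sym_key: bytes):
    enc_key = hmac.new(sym_key, b"enc", hashlib.sha256).digest()
    mac_key = hmac.new(sym_key, b"mac", hashlib.sha256).digest()
    return enc_key, mac_key


def _keystream(enc_key: bytes, nonce: bytes, length: int) -> bytes:
    out = bytearray()
    counter = 0
    while len(out) < length:
        block = hmac.new(enc_key, nonce + counter.to_bytes(8, "big"), hashlib.sha256)
        out += block.digest()
        counter += 1
    return bytes(out[:length])


def encrypt(sym_key: bytes, plaintext: bytes) -> bytes:
    # Stand-in cipher (assumption): XOR with a keystream, then MAC the result.
    enc_key, mac_key = _subkeys(sym_key)
    nonce = os.urandom(16)
    stream = _keystream(enc_key, nonce, len(plaintext))
    ciphertext = bytes(p ^ s for p, s in zip(plaintext, stream))
    tag = hmac.new(mac_key, nonce + ciphertext, hashlib.sha256).digest()
    return nonce + ciphertext + tag


def decrypt(sym_key: bytes, blob: bytes) -> bytes:
    enc_key, mac_key = _subkeys(sym_key)
    nonce, ciphertext, tag = blob[:16], blob[16:-32], blob[-32:]
    expected = hmac.new(mac_key, nonce + ciphertext, hashlib.sha256).digest()
    if not hmac.compare_digest(tag, expected):
        raise ValueError("record failed authentication")
    stream = _keystream(enc_key, nonce, len(ciphertext))
    return bytes(c ^ s for c, s in zip(ciphertext, stream))


def private_put(store: dict, key: bytes, value: bytes) -> None:
    # Multi-valued on purpose: this scheme targets provider/PeerInfo records.
    blob = encrypt(derive_symmetric_key(key), value)
    store.setdefault(lookup_key(key), []).append(blob)


def private_get(store: dict, key: bytes) -> list:
    sym_key = derive_symmetric_key(key)
    return [decrypt(sym_key, blob) for blob in store.get(lookup_key(key), [])]
```

Only someone who already knows key can compute both the lookup location and the decryption key, which is what makes the stored records opaque to the nodes holding them.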

He also proposed an extension where the record author can put an additional "challenge" key that the record author can prove knowledge of to fetch the record. However, that may not be that useful given that DHTs are supposed to allow neighbors to fetch keys from each other on join.

I also proposed an additional privacy enhancing extension where one can use Hash(key || date) so that provider records get moved around. Juan extended this with Hash(key || date || counter) allowing users to post multiple records in different areas of the DHT (helps both with privacy and, possibly, reliability).
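
To make the Hash(key || date || counter) variant concrete, here is a small sketch. The choice of SHA-256, a one-day granularity, the ISO date encoding, and the length prefix on key are all assumptions; the thread does not specify any of them.

```python
import hashlib
from datetime import date
from typing import List


def rotating_key(key: bytes, day: date, counter: int) -> bytes:
    """Hash(key || date || counter).

    The key is length-prefixed so that different (key, date) pairs can never
    serialize to the same byte string.
    """
    material = (
        len(key).to_bytes(4, "big")
        + key
        + day.isoformat().encode()
        + counter.to_bytes(4, "big")
    )
    return hashlib.sha256(material).digest()


def publish_locations(key: bytes, day: date, replicas: int) -> List[bytes]:
    """All DHT locations a publisher would write to on a given day."""
    return [rotating_key(key, day, counter) for counter in range(replicas)]
```

A reader who knows key recomputes the same locations for the current date; publisher and reader would need to agree on the date granularity and on how to handle clock skew around a rollover.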

This should make the network significantly harder to enumerate.

We can also use these features to get basic capabilities.

It allows "hidden" peers where you need to know the peer's ID to find them. See: Multiple peer IDs, ephemeral IDs, and permanent/private IDs. libp2p/libp2p#37
With some modifications to bitswap, it could avoid leaking information about what a peer is looking for to the peers from which it is requesting blocks.

daviddias · 2018-06-19T04:59:45Z

//cc @mafintosh here who has been thinking a lot on how to make DHTs private and is currently building one (or something to add this functionality).

Stebalien · 2018-07-10T14:45:00Z

Additional notes:

DoS

Because these records are encrypted, it will be impossible for DHT nodes to filter out "bad" peer routing records (or content routing records).

Solution: Allow peers to explicitly expose the unhashed key so that DHTs can perform additional validation.

DHT Delete

More generally, we've found that we'd like a DELETE function for the DHT.

We could use it for the DoS issue above to clear out bad records without exposing the key. This would, unfortunately, require some additional crypto magic (unless we're fine exposing the key).
We'd also like this to implement an "inbox" protocol. When peer A is offline, peer B could post an encrypted message for peer A to the DHT under /inbox/peerA. When peer A comes back online, it would download and then clear all messages on the DHT (not currently possible).
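
The inbox flow could look roughly like the sketch below. `ToyDHT` is a hypothetical in-memory stand-in, not an existing API, and it deliberately skips the hard part: how a DELETE would be authorized so that only peer A can clear its own inbox.

```python
from collections import defaultdict
from typing import Dict, List


class ToyDHT:
    """In-memory stand-in for a DHT that supports DELETE (hypothetical)."""

    def __init__(self) -> None:
        self._records: Dict[bytes, List[bytes]] = defaultdict(list)

    def put(self, key: bytes, value: bytes) -> None:
        self._records[key].append(value)

    def get(self, key: bytes) -> List[bytes]:
        return list(self._records.get(key, []))

    def delete(self, key: bytes) -> int:
        """Remove every record under key and return how many were removed.
        No authorization check here; a real design would need one."""
        return len(self._records.pop(key, []))


def inbox_key(peer_id: str) -> bytes:
    return f"/inbox/{peer_id}".encode()


def post_message(dht: ToyDHT, recipient: str, encrypted_message: bytes) -> None:
    # Peer B leaves an already-encrypted message for an offline peer A.
    dht.put(inbox_key(recipient), encrypted_message)


def drain_inbox(dht: ToyDHT, peer_id: str) -> List[bytes]:
    # Peer A comes back online: download everything, then clear the inbox.
    key = inbox_key(peer_id)
    messages = dht.get(key)
    dht.delete(key)
    return messages
```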

Stebalien · 2018-08-20T23:20:20Z

One issue that has been plaguing us in the DHT is adding new datatypes. Our plan is to fix this with IPRS and arbitrary WASM functions. However, at the moment, this has some pretty significant drawbacks:

Security. The VM will likely be really complex.
Simplicity. Any DHT will now need access to a WASM VM. This will become less of an issue over time as WASM reaches greater adoption but is a major problem in the short term.
Performance. At the moment, doing this would require spinning up a new WASM VM each time we want to validate something. Worse, we'd need a VM that can quickly verify signatures.

When thinking about libp2p/go-libp2p-kad-dht#189 (comment), I realized that there's a way to significantly improve the current situation without necessarily adding full IPRS: We can use one protocol per record type.

That is, we can have a "separate" DHT for each key type. At the end of the day, this won't really cost us anything, we're just creating multiple overlay networks. To add a new key type, we'd:

Register a new "sub" DHT with a description of acceptable values/keys (validators/selectors).
This new "sub" DHT would speak a new protocol (e.g., /p2p/dht/MyTypeName/1.0.0).
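
As a sketch of those two steps, a node could keep a registry that maps each protocol ID to the validator and selector for that record type. The `SubDHT` and `SubDHTRegistry` names and the callback signatures are assumptions for illustration; only the /p2p/dht/MyTypeName/1.0.0 protocol ID format comes from the comment above.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

# Hypothetical callback shapes: a validator accepts or rejects one record,
# a selector returns the index of the best value among candidates.
Validator = Callable[[bytes, bytes], bool]
Selector = Callable[[bytes, List[bytes]], int]


@dataclass(frozen=True)
class SubDHT:
    type_name: str
    version: str
    validate: Validator
    select: Selector

    @property
    def protocol_id(self) -> str:
        return f"/p2p/dht/{self.type_name}/{self.version}"


class SubDHTRegistry:
    def __init__(self) -> None:
        self._by_protocol: Dict[str, SubDHT] = {}

    def register(self, sub: SubDHT) -> None:
        if sub.protocol_id in self._by_protocol:
            raise ValueError(f"already registered: {sub.protocol_id}")
        self._by_protocol[sub.protocol_id] = sub

    def protocols(self) -> List[str]:
        """Protocol IDs this node would advertise and answer on."""
        return sorted(self._by_protocol)

    def handle_put(self, protocol_id: str, key: bytes, value: bytes) -> bool:
        """Accept a record only if we participate in that sub-DHT and its
        validator approves the record."""
        sub = self._by_protocol.get(protocol_id)
        return sub is not None and sub.validate(key, value)
```

A node that never registers a given type simply does not speak that protocol, so it never has to know how to validate those records.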

This would allow users to:

Choose which DHTs they want to participate in.
Add new "types" without convincing the entire network to support their type.

This should add almost no overhead as the nodes will keep the same IDs in all the DHTs (so the routing tables will be almost identical).

Note: This'll also allow applications to register external DHTs using the HTTP API (kind of like the ipfs p2p feature).

The difficult issue here will be finding these "sub" DHTs. That is, finding peers participating in them. This is basically the rendezvous/discovery issue, but the tricky part is that these "sub" DHTs may range from massive (the size of the DHT) to tiny (a few nodes).

florianlenz · 2018-08-21T00:42:52Z

@Stebalien for the messaging system, quasar might be of interest for the DHT. It's very effective in terms of routing; give it a quick read if you have the time. The only problem I see right now with quasar is the filter propagation, which takes a few minutes with the current mechanism (but I guess we can come up with something more effective).

sanderpick · 2018-08-21T00:51:00Z

Sounds pretty slick @Stebalien

Stebalien · 2018-08-21T04:22:24Z

@florianlenz that looks like a pubsub system. We actually have a separate pubsub implementation and even have a "value store" adapter for pubsub (which we use for IPNS over pubsub).

The difference here is that a DHT stores values (temporarily) so members can join and leave and still get published values.

jhiesey · 2018-12-22T01:22:41Z

So I finally got around to writing up a proposal for a refactor followed by a new protocol: libp2p/research-dht#8

Thoughts @Stebalien @daviddias @anacrolix

daviddias · 2019-01-08T10:05:08Z

Ton of DHT notes at gpestana/notes#8 by @gpestana

Stebalien · 2019-02-01T21:37:46Z

Looks like the Tor Project has put a lot of thought into this: https://github.com/torproject/torspec/blob/master/rend-spec-v3.txt

Stebalien mentioned this issue Jun 9, 2018

Multiple peer IDs, ephemeral IDs, and permanent/private IDs. libp2p/libp2p#37

Open

daviddias mentioned this issue Jul 2, 2018

[Protocol Design] How to create a fully private DHT libp2p/developer-meetings#6

Open

jbenet added the Candidate Open Problem label Jul 9, 2018

gpestana mentioned this issue Jul 10, 2018

IPFS: metadata and censorship gpestana/notes#5

Open

Stebalien mentioned this issue Jul 18, 2018

feat: Adds ws and wss multiaddr multiformats/go-multiaddr#72

Closed

daviddias added the topic/libp2p Topic libp2p label Nov 23, 2018

This was referenced Feb 1, 2019

Peer ID Calculation History And Resolution libp2p/specs#138

Closed

Tracking protection libp2p/libp2p#67

Open

This was referenced Mar 15, 2019

advertising supported protocols via the DHT libp2p/go-libp2p-kad-dht#302

Open

Using ipfs dht to store key and value pair ipfs/kubo#5519

Closed

aschmahmann mentioned this issue Apr 12, 2019

Closed DHT Based Routing libp2p/notes#10

Open

Stebalien mentioned this issue Jun 11, 2019

Anonymous IPFS ipfs/kubo#6430

Open

daviddias mentioned this issue Sep 10, 2019

Open Problem: Routing at Scale (1M, 10M, 100M, 1B.. nodes) libp2p/research#4

Merged

Stebalien mentioned this issue Apr 27, 2020

Excessive bandwidth use ipfs/kubo#3429

Closed

lontivero mentioned this issue May 6, 2020

PayJoin without urls WalletWasabi/WalletWasabi#3619

Closed
