Explicit trust establishment mechanism for MicroCloud

jpelizaeus · April 17, 2024, 2:03pm

Project	MicroCloud
Status	Active
Author(s)	@jpelizaeus
Approver(s)	@tomp @maria-seralessandri
Release	2.x
Internal ID	LX073

Abstract

Allow all members of a MicroCloud to explicitly establish trust so they can securely join the cluster. Grant both the joining member and the cluster the possibility to verify its peer
and do not transfer critical information like API secrets across the network.

Rationale

In its current version MicroCloud offers a convenient approach to bootstrap clusters built using LXD, MicroCeph and MicroOVN. By controlling the entire process, MicroCloud can generate join tokens for each of the services and distribute them across the cluster members. This allows forming additional clusters for each of the services without additional manual intervention.

As MicroCloud itself offers many additional settings to configure its behavior, additional support for a preseed configuration file has been added. This allows and administrator to skip the setup dialog and apply the configuration for an entire MicroCloud in one step.

Having this one step configuration mechanism requires MicroCloud to make decisions on the administrators behalf. One of them is to accept additional cluster members that have been selected or configured (using preseed) by the administrator. As there is no additional step involved to ensure the integrity of either one of the joining peers, this might lead to security risks as currently the network is considered to be a trusted party.

Specification

This specification supersedes the already existing cluster join mechanism with proactive tasks that have to be executed on each cluster member individually to ensure integrity before starting the join procedure.

Existing mechanism

In the latest release of MicroCloud the cluster join mechanism is largely dependent on mDNS in order to discover its peers, share relevant connection details and to bootstrap the final cluster. Therefore on each node of the cluster a microcloudd daemon is running that both broadcasts its own set of details onto the network but also receives details sent by others in the same network.
This ultimately allows to construct a picture of the available resources so that MicroCloud can offer the administrator a straightforward question and answer input dialog:

MicroCloud mDNS

The details broadcasted on the network by each of the daemons consist of the following:

The current version of the mDNS broadcast/lookup format (currently 1.0)
The hostname of the node
The address of the node’s MicroCloud API endpoint
The nodes network interface over which the broadcast was sent
A list of services (MicroCloud, LXD, MicroCeph and MicroOVN) present on this node
An authentication secret that can be used to access this nodes API endpoint using the X-MicroCloud-Auth header

Using those information the initial microcloudd requests further details on network interfaces and storage disks from the local and peer LXD servers for later selection by the administrator.

Discovery

At the beginning MicroCloud requires three independent microcloudd daemons to be running on a shared network. As a node running microcloudd could potentially have more than a single network interface, each microcloudd broadcasts its details on each of the network interfaces which are available on the underlying node that have an IP address configured.
An administrator will then pick any of the microcloudd daemons to be the initial one that will be used to bootstrap the MircoCloud. It’s the responsibility of this microcloudd to listen only to broadcasts on the network(s) selected by the administrator:

Select an address for MicroCloud's internal traffic:
Space to select; enter to confirm; type to filter results.
Up/down to move; right to select all; left to select none.
       +----------------------------------------+--------+
       |                ADDRESS                 | IFACE  |
       +----------------------------------------+--------+
> [x]  | 10.237.170.93                          | enp5s0 |
  [ ]  | fd42:e287:8e5c:b221:216:3eff:fe6f:45cb | enp5s0 |
       +----------------------------------------+--------+
...
Limit search for other MicroCloud servers to 10.237.170.93/24? (yes/no) [default=yes]:

After selecting the network(s) on which MicroCloud should discover any potential peers, MicroCloud prompts the user with a list of peers that have been discovered on the respective network interface:

Scanning for eligible servers ...
Space to select; enter to confirm; type to filter results.
Up/down to move; right to select all; left to select none.
       +------+--------+----------------+
       | NAME | IFACE  |      ADDR      |
       +------+--------+----------------+
> [x]  | m3   | enp5s0 | 10.237.170.140 |
  [x]  | m2   | enp5s0 | 10.237.170.61  |
       +------+--------+----------------+

After this the administrator is guided through multiple dialogs allowing further configuration of the final MicroCloud in regards to storage and networking.

As a last step MicroCloud instructs its peers to form a cluster for each of the services (MicroCloud, LXD, MicroCeph and MicroOVN). In order to allow access from the initial MicroCloud node to every other node in the cluster, each node has broadcasted an API secret that is now used to invoke RPC requests on the cluster’s nodes by setting the X-MicroCloud-Auth header.
As part of those requests MicroCloud initiates the cluster forming process for the various services.

Cluster forming

MicroCloud is using MicroCluster under the hood to form a cluster of nodes. The mechanism on the MicroCluster side currently relies on the fact that the join token is considered to be a secret. In addition a node joining using this secret is automatically trusted to be the one for which the secret has been created.
The joining node however is checking if the fingerprint of the clusters certificate (embedded into the token) is matching the one returned by the cluster API when issuing the join request:

MicroCluster forming

The response of the join request contains the cluster’s certificate and key and a list of certificates from all the nodes who have already joined the cluster. This list is used to extend the truststore of the newly joined node.
The nodes own certificate is added automatically to the truststore.

After adding the other peer’s certificates to the nodes truststore, it will start its API and join the already existing dqlite cluster
using mutual TLS with the certificates that have been obtained during the join process.

Updated mechanism

As the current mechanism fully trusts the local LAN, every microcloudd broadcasts an authentication secret and trusts the broadcasts received from other peers in the network.
This can lead to man in the middle (MITM) attacks as the broadcasts itself aren’t protected and could potentially be read and modified by somebody sitting in the same network.

By removing the authentication secret from the broadcast message, the initial microcloudd cannot anymore talk to its peers as there isn’t a trust relationship anymore. This breaks the disk and network discovery as well as the final cluster forming as they are currently making use of this secret.

Instead this communication could already make use of the mutual TLS that currently gets established during the final cluster forming. By moving the exchange of trust right after the discovery of peers and extending it with a proactive human verification option, it can be ensured
that the nodes in the cluster are the ones they pretend to be.
Instead of forming the MicroCloud’s MicroCluster at the end to establish the base for mTLS, a new temporary trust store is build up during the cluster forming. This store can be used for any follow up tasks to discover the required information and to finally form the clusters for MicroCloud, LXD, MicroCeph and MicroOVN.

To allow exchanging the public keys for the temporary trust store, the already existing flow of communication (discovery) is extended with a HMAC to prevent broadcasting a secret across the network but still allow the sender and receiver to validate and trust the received payloads. This requires having a shared secret on both ends that never gets transmitted across the network.

Discovery (KDF and HMAC)

Discovery

The discovery is relying on HMAC to sign the messages exchanged between the initial microcloudd and potential peers so that each side can verify any received contents. For added security a key derivation function (KDF) is used together with a salt that allows having a stronger secret when computing the message’s HMAC.

The overall idea is to allow establishing a verified mTLS connection as soon as possible so that both ends can talk via an encrypted channel to exchange further information. Therefore the discovery ensures that both sides exchange their own public key rather quick so that if a new HTTPS connection gets opened up from one end to the other, we can rely on TLS to perform a proper exchange and to setup a secure session.
Even before the peers can be verified using mTLS, both the joiner and initiator open up TLS connections to the other end (if allowed by the underlying protocol) so that the flow of communication is encrypted.

In case of preseed the steps that require human interaction are skipped and only the HMAC comparison is performed on both ends. Additionally there is no random password generated on the initial microcloudd. Instead the password has to be generated by the administrator and injected accordingly when running microcloud preseed.

The initial public key exchange is what is depicted in the next six steps which are also shown in the graphic above.

Startup

An administrator starts the cluster forming process by reading a randomly generated password displayed by the initial microcloudd and setting it on any of the potential joiners. The password itself is a concatenation of strings which have been selected randomly from a given word list (e.g. EFF wordlist for random passphrases). There are various approaches for the word lists. One of them might be picking a list that only contains words which have a unique three-character prefix (see example list) so that an administrator only has to type in the first three characters of each word and the remainder of the characters can be guessed using auto completion.

The length of the password is based on the number of words selected from the words list. It takes around n^k/2 guesses to crack the password where n is the length of the overall word list and k the number of words chosen from the list. Picking between 4-6 words from a list with a length of around 5000 should be sufficient. The password is displayed on the initial microcloudd and has to be typed in on any other microcloudd that should join the cluster.

Using this password the joining microcloudd can derive a key using a random salt with an appropriate length. We have chosen argon2, but other KDFs (HKDF, scrypt) might work too. As inputs for argon2 we are using the second option of recommended defaults from https://www.rfc-editor.org/rfc/rfc9106#section-4-6.2:

if salt == nil {
	// 128 bit salt.
	salt = make([]byte, 16)
	_, err := rand.Read(salt)
	if err != nil {
		return nil, fmt.Errorf("Failed to create salt: %w", err)
	}
}

// 3 iterations.
var time uint32 = 3

// 64 MiB memory.
var memory uint32 = 64 * 1024

// 4 lanes.
var threads uint8 = 4

// 256 bit tag size.
var keyLen uint32 = 32

The code snippet above shows the recommended inputs (option 2 in the RFC) which were implemented through the common HMAC tooling added with Shared: Add HMAC and cert utils by roosterfish · Pull Request #13969 · canonical/lxd · GitHub.

As each joining microcloudd will pick another random salt, the initial microcloudd can derive the key only after receiving the intent from the joiner which also includes the salt. This is covered in the next section.

For the purpose of human validation, the respective local microcloudd will also print it’s fingerprint on startup either when running microcloud init or microcloud join for visual comparison on the other end.

Discover joiner

As both ends need to be aware of each other, the joining microcloudd has to send its intent to join an existing MicroCloud cluster. This intent is sent to the initiating side and the request contains at least the following information:

The local public key
The version of MicroCloud
The name of the node
The local address of the API
The random salt
The HMAC

The value of the HMAC field is created by taking the contents of the body (see the section below) and creating the MAC by using the key which was computed by the KDF. The HMAC is sent as part of the Authentication header.

Authenticate joiner

After receiving the intent to join from any potential joiner, the initial microcloudd first has to validate the contents of the received payload. This is required to filter out joiners that don’t have the same version of MicroCloud as well as the ones that have sent a payload with an HMAC that cannot be reproduced on the receiving side. Those have to be marked with extra care as this could be the result of a MITM attack.

Using the salt and the random password that has been generated by the initial microcloudd during the startup, the exact same key can now be derived using the same KDF as on the joining side. Now the HMAC over the body can be computed using the key and compared to the one that got sent over the wire. If the HMACs match, the administrator has the possibility to approve the join request from the joiner. Joiners with invalid versions and non matching HMAC’s are rejected.

After the joiner is accepted, its public key gets added to the local microcloudd’s temporary trust store (bound to its address) which allows for certificate validation of new mTLS connections during the remainder of the cluster forming. Now when opening up a new mTLS connection from the initial microcloudd to the joiner, the certificate provided from the other end has to match the one which is tracked in the local temporary trust store.

Authenticate cluster

The last step of the discovery allows the joiner to also verify that it is joining the right cluster. As the initial microcloudd already knows the address of the joiner (from the received payload), a new HTTPS request is made to the API of the joiner. As the received payload from before, the request’s body contains:

The local public key
The version of MicroCloud
The name of the node

The HMAC of the request body is sent alongside the Authentication header.

After receiving the request, the joiner now computes the HMAC itself using the key from before and the contents of the request’s body. If the HMAC doesn’t match, this might be an indication for a MITM attack. As the protocol doesn’t foresee multiple clusters contacting the same joiner, such a mismatch is ignored but an appropriate warning message is logged to the daemon’s log of the joiner. Also if the version doesn’t match, the request should be ignored too. The initial microcloudd should have never contacted the joiner in the first place if the versions don’t match.

If both the version and HMAC matches, the administrator is asked to approve the request from the cluster:

Scanning for response ...

Would you like to join m1 (fingerprint): (yes/no) [default=yes]:

After the cluster is accepted, its public key gets added to the local microcloudd’s temporary trust store (bound to its address) which allows for certificate validation of new mTLS connections during the remainder of the cluster forming. Now if the joiner receives a new mTLS connection from the initial microcloudd, it can verify the provided public key based on the entry in its local trust store.

The response of the HTTPS request indicates a successful pairing and marks the end of the discovery/authentication protocol. This also marks the end of the session and discards the random password on each end.

Cluster forming

The initial microcloudd can now use mTLS to retrieve further information from the joiner and both ends can validate the other side based on their temporary trust store entries. Furthermore join tokens are created on the initial microcloudd for each of the services (LXD, MicroCeph and MicroOVN). Those tokens are now sent through the encrypted and trusted mTLS channel in order to form each of the services MicroCluster’s.

Cleanup

During the cleanup stage both ends discard their temporary trust store as the service’s MicroClusters are formed and the trust is established in each MicroCluster’s own truststore.

Daemon and API changes

MicroCloud

A new API extension is added that indicates the change in how MicroCloud performs the discovery/authentication.

Joiner request

The request payload is sent with the following information. Check the previous sections on some more explanations:

type SessionJoinPost struct {
    // The current version of the payload format
    Version     string
    // The hostname of the node
    Name        string
    // The address of the node's MicroCloud API endpoint
    Address     string
    // A list of services (e.g. LXD, MicroCeph, MicroOVN) present on this node
    Services    []types.ServiceType
    // The node's public certificate for mTLS
    Certificate string
}

The struct is fed into the HMAC tooling which generates the salt, runs the KDF and produces the final HMAC header containing both the salt and HMAC:

h, err := trust.NewHMACArgon2([]byte(session.Passphrase), nil, trust.NewDefaultHMACConf(HMACMicroCloud10))
if err != nil {
	return fmt.Errorf("Failed to create a new HMAC instance using argon2: %w", err)
}

header, err := trust.HMACAuthorizationHeader(h, <SessionJoinPost struct>)
if err != nil {
	return fmt.Errorf("Failed to create HMAC for join intent: %w", err)
}

// header uses the form: "Authorization: MicroCloud1.0 <salt>:<HMAC>"

Temporary trust store

MicroCloud will maintain a temporary trust store on both ends that gets filled up with the public key of the respective peer. This temporary trust store has to be made available to MicroCluster so that the custom API endpoints of MicroCloud can use this temporary store instead.

There is an open proposal in the MicroCluster repo (#120) which makes the authentication handler public so that an importer of MicroCluster (like MicroCloud) can inject it’s own trust store information for every custom API endpoint. A more detailed description on the specifics can be found here.

In any case the X-MicroCloud-Auth header is being removed as there isn’t anymore a secret being broadcasted to the local network.
Requests to any peers of the MicroCloud have to be made using a mTLS connection which can be trusted on both ends using the temporary trust store.
This is only possible if both ends have successfully finished the discovery/authentication protocol.

Joining existing services

As part of #259 MicroCloud grows support to reuse existing MicroCeph and MicroOVN clusters. The process behind relies on one of the microcloudd within the existing cluster being able to create a join token on the peer that allows joining into the already existing remote cluster(s).

This concept wouldn’t be blocked as microcloudd would continue to use the same paths of communication to reuse the existing clusters.

Rate limiting

As the number of words inside the wordlist are limited, during the lifetime of a session an attacker might be able to guess the right session passphrase by retrying until the right passphrase is found.
To prevent this from happening, every request made by any joiner throughout the lifetime of the session is protected by the following measures:

Every join request is delayed by 100 milliseconds (See microcloud/api/session_join.go at main · canonical/microcloud · GitHub). This has the benefit that throughout the session lifetime only x passwords can be potentially tried by an attacker until the session expires.
Maximum session lifetime of 60 minutes (See microcloud/api/session.go at main · canonical/microcloud · GitHub). If the time expires the session passphrase is discarded and the session is closed
Maximum of 50 failed join attempts (See microcloud/service/session.go at main · canonical/microcloud · GitHub). If there are more than 50 failed join attempts throughout the lifetime of a session, the session passphrase is discarded and the session is closed.

MicroCluster

To have as little impact as possible on other active importers of MicroCluster (e.g. MicroCeph, MicroOVN), the modifications for the temporary trust store won’t affect any of the existing setups. It’s the choice of the importer to make use of this added functionality using the temporary trust store.

CLI changes

MicroCloud

Join command

A new command microcloud join gets added to allow a peer joining into MicroCloud.
This command is also the starting point after which the joiner’s microcloudd can send its cluster forming intent to the initiating side.
The command prompts the administrator to enter the random password that got displayed by the initial microcloudd.
The command blocks until the request got approved on both sides.

Preseed command

A new command microcloud preseed gets added to allow for unattended deployments.
All participants of the deployment load the password from the preseed file that gets passed via stdin.

Session timeout

In addition a new --session-timeout flag is added to both the init and join subcommands. It allows exiting the session at time x so that both ends discard their temporary trust stores and forget about the random password. Afterwards a new discovery/authentication session has to be started by running the init and join subcommands again on both ends.
The default session timeout value is set to ten minutes.

UX

To approve the requests on both ends, the dialogs displaying the information have to be extended to allow “greying out” invalid requests as well as setting a notification in case a peer cannot be selected due to version or integrity mismatches.

Database changes

No database changes expected.

Packaging changes

No packaging changes expected.

masnax · April 17, 2024, 4:11pm

Great write-up, thanks for this!

I believe @sdeziel1 mentioned there might be some issues with keyboard-and-mouse systems and copying strings from one system to another becomes very difficult. I’m not sure if this is something that greatly affects current/future MicroCloud users.

The problem is, if we don’t manually input a secret string/join token at some point, then there’s no way to verify whether an mDNS payload actually came from a genuine system. Since we can’t trust the local network, then we must expect any bad actor can just listen for the payload and broadcast the same thing, and we could mistakenly trust the spoofed server instead.

All that said, I do like option C the best because it doesn’t break the flow of the initialization process, and the secret can be selected by the user so the keyboard-and-mouse case is less of a problem.

sdeziel1 · April 17, 2024, 6:35pm

Thanks for jogging my memory. By those keyboard-and-mouse systems, I was referring to KVM (Keyboard, Video, Mouse) consoles sometimes present in server racks. Those give you console access to each of the servers in the rack but you cannot copy-n-paste between them. This means we should aim for easily (repeatedly) typed input.

(Still not done reading this spec so more feedback to come later).

masnax · April 17, 2024, 7:39pm

To add on to what I was thinking for option C, maybe microcloud init could look a bit like this?


   Scanning for eligible servers ...
   Please enter the following on any systems you want to join the cluster.

     microcloud cluster verify adjective-noun

   Space to select; enter to confirm; type to filter results.
   Up/down to move; right to select all verified; left to select none.
          +---------+--------+---------------+------------+
          |  NAME   | IFACE  |     ADDR      |   STATUS   |
          +---------+--------+---------------+------------+
   > [x]  | micro3  | enp5s0 | 203.0.113.171 |  verified  |
     [x]  | micro4  | enp5s0 | 203.0.113.172 |  verified  |
     [ ]  | micro2  | enp5s0 | 203.0.113.170 | unverified |
     [ ]  | micro5  | enp5s0 | 203.0.113.173 | unverified |
          +---------+--------+---------------+------------+

So how this would work is like follows:

when all MicroCloud daemons start, they continuously listen for mDNS payloads
first node runs microcloud init and broadcasts that it is looking to form a cluster
when other nodes receive this payload, they then broadcast basic information (the same info as today, but without the X-MicroCloud-Auth secret included).
the first node now consumes the minimal payloads from the other systems. At the same time, it generates a human-readable secret that must be entered on every joining system before we can proceed by running microcloud cluster verify <secret>. This secret is only displayed locally on the first node.
when other nodes run microcloud cluster verify <secret>, they change the payload they are broadcasting to instead be hashed with the secret, and include any other sensitive information that we need to set up the Authorization request header for requests going from the first node to joiners. The second “sensitive” payload can actually be sent directly via HTTP back to the first node as well, so we don’t even need to broadcast the hashed payload over the local network.

The table of systems that the first node finds from mDNS lookup will have a column STATUS that reports the verification status of any particular node. If it is still broadcasting the raw minimal payload, it will be considered unverified. If it’s broadcasting a payload that is hashed with the secret, it will be considered verified. The table will only allow selecting verified systems.

This way, KVM systems like @sdeziel1 mentioned can easily input the verification as it is consistent and human-readable, and the user doesn’t expose any sensitive information openly over the local network. As well, the user gets immediate feedback on the first node about when it can actually proceed with the initialization.

For the preseed, you’ve mentioned preloading the certificates on each system but that is itself a form of user interaction per system so I’m not sure if it makes a difference if we just use microcloud cluster verify <secret> on each system prior to running the preseed. I’m not sure if we need two separate verification mechanisms here.

Using another API endpoint

To prevent having long running requests, the existing public POST /cluster/1.0/cluster endpoint could return right after validating the token and marking the new peer as pending.
By adding a new GET /cluster/1.0/cluster/{member} endpoint the joining side can perform regular polling until the join request got allowed by an administrator.

For a bit of background on the PENDING cluster status in microcluster, that actually means that at least some nodes in the cluster will not yet accept requests originating from the PENDING node, as they do not yet have a truststore entry, which is used for API authentication. At the moment, newly joined nodes will remain pending until the next heartbeat synchronizes the truststore across all nodes. However, due to go-dqlite issues causing extremely resource-intensive heartbeats, the heartbeats occur over very long intervals. I have been working on instantly distributing truststore entries to all nodes in the cluster when a new node joins to work around the heartbeat issue, so that might pose some problems for this approach.

So some key points that we need to address:

Is one “secret” per initialization process enough, or should we have a unique “secret” per joining node?
How long should the “secret” live? The proposal in the spec involves generating the “secret” before calling microcloud init, but if instead we automatically generate it when running microcloud init then we can control its lifecycle more thoroughly. If it’s generated by running microcloud init, then it should expire if the initialization is cancelled, but how should we inform other nodes that the initialization was cancelled, assuming they have already been verified by the user?

maria-seralessandri · April 18, 2024, 7:51am

Thank you Julian for the spec. Option C seems to me the most viable as well. We should find a solution that doesn’t exclude automation, since for large clusters the administrator cannot add manually all nodes one by one. For option C I have a few comments:

Additional information on how the secret is stored securely on the nodes with microcloud config set secret
Some expiration for the secret is needed, it cannot last forever and also it is the same for all the nodes of the cluster. If in a second moment a node needs to be added to the cluster a different secret should be used and not the initial one

tomp · April 18, 2024, 11:40am

For Option A, I dont believe the spec as it currently stands explains how one can manually verify the cert fingerprint of a joining node matches the one expected? We would need a way for the joining node to locally display its fingerprint right?

tomp · April 18, 2024, 11:45am

Option B sounds similar to LXD’s long-lived trust password model, which allows setting a shared password that allows joining members to add their certificate into the cluster’s trust not.
It is not recommended anymore and is why short-lived per-member join tokens were added to avoid the leaking of the shared password presenting a security issue.

https://documentation.ubuntu.com/lxd/en/latest/authentication/#authentication-trust-pw

The difference is that the existing cluster has to verify the joiner in this case, whereas that doesn’t need to occur with LXD’s trust password model.

tomp · April 18, 2024, 12:02pm

This is similar to option B, except that apparently this secret is long-lived whereas in option B it was only valid during the microcloud init call and wasn’t persisted to the database. Is this correct?

This option seems even more similar to LXD’s not-recommended long-lived shared join password approach, except it does still require confirmation from the existing cluster.

tomp · April 18, 2024, 12:20pm

@maria-seralessandri agreed, but as I understand it, there are a couple of “flavours” of automation available to us.

Retain the option for MicroCloud to deploy itself from pre-seed files without interaction. This may not be possible given the requirement for confirming each side of the join process.
Arbitrated deployment by way of something like Juju. @sdeziel1 mentioned the other day that if MicroCloud itself cannot deploy itself in an automated manner, it may still be possible to provide automated deployments if using something like Juju which can replicate the manual verification steps required.

jpelizaeus · April 19, 2024, 12:52pm

If I understood it right your suggestion adds the following on top of C:

The initial MicroCloud daemon broadcasts its intent to form a cluster. This will cause the peers to start broadcasting
The peers respond either with a “plain” or “hashed” broadcast depending on whether the administrator already executed the microcloud cluster verify <secret> command on the peer to set the secret
The initial MicroCloud marks a peer as verified (trusted) if it receives a hashed broadcast

What would be the benefit of letting the initial MicroCloud daemon tell its local network that it wants to form a cluster if an admin anyway has to go to each of the nodes to enter the secret?
The only point I can think of is letting the peer validate that it doesn’t join a malicious cluster. But this functionality we already have in MicroCluster as the token contains the clusters fingerprint which can be validated by the joining side.

I like the idea of generating the secret in MicroCloud directly so that we have more control over it. When installing the snap on each of the nodes the secret will of course be different but with this approach it only needs to be changed on the joining side and can be leaved untouched on the initial MicroCloud which reduces the amount of required steps.

jpelizaeus · April 19, 2024, 12:55pm

That is a good point. The certificate gets created automatically in MicroCluster’s state directory but AFAIK there is currently no straightforward command to retrieve this information on the joining side.

tomp · April 19, 2024, 1:12pm

@jpelizaeus

@masnax and I were discussing in our 1:1 the possibility of avoiding needing microcloud config set core.secret xyz and persisting the secret to disk.

And instead having microcloud init generate a per-invocation secret, that would then be used as an argument/interactive to a microcloud join <secret> command which would block until the join was completed. That joining members would only start broadcasting when microcloud join was running.

And when the the microcloud [init|join] commands end they would forget their invocation secret.

jpelizaeus · April 19, 2024, 1:39pm

Thanks I see, so that would be option B without having a persistent token? As today the token/secret could contain the join information (address) which would make having mDNS obsolete. And the join request from the peer to the initial MicroCloud daemon could then already be a HTTPS request using the secret/HMAC for verification purposes.

masnax · April 19, 2024, 5:21pm

What would be the benefit of letting the initial MicroCloud daemon tell its local network that it wants to form a cluster if an admin anyway has to go to each of the nodes to enter the secret?

The initial MicroCloud still needs to somehow become aware of the joining nodes, and the joining nodes need to become aware of the initial node. Otherwise we would have to manually type in addresses as part of microcloud cluster verify <secret> <init node's address>.

Today, as soon as a MicroCloud snap is installed, it begins advertising its address and services over the local network. I’m proposing making this less noisy by instead making all nodes listen by default, and only trigger the advertisement after microcloud init has been executed somewhere on the local network.

Without this, all nodes (including the initial node, because it didn’t have to be the initial node) would have to perpetually broadcast their intent anyway, or we drop mDNS entirely and the user specifies the init node’s address directly to each joiner.

By keeping mDNS, we can still maintain some determination of compatibility of all nodes when running microcloud init, before going to each node and verifying them. We can see right away which nodes can even join the cluster, or have the same services, and we don’t have to log into each one first.

jpelizaeus · April 22, 2024, 1:38pm

I see what you mean. But if we use a secret/join token that embeds the address of the MicroCloud (like it is currently done for MicroCluster), the administrator doesn’t have to manually type it in on the joining side and the MicroCloud doesn’t need to broadcast it to potential peers.
However this collides with what @sdeziel1 wrote in regards to the KVM consoles as such a secret wouldn’t be easy to type in these environments.

In regards to the service discovery, we can potentially perform this as part of the initial join request from the peer to the MicroCloud by extending the request body.

jpelizaeus · April 22, 2024, 2:45pm

@tomp @masnax I have extended the spec with options B2 and C2 to address your feedback on using short lived secrets within a so called session so that we don’t have to persist anything to disk.

sdeziel1 · April 22, 2024, 2:52pm

If we consider the LAN to be untrusted/hostile, we then need a solution that is resistant to MiTM. This problem space has years of research and many failed attempts along the way so I think we should use something tried and true. Here are some existing solutions I’m aware of:

IPsec with Pre-Shared Key
TLS-SRP
WiFi WPA Pre-Shared Key
WiFi WPA3 SAE
Bluetooth Simple Secure Pairing with numeric comparison

The Bluetooth one seems particularly attractive to secure an unprotected mDNS conversation doing all the heavy lifting of copying certs and keys around.

masnax · April 22, 2024, 5:05pm

So I hope I’ve got the flow correct here:

B2 interactive setup:

First node runs microcloud init
- A token including the secret and the address of the init node is generated
- The init node waits for joiners to contact it. The user chooses when to continue through the setup.
All nodes must run microcloud join <token>
- The joiner reaches out to the init node over HTTPS, with info about joining the cluster.
Through the setup, until the nodes are clustered, they are trusted by the init node using the token.

Potential Issues with B2

Cumbersome to KVM setup due to having to copy the encoded secret to each joiner.
If microcloud init is aborted, the joiners will continue to perpetually trust the invalid token. But if the tokens have an expiry, they can expire before the user has finished the interactive setup. We would need to poll the init node from the joiners regularly.
We need some way to handle mismatch of installed services on each node. Currently, MicroCloud’s mDNS record will filter out any nodes that don’t have the same set of of services, or offer to the user if they want to skip that service. This would have to be implemented on each joiner instead when we call microcloud join <token>. Or we would have to be stricter about what service combinations are required.

C2 interactive setup:

The first node runs microcloud init
- It broadcasts its intent to form a cluster over mDNS.
- A plaintext, human readable password is generated.
- The init node begins looking up eligible systems over mDNS and displays them to the user.
The joiners enter microcloud join <password>
- Each joiner generates an HMAC payload encoded with the password, and broadcasts it over mDNS.
The init node receives the mDNS payloads, and decodes them with the password.
- The user makes a final confirmation of which nodes they want in the cluster
Through the setup, until the nodes are clustered, they are trusted by the init node using the password.

Potential issues with C2

C2 actually handles the issues from B2 rather well:
- The passwords are human-readable so KVMs have a viable solution.
- Because the init node is broadcasting its intent to form a cluster, the password will only be trusted as long as the broadcast is ongoing.
- Because we have some minimal information from each node prior to running microcloud join <password>, it’s easier to spot issues and config mismatches before logging into every joiner.
Biggest issue I see is that it includes mDNS so it’s a more complex system than B2

Preseed

In both B2 and C2, we would have to make some compromises for preseed authentication:
- In the case of B2, it’s not enough for the user to specify the secret because the init node’s address and available services must also be encoded. This means the user will have to run microcloud init --preseed first, which will then print out the token that the user must use in microcloud cluster join <token> on each joiner.
- For C2, we could either do the same as above, or the user can supply their own password directly in the preseed file.

sdeziel1 · April 29, 2024, 3:16pm

The EFF publishes a few lists of words that are easy and quick to memorize/type/autocomplete. Those are meant to be use in Diceware type of passwords but I think would make a good basis for the “authenticated exchange” PSK validation we intend on doing.

https://www.eff.org/deeplinks/2016/07/new-wordlists-random-passphrases

The short word list sounds interesting and should be easily embedded into the daemon.

jpelizaeus · April 29, 2024, 4:03pm

In Bluetooth SSP those verification numbers are actually derived from a hash that gets computed on both ends based on information that gets exchanged during the pairing.

In case of “Numeric Comparison protocol” (check section 7.2.1 of https://www.bluetooth.com/wp-content/uploads/Files/Specification/HTML/Core-54/out/en/br-edr-controller/security-specification.html#UUID-045cba38-3e1c-51b9-a02f-75356c6829c1), the numeric verification number is created by taking the last 32 bit from the hash function g(...)'s output (see section 7.7.2 in the same link) and dividing it by 10⁶ to always get six numbers.

I like the idea but it’s not an arbitrary string. Instead it’s based upon information provided by both ends with enough randomness.