Getting started with FEDn

Note

This tutorial is a quickstart guide to FEDn based on a pre-made FEDn Project. It is designed to serve as a starting point for new developers. To learn how to develop your own project from scratch, see Develop a FEDn project.

Prerequisites

1. Set up project

  1. Create a FEDn account. Sign up at fedn.scaleoutsystems.com/signup.

  2. Verify your email. Check your inbox for a verification email and click the link to activate your account.

  3. Log in and create a project. Once your account is activated, log in to the Studio and create a new project.

  4. Manage your projects. If you have multiple projects, you can view and manage them here: fedn.scaleoutsystems.com/projects.

Tip

You can also create a project using our CLI tool. Run the following command: For more details, see CLI.

fedn project create --name "My Project"

Replace “My Project” with your desired project name.

2. Prepare the clients and define the global model

Next, we will prepare and package the ML code to be executed by each client and create a first version of the global model (seed model). We will work with one of the pre-defined projects in the FEDn repository, mnist-pytorch.

First install the FEDn API on your local machine (client):

Using pip

On you local machine/client, install the FEDn package using pip:

pip install fedn

From source

Clone the FEDn repository and install the package:

git clone https://github.com/scaleoutsystems/fedn.git
cd fedn
pip install .

Create the compute package and seed model

In order to train a federated model using FEDn, your Studio project needs to be initialized with a compute package and a seed model. The compute package is a code bundle containing the code used by the client to execute local training and local validation. The seed model is a first version of the global model. For a detailed explaination of the compute package and seed model, see this guide: Develop a FEDn project

To work through this quick start you need a local copy of the mnist-pytorch example project contained in the main FEDn Git repository. Clone the repository using the following command, if you didn’t already do it in the previous step:

git clone https://github.com/scaleoutsystems/fedn.git

Navigate to the fedn/examples/mnist-pytorch folder. The compute package is located in the folder client.

Create a compute package:

fedn package create --path client

This will create a file called package.tgz in the root of the project.

Next, create the seed model:

fedn run build --path client

This will create a file called seed.npz in the root of the project.

Note

This example automatically creates the runtime environment for the compute package using Virtualenv. When you first exectue the above commands, FEDn will build a venv, and this takes a bit of time. For more information on the various options to manage the environement, see Develop a FEDn project.

Next will now upload these files to your Studio project.

3. Initialize the server-side

The next step is to initialize the server side with the client code and the initial global model. In the Studio UI,

Upload the compute package

  1. Navigate to your project from Step 1 and click Packages in the sidebar.

  2. Click Add Package.

  3. In the form that appears, enter a name and upload the generated package file.

Note

If no compute package is selected, the system will run in local mode. This is an advanced option that allows each client to connect with their own custom training and validation logic. It can also be useful during development, as it eliminates the need to upload a new package with every change or version update.

Upload the seed model

  1. Navigate to your project from Step 1 and click Models in the sidebar.

  2. Click Add Model.

  3. In the form that appears, upload the generated seed model file.

Note

You can upload multiple compute packages and seed models, selecting the appropriate one for each session. To create a new session from any model, navigate to its model page.

Continue to step 4 before starting the session. The uploaded package and seed files are saved.

4. Start clients

Before starting the clients, we need to configure what data partition the clients should use. This way each client will have access to a unique subset of the data.

Manage Data Splits for MNIST-PyTorch

The default training and test data for this particular example (mnist-pytorch) is for convenience downloaded and split automatically by the client when it starts up. The number of splits and which split to use by a client can be controlled via the environment variables FEDN_NUM_DATA_SPLITS and FEDN_DATA_PATH.

Setup the environement for a client (using a 10-split and the 1st partition) by running the following commands:

  • Unix/MacOS
  • Windows (PowerShell)
  • Windows (CMD.exe)
export FEDN_PACKAGE_EXTRACT_DIR=package
export FEDN_NUM_DATA_SPLITS=10
export FEDN_DATA_PATH=./data/clients/1/mnist.pt
$env:FEDN_PACKAGE_EXTRACT_DIR=".\package"
$env:FEDN_NUM_DATA_SPLITS=10
$env:FEDN_DATA_PATH=".\data\clients\1\mnist.pt"
set FEDN_PACKAGE_EXTRACT_DIR=.\package\\
set FEDN_NUM_DATA_SPLITS=10
set FEDN_DATA_PATH=.\data\\clients\\1\\mnist.pt

Start the client (on your local machine)

Each local client requires an access token to connect securely to the FEDn server. These tokens are issued from your FEDn Project.

  1. Navigate to the Clients page and click Connect Client.

  2. Follow the instructions in the dialog to generate a new token.

  3. Copy and paste the provided command into your terminal to start the client.

Repeat these two steps for the number of clients you want to use. A normal laptop should be able to handle several clients for this example. Remember to use different partitions for each client, by changing the number in the FEDN_DATA_PATH variable.

5. Train the global model

With clients connected, we are now ready to train the global model.

Tip

You can use the FEDn API Client to start a session and monitor the progress. For more details, see Using the API Client.

client.start_session(name="My Session", rounds=5)

In the FEDn UI,

  1. Navigate to the Sessions page and click on “Create session”. Fill in the form with the desired settings.

  2. When the session is created, click “Start training” and select the number of rounds to run.

  3. Once the training is started, you can follow the progress in the UI.

In the terminal where your are running your client you should now see some activity. When a round is completed, you can see the results on the “Models” page.

Congratulations, you have now completed your first federated training session with FEDn! Below you find additional information that can be useful as you progress in your federated learning journey.

Note

In FEDn Studio, you can access global model updates by going to the ‘Models’ or ‘Sessions’ tab. Here you can download model updates, metrics (as csv) and view the model trail.

Where to go from here?

With you first FEDn federated project set up, we suggest that you take a closer look at how a FEDn project is structured to learn how to develop your own FEDn projects:

Develop a FEDn project

In this tutorial we relied on the UI for running training sessions and retrieving models and results. The Python APIClient provides a flexible alternative, with additional functionality exposed, including the use of different aggregators. Learn how to use the APIClient here:

Using the API Client

Study the architecture overview to learn more about how FEDn is designed and works under the hood:

Architecture overview

For developers looking to customize FEDn and develop own aggregators, check out the local development guide to learn how to set up an all-in-one development environment using Docker and docker-compose:

Developer guide