Byzantine-robust decentralized learning via self-centered clipping

In this paper, we study the challenging task of Byzantine-robust decentralized training on arbitrary communication graphs. Unlike federated learning where workers communicate through a server, workers in the decentralized environment can only talk to their neighbors, making it harder to reach consensus. We identify a novel dissensus attack in which few malicious nodes can take advantage of information bottlenecks in the topology to poison the collaboration. To address these issues, we propose a Self-Centered Clipping (SSClip) algorithm for Byzantine-robust consensus and optimization, which is the first to provably converge to a $O(\delta_{\max}\zeta^2/\gamma^2)$ neighborhood of the stationary point for non-convex objectives under standard assumptions. Finally, we demonstrate the encouraging empirical performance of SSClip under a large number of attacks.

Structure of code
Reproduction
License
Reference

Code organization

The structure of the repository is as follows:

codes/
- Source code.
outputs/
- Store the output of the launcher scripts.
consensus.ipynb: Study the error of aggregators to the average consensus under dissensus attack.
- This notebook generates Fig. 3 in the main text and Fig. 8 in the appendix.
dumbbell.py: Study how topology + heterogeneity influence on the aggregators.
dumbbell_improvement.py: Study how to help aggregators to address topology + heterogeneity influence.
dumbbell.ipynb: Plot the results of dumbbell.py and dumbbell_improvement.py.
- Generate Fig. 4 in the main text.
optimization_delta.py: Fix p, zeta^2 and varying delta of dissensus attack for SCClip aggregator.
- Generate Fig. 5 in the main text.
honest_majority.py: Study the influence of honest majority in the text.
- Generate Fig. 6 in the main text.

Reproduction

To reproduce the results in the paper, do the following steps

Add codes/ to environment variable PYTHONPATH
Install the dependencies: pip install -r requirements.txt
Run bash run.sh and select option 2 to 9 to generate the code.
The output will be saved to the corresponding folders under outputs

Note that if the GPU memory is small (e.g. less than 16 GB), then running the previous commands may raise insufficient exception. In this case, one can decrease the level parallelism in the script by changing the order of loops and reduce the number of parallel processes.

License

This repo is covered under The MIT License.

Reference

TODO

Byzantine-robust decentralized learning via self-centered clipping

Related tags

Overview

Byzantine-robust decentralized learning via self-centered clipping

Table of contents

Code organization

Reproduction

License

Reference

Owner

EPFL Machine Learning and Optimization Laboratory

Official repository for "PAIR: Planning and Iterative Refinement in Pre-trained Transformers for Long Text Generation"

A vision library for performing sliced inference on large images/small objects

Torch-based tool for quantizing high-dimensional vectors using additive codebooks

Using Language Model to Bootstrap Human Activity Recognition Ambient Sensors Based in Smart Homes

Implementation of paper "Self-supervised Learning on Graphs:Deep Insights and New Directions"

Implementation of CVPR 2020 Dual Super-Resolution Learning for Semantic Segmentation

Official implementation of "A Shared Representation for Photorealistic Driving Simulators" in PyTorch.

Codebase for Time-series Generative Adversarial Networks (TimeGAN)

pytorch implementation for Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network arXiv:1609.04802

[CVPR 2016] Unsupervised Feature Learning by Image Inpainting using GANs

Does Pretraining for Summarization Reuqire Knowledge Transfer?

Geometric Deep Learning Extension Library for PyTorch

Realtime YOLO Monster Detection With Non Maximum Supression

Code release for NeX: Real-time View Synthesis with Neural Basis Expansion

This is Unofficial Repo. Lips Don't Lie: A Generalisable and Robust Approach to Face Forgery Detection (CVPR 2021)

Repo for "TableParser: Automatic Table Parsing with Weak Supervision from Spreadsheets" at [email protected]

Semi-supervised Learning for Sentiment Analysis

This is the research repository for Vid2Doppler: Synthesizing Doppler Radar Data from Videos for Training Privacy-Preserving Activity Recognition.

A framework for attentive explainable deep learning on tabular data

Demystifying How Self-Supervised Features Improve Training from Noisy Labels