- Jul 24, 2023
Can be heavily simplified if the implementation of model seeding does not require anything fancier than a seed integer.
- Jul 17, 2023
- ANDREY Paul authored
- ANDREY Paul authored
- ANDREY Paul authored
- Jul 13, 2023
- ANDREY Paul authored
Add support for Torch 2.0 Closes #27 See merge request !49
- ANDREY Paul authored
- ANDREY Paul authored
- ANDREY Paul authored
- Torch 2.0 introduced `torch.compile`, a novel utility that optimizes computations by JIT-compiling them into optimized kernels. At the moment, compiled modules cannot be saved (but their states, which are shared with the underlying original module, can), and they are not compatible with `torch.func` functional execution either.
- As declearn vows to support Torch 2.0, it seems crucial that end-users may use `torch.compile`. However, as long as Torch 1.10-13 versions (and previous versions of DecLearn 2.X) are supported, this handling should not break backward compatibility.
- An initial approach was to enable compiling the handled module as part of `TorchModel.__init__`. However, this proved impractical, as it takes away the assumption that end-users should be able to use their custom-prepared module as-is, including pre-compiled ones. This is all the more true as there are many options to the `torch.compile` function that DecLearn has no purpose handling.
- Therefore, the approach implemented here is to detect and handle models that were compiled prior to being input into `TorchModel`.
- A few notes and caveats regarding the current implementation:
    - Some impractical steps were taken to ensure weights and gradients have the same name regardless of whether the module was compiled or not, _and regardless of the specific 2.x version of declearn_. When we move to declearn 3, this will be worth revising.
    - A positive consequence of the previous note is that compiling the module should not impair cases where some clients are using Torch 1 and/or an older 2.x version of declearn.
    - The will to retain user-defined compilation options is mostly lost, due to the current lack of recording of this information by Torch. This is however expected to evolve in the future, which should enable sharing compilation instructions with clients. See pytorch issue 101107: https://github.com/pytorch/pytorch/issues/101107
    - A clumsy bugfix was introduced to avoid an issue where the wrapped compiled model would not take weight updates into account when running in evaluation mode. The status of this hack should be kept under close watch as the issue opened to report the bug is handled: https://github.com/pytorch/pytorch/issues/104984
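The detection described above can be sketched as follows. This is a hypothetical illustration (not declearn's actual code), assuming `torch>=2.0`, and relying on the fact that a `torch.compile`-d module wraps the original module, whose parameters it shares, under its `_orig_mod` attribute:

```python
import torch

def unwrap_compiled(module: torch.nn.Module) -> torch.nn.Module:
    """Return the original module if `module` is a compiled wrapper.

    Hypothetical helper: compiled modules expose the wrapped (and
    state-sharing) original module as `_orig_mod`.
    """
    return getattr(module, "_orig_mod", module)

model = torch.nn.Linear(4, 2)
compiled = torch.compile(model)  # compilation itself happens lazily, on first call

assert unwrap_compiled(compiled) is model
assert unwrap_compiled(model) is model
```

Saving the unwrapped module's state dict then works identically whether or not the user pre-compiled their model.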
- ANDREY Paul authored
- ANDREY Paul authored
- Loosen the "all" and "torch" extras, enabling (and thus, in most cases, resulting in) installing torch >=2.0 rather than 1.1[0-3].
- Add "torch1" and "torch2" extras to specify explicitly whether the user would rather use Torch 1.13 or >=2.0. Opacus is made part of these two extras, as its version is also co-dependent on Torch/Functorch.
- Document this change as part of the package installation guide.
- ANDREY Paul authored
- Torch 2.0 was released in March 2023, introducing a number of new features to the torch ecosystem, while being non-breaking with respect to past versions on most points.
- One of the salient changes is the introduction of `torch.func`, which integrates features previously introduced as part of the `functorch` package, which we have been relying upon to efficiently compute and clip sample-wise gradients.
- This commit therefore introduces a new backend code branch so as to make use of `torch.func` when `torch~=2.0` is used, and retains the existing `functorch`-based branch when older torch versions are used.
- As part of this effort, some of the `TorchModel` backend code was refactored, notably deferring the functional transform of the input `torch.nn.Module` to the first call for sample-wise gradient computation. This has the nice side effect of fixing cases where users want to train a non-functorch-compatible model without DP-SGD.
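The `torch.func` branch described above can be illustrated with a minimal sketch, assuming `torch>=2.0` (the model and shapes are illustrative, not declearn's actual code): a per-sample loss is made functional via `functional_call`, differentiated with `grad`, and vectorized over the batch dimension with `vmap` to obtain one gradient per sample:

```python
import torch
from torch.func import functional_call, grad, vmap

model = torch.nn.Linear(3, 1)
params = dict(model.named_parameters())

def sample_loss(params, x, y):
    # Run the module functionally on a single sample.
    pred = functional_call(model, params, (x.unsqueeze(0),))
    return torch.nn.functional.mse_loss(pred.squeeze(0), y)

inputs = torch.randn(8, 3)
targets = torch.randn(8, 1)

# vmap over the batch dimension yields per-sample gradients:
# each entry holds one gradient per sample, e.g. weight: (8, 1, 3).
per_sample_grads = vmap(grad(sample_loss), in_dims=(None, 0, 0))(
    params, inputs, targets
)
```

Under `torch~=1.1[0-3]`, the equivalent code would use `functorch.make_functional`, `functorch.grad` and `functorch.vmap` instead.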
- Jul 12, 2023
- ANDREY Paul authored
Refactor tests pipeline, introducing 'scripts/run_tests.sh'. See merge request !52
- ANDREY Paul authored
- ANDREY Paul authored
- Add the 'scripts/run_tests.sh' bash script, which enables running various blocks of tests. This mostly refactors instructions that used to be part of 'tox.ini', in a more convenient way:
    - some tests are grouped (commands in a group _all_ run, but the group fails if any has failed once all are done);
    - instructions are (hopefully) easier to read through and should be easier to maintain in the future;
    - it should be easier to derive more groups in the future (e.g. to have a routine with only unit tests);
    - groups of tests can be run easily, including without using tox.
- Have the 'tox.ini' file trigger some groups of tests by calling the new bash script. This makes the tox file easier to read and parse through, and merely oriented toward providing run isolation.
- Split test groups into various envs in tox, which may be slightly more costly to run, but enables writing simpler CI/CD pipelines that merely pass arguments to the tox command.
- Make it explicit in 'tox.ini' that tests can be run with Python 3.8 and onwards.
- Keep a "main" job that groups all others. This is useful on the Inria shared GitLab Runner, where caching is disabled, so that using distinct jobs would result in redundant, costly package installations.
- ANDREY Paul authored
Revise the MNIST example, cleaning up the code and documentation. See merge request !51
- ANDREY Paul authored
- Jul 06, 2023
- ANDREY Paul authored
Fix compatibility issues with scikit-learn 1.3.0 and tensorflow 2.13.0 Closes #28 and #29 See merge request !50
Co-authored-by: Paul ANDREY <paul.andrey@inria.fr>
- ANDREY Paul authored
The legacy 'test_main.py' integration tests are now mostly redundant with other integration tests, adding little marginal value to the CI/CD tests pipeline. From a coverage standpoint, the only thing that is not already tested elsewhere is the actual use of Scaffold. Hence the test scenarios collected during PyTest runs are restricted to:
- FedAvg and Scaffold variants of optimization strategies (prior: FedAvg + FedAvgM + Scaffold);
- in non-fulltest mode, only the Scaffold strategy (prior: FedAvg + Scaffold).
This results in testing 3 scenarios in the limited pipeline and 18 in the "fulltest" one, where there previously were 6 or 27 scenarios.
- ANDREY Paul authored
- ANDREY Paul authored
- ANDREY Paul authored
- ANDREY Paul authored
- Scikit-Learn 1.3.0 introduced the possibility to use any dtype for SGD-based models' weights.
- As a consequence, this commit introduces the optional 'dtype' argument to the 'SklearnSGDModel.__init__' and '.from_parameters' methods, which is only used with `scikit-learn >= 1.3` and an un-initialized model.
- Coherently (and for all scikit-learn versions), additional type/dtype verification has been implemented under the hood of `_unpack_batch`.
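The kind of dtype verification mentioned for `_unpack_batch` could look like the following sketch; the function name and signature here are illustrative, not declearn's actual code:

```python
import numpy as np

def coerce_batch_dtype(batch, dtype="float64"):
    """Hypothetical sketch: coerce batch inputs to the model's dtype.

    Inputs are cast to the configured weights dtype so that partial-fit
    steps do not silently mix precisions; labels keep their own dtype.
    """
    x, y = batch
    x = np.asarray(x, dtype=dtype)
    y = None if y is None else np.asarray(y)
    return x, y

x, y = coerce_batch_dtype(([1, 2, 3], [0, 1, 0]), dtype="float32")
# x is now a float32 array regardless of the input container's dtype.
```

With `scikit-learn >= 1.3`, passing float32 inputs to an un-initialized SGD model would then yield float32 weights.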
-
- Jun 08, 2023
- ANDREY Paul authored
- May 31, 2023
- ANDREY Paul authored
- May 23, 2023
- ANDREY Paul authored
- May 11, 2023
- ANDREY Paul authored
- ANDREY Paul authored
- ANDREY Paul authored
- ANDREY Paul authored
- Fix device-selection utils for Jax/Haiku.
- Ensure any GPU is left untouched when the device policy specifies that the CPU must be used.
- Fix instructions in both HaikuModel and its tests' setup code to achieve the former.
- Note: `jax.jit(..., device=<D>)` is used to prevent Haiku from using another device than the desired one; however, this argument is being deprecated, making this solution short-lived. Since the supported Jax version is hard-coded, however, we keep this patch for now, awaiting future framework evolutions.
- Add the use of the optional, custom `--cpu-only` flag in Haiku unit tests. This enables testing the former changes in an env that has a detectable GPU but an improper driver version (so that any undesired use of that GPU would make the tests fail).
- May 10, 2023
- ANDREY Paul authored
- ANDREY Paul authored
- ANDREY Paul authored
- ANDREY Paul authored
- ANDREY Paul authored
- ANDREY Paul authored
Minor gardening around the package See merge request !44
- ANDREY Paul authored
- ANDREY Paul authored
- First, run unit tests (which are the more problematic).
- Then, run code quality verifications (pylint / mypy / black).
- Finally, run integration tests.
The rationale for inverting the last two items relative to what was done prior to this update is to keep the longest-running tests for the end (rather than re-running a long pipeline due to late-detected formatting issues).
- ANDREY Paul authored
- ANDREY Paul authored