batsim issueshttps://gitlab.inria.fr/batsim/batsim/-/issues2017-09-26T12:04:41+02:00https://gitlab.inria.fr/batsim/batsim/-/issues/18Make the PFS model more generic so we don't have 2 different parametters2017-09-26T12:04:41+02:00MERCIER MichaelMake the PFS model more generic so we don't have 2 different paramettershttps://gitlab.inria.fr/batsim/batsim/-/issues/17Add a distributed file system (DFS) profile2019-08-12T10:25:21+02:00MERCIER MichaelAdd a distributed file system (DFS) profilehttps://gitlab.inria.fr/batsim/batsim/-/issues/16Find a better way to handle scheduler process lifecycle (start, stop, sync wi...2018-02-28T11:33:49+01:00MERCIER MichaelFind a better way to handle scheduler process lifecycle (start, stop, sync with batsim...)https://gitlab.inria.fr/batsim/batsim/-/issues/14Put the CI on a real server2017-09-27T17:29:49+02:00MERCIER MichaelPut the CI on a real serverNot on Millian's laptop...Not on Millian's laptop...Millian PoquetMillian Poquethttps://gitlab.inria.fr/batsim/batsim/-/issues/13Exec*: completely remove execo2018-02-28T11:37:46+01:00Millian PoquetExec*: completely remove execohttps://gitlab.inria.fr/batsim/batsim/-/issues/12Uncaught exception when an allocation is empty2017-06-06T15:07:17+02:00MERCIER MichaelUncaught exception when an allocation is emptyI have the following exception in batsim when I send an empty allocation: It is the scheduler's fault but still the error message should be better:
```
[master_host0:Scheduler REQ-REP:(497) 7603.518654] [network/INFO] Received '{"now":...I have the following exception in batsim when I send an empty allocation: It is the scheduler's fault but still the error message should be better:
```
[master_host0:Scheduler REQ-REP:(497) 7603.518654] [network/INFO] Received '{"now": 7603.518654, "events": [{"timestamp": 7603.518654, "type": "EXECUTE_JOB", "data": {"job_id": "ec4853!136", "alloc": "8 9 10 11
12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71"}}, {"timestamp": 7603.518654,
"type": "EXECUTE_JOB", "data": {"job_id": "ec4853!141", "alloc": "72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87"}}, {"timestamp": 7603.518654, "type": "EXECUTE_JOB", "data": {"job_id": "ec4853!144", "alloc": "
"}}]}'
[master_host0:server:(2) 7603.519254] [server/INFO] Server received a message of type SCHED_EXECUTE_JOB:
[a8:job_ec4853!136:(498) 7603.519254] [jobs_execution/INFO] Sleeping the whole task length
[master_host0:Scheduler REQ-REP:(497) 7603.519854] /tmp/nix-build-simgrid-batsim.drv-0/simgrid/src/xbt/exception.cpp:47: [xbt_exception/CRITICAL] Uncaught exception std::invalid_argument: stoi
[master_host0:Scheduler REQ-REP:(497) 7603.519854] /tmp/nix-build-simgrid-batsim.drv-0/simgrid/src/xbt/exception.cpp:80: [xbt_exception/CRITICAL] Current backtrace:
[master_host0:Scheduler REQ-REP:(497) 7603.519854] /tmp/nix-build-simgrid-batsim.drv-0/simgrid/src/xbt/exception.cpp:82: [xbt_exception/CRITICAL] -> ???9366a at /nix/store/fwm5shhj7p6rrvpdaz5zv202zvbdcbp1-simg
rid-batsim/lib/libsimgrid.so.3.13.91(_ZN7simgrid3xbt9backtraceEv+0x2f) [0x7fdec35b8def], 0x7fdec35b8def
[master_host0:Scheduler REQ-REP:(497) 7603.519854] /tmp/nix-build-simgrid-batsim.drv-0/simgrid/src/xbt/exception.cpp:82: [xbt_exception/CRITICAL] -> ???9366a at /nix/store/fwm5shhj7p6rrvpdaz5zv202zvbdcbp1-simg
rid-batsim/lib/libsimgrid.so.3.13.91(+0x20ae28) [0x7fdec36e2e28], 0x7fdec36e2e28
[master_host0:Scheduler REQ-REP:(497) 7603.519854] /tmp/nix-build-simgrid-batsim.drv-0/simgrid/src/xbt/exception.cpp:82: [xbt_exception/CRITICAL] -> ???9366a at /nix/store/f0pvl01br5rm76kf2f9i8iapi01i8nr8-gcc-
6.3.0-lib/lib/libstdc++.so.6(+0x8c956) [0x7fdec1627956], 0x7fdec1627956
[master_host0:Scheduler REQ-REP:(497) 7603.519854] /tmp/nix-build-simgrid-batsim.drv-0/simgrid/src/xbt/exception.cpp:82: [xbt_exception/CRITICAL] -> ???9366a at /nix/store/f0pvl01br5rm76kf2f9i8iapi01i8nr8-gcc-
6.3.0-lib/lib/libstdc++.so.6(+0x8c9a1) [0x7fdec16279a1], 0x7fdec16279a1
[master_host0:Scheduler REQ-REP:(497) 7603.519854] /tmp/nix-build-simgrid-batsim.drv-0/simgrid/src/xbt/exception.cpp:82: [xbt_exception/CRITICAL] -> ???9366a at /nix/store/f0pvl01br5rm76kf2f9i8iapi01i8nr8-gcc-
6.3.0-lib/lib/libstdc++.so.6(+0x8cbb8) [0x7fdec1627bb8], 0x7fdec1627bb8
[master_host0:Scheduler REQ-REP:(497) 7603.519854] /tmp/nix-build-simgrid-batsim.drv-0/simgrid/src/xbt/exception.cpp:82: [xbt_exception/CRITICAL] -> ???9366a at /nix/store/f0pvl01br5rm76kf2f9i8iapi01i8nr8-gcc-
6.3.0-lib/lib/libstdc++.so.6(_ZSt24__throw_invalid_argumentPKc+0x3f) [0x7fdec165010f], 0x7fdec165010f
[master_host0:Scheduler REQ-REP:(497) 7603.519854] /tmp/nix-build-simgrid-batsim.drv-0/simgrid/src/xbt/exception.cpp:82: [xbt_exception/CRITICAL] -> ???9366a at /nix/store/5c6a0nfzsv73sf6hji6q76hf2ig9w86a-bats
im/bin/batsim(_ZN9__gnu_cxx6__stoaIlicJiEEET0_PFT_PKT1_PPS3_DpT2_EPKcS5_PmS9_+0xc8) [0x446318], 0x446318
[master_host0:Scheduler REQ-REP:(497) 7603.519854] /tmp/nix-build-simgrid-batsim.drv-0/simgrid/src/xbt/exception.cpp:82: [xbt_exception/CRITICAL] -> ???9366a at /nix/store/5c6a0nfzsv73sf6hji6q76hf2ig9w86a-bats
im/bin/batsim(_ZN12MachineRange18from_string_hyphenERKNSt7__cxx1112basic_stringIcSt11char_traitsIcESaIcEEES7_S7_S7_+0x70e) [0x479cbe], 0x479cbe
[master_host0:Scheduler REQ-REP:(497) 7603.519854] /tmp/nix-build-simgrid-batsim.drv-0/simgrid/src/xbt/exception.cpp:82: [xbt_exception/CRITICAL] -> ???9366a at /nix/store/5c6a0nfzsv73sf6hji6q76hf2ig9w86a-bats
im/bin/batsim(_ZN18JsonProtocolReader18handle_execute_jobEidRKN9rapidjson12GenericValueINS0_4UTF8IcEENS0_19MemoryPoolAllocatorINS0_12CrtAllocatorEEEEE+0x62a) [0x49366a], 0x49366a
Segmentation fault.
```
It seems to be an uncaught exception from stoi that is not able to convert the input:
https://stackoverflow.com/a/31928967/2165830Millian PoquetMillian Poquethttps://gitlab.inria.fr/batsim/batsim/-/issues/11CI: add sonar2017-09-27T22:47:55+02:00Millian PoquetCI: add sonarMillian PoquetMillian Poquethttps://gitlab.inria.fr/batsim/batsim/-/issues/10Clean server function: create subfunctions for each case?2018-10-27T23:00:25+02:00Millian PoquetClean server function: create subfunctions for each case?Millian PoquetMillian Poquethttps://gitlab.inria.fr/batsim/batsim/-/issues/9Debug OBFH2021-05-20T17:47:58+02:00Millian PoquetDebug OBFHBatsim 3.0Millian PoquetMillian Poquethttps://gitlab.inria.fr/batsim/batsim/-/issues/8Exec* : dump instances info more often2018-02-28T11:37:14+01:00Millian PoquetExec* : dump instances info more oftenMillian PoquetMillian Poquethttps://gitlab.inria.fr/batsim/batsim/-/issues/7Separate exec1, execN in another repo2018-04-19T18:10:56+02:00Millian PoquetSeparate exec1, execN in another repoMillian PoquetMillian Poquethttps://gitlab.inria.fr/batsim/batsim/-/issues/6Take EXECUTE_JOB mapping into account2017-08-31T15:17:37+02:00Millian PoquetTake EXECUTE_JOB mapping into accounthttps://gitlab.inria.fr/batsim/batsim/-/issues/5Allows the Decision Process to attach user-defined metadata to jobs2017-11-21T01:30:21+01:00Millian PoquetAllows the Decision Process to attach user-defined metadata to jobsAdd in the protocol the possibility for the decision process to set user defined metadata about the jobs executed.
Write these metadata in the ``*_jobs.csv`` output file in a ``metadata`` column.Add in the protocol the possibility for the decision process to set user defined metadata about the jobs executed.
Write these metadata in the ``*_jobs.csv`` output file in a ``metadata`` column.https://gitlab.inria.fr/batsim/batsim/-/issues/4Batsim resources unit should be core instead of host2022-01-20T12:24:57+01:00Millian PoquetBatsim resources unit should be core instead of hostFor multiple reasons:
- Most of the existing workloads (SWF) contain core base resource request
- Node level is detailed enough to see core based allocation when using time sharing
- On current HPC centers, the request of resources is do...For multiple reasons:
- Most of the existing workloads (SWF) contain core base resource request
- Node level is detailed enough to see core based allocation when using time sharing
- On current HPC centers, the request of resources is done by core (even if is ceiled depending on the number of core per nodes because there is no time sharing)
- Batsim workload that contains node resource requests are not portable across platforms especially for SMPI applications that are modeled at the core level
The list of feature and changes we need to implement this:
- [ ] change Batsim output to cores level (interval set of resources at core level) while keeping node information (and at the node level to be able to keep heterogeneity information).
- [ ] support new resource requests like `nb_core` and `nb_host` and remove the ambiguous `nb_res`.
- [ ] make the mapping work at core level (not sure of this... @mpoquet?)5.0.0https://gitlab.inria.fr/batsim/batsim/-/issues/3Allow jobs to be executed on specific pstates2018-11-14T01:02:41+01:00Millian PoquetAllow jobs to be executed on specific pstatesMillian PoquetMillian Poquethttps://gitlab.inria.fr/batsim/batsim/-/issues/2JSON protocol (if performance isn't sacrificed)2017-05-26T15:23:52+02:00Millian PoquetJSON protocol (if performance isn't sacrificed)Millian PoquetMillian Poquethttps://gitlab.inria.fr/batsim/batsim/-/issues/1Compare parsing performance (boost vs rapidjson)2017-05-26T15:23:52+02:00Millian PoquetCompare parsing performance (boost vs rapidjson)Millian PoquetMillian Poquet