RobotLearn Cluster monitor
Link: robotlearn.gitlabpages.inria.fr/cluster-monitor
Implementation overview
The cluster monitor consists of three components:
- The data fetcher runs directly on an Inria workstation (currently alya). It issues ssh
commands to other nodes (notably access1-cp) to gather the cluster state.
Code: rl_hm/data_fetcher/
- The backend server runs on a Linux server outside Inria and receives the data fetcher's
updates over a TCP socket connection tunneled through SSH.
It exposes a Socket.IO server that the web clients connect to.
As soon as it receives an update from the data fetcher, it pushes that update to all connected
clients through the Socket.IO connection.
Code: rl_hm/backend/
- Finally, the web clients run a JavaScript application that connects to the Socket.IO server
(backend) and updates the HTML page with the received data.
Code: public/
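
The data flow described above (fetcher sends a state update over TCP, backend receives it and pushes it to every connected client) can be sketched as follows. This is a minimal illustration and not the code in rl_hm/: the length-prefixed JSON framing, the function names (send_update, recv_update, broadcast) and the shape of the state dictionary are all assumptions, and plain Python callables stand in for the Socket.IO clients.

```python
import json
import socket
import struct

# Assumed wire format: 4-byte big-endian length prefix followed by a JSON payload.
HEADER = struct.Struct("!I")


def send_update(sock, state):
    """Fetcher side: serialize a cluster state and send it as one framed message."""
    payload = json.dumps(state).encode("utf-8")
    sock.sendall(HEADER.pack(len(payload)) + payload)


def _recv_exact(sock, n):
    """Read exactly n bytes, since a single recv() may return fewer."""
    chunks = []
    remaining = n
    while remaining > 0:
        chunk = sock.recv(remaining)
        if not chunk:
            raise ConnectionError("socket closed before the full message arrived")
        chunks.append(chunk)
        remaining -= len(chunk)
    return b"".join(chunks)


def recv_update(sock):
    """Backend side: read one framed message and decode it back into a dict."""
    (length,) = HEADER.unpack(_recv_exact(sock, HEADER.size))
    return json.loads(_recv_exact(sock, length).decode("utf-8"))


def broadcast(state, clients):
    """Backend side: push one update to every connected client.

    Each client is a callable here; in the real backend this role is
    played by the Socket.IO connections.
    """
    for push in clients:
        push(state)


if __name__ == "__main__":
    # A local socket pair stands in for the SSH-tunneled TCP connection.
    fetcher_end, backend_end = socket.socketpair()
    send_update(fetcher_end, {"nodes": {"node-1": "busy"}})  # hypothetical state
    update = recv_update(backend_end)
    broadcast(update, [print])
    fetcher_end.close()
    backend_end.close()
```

The length prefix is one common way to delimit messages on a stream socket, because TCP itself does not preserve message boundaries.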
Acknowledgments
- David Emukpere: for his extensive advice and help with web development and infrastructure.
- Tanguy Lepage: for the front-end HTML/CSS design.
- Anand Ballou: for his advice and help with the cluster data fetching.