TODO: faire article ! Indiquer avantages/inconvenient, type de monitoring (hard, services, processings), alert/etat courant, graphes etc...

Managing such a cluster requires some tools to raise alerts when anomalies occur (hardware failures or system issues) but also to investigate current or past state of the cluster. Here are a few tools we use at Cersat.

  • Nagios

  • Munin (deprecated: pourquoi !?)

  • Ganglia

  • OpenTSDB

  • Homemade tools


Comments

comments powered by Disqus