Monitoring Infrastructure
Overview
The monitoring infrastructure gives EFPF administrators and service providers an overview of running services and their status. It mainly provides the functionalities described below.
Runtime Metrics
Runtime metrics are collected from containers running on different hosts. These metrics mainly cover the resource and network bandwidth usage of the running containers.
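As a rough illustration of how network bandwidth usage can be derived from collected metrics, the sketch below turns two readings of cumulative byte counters into a rate. The `Sample` type, its field names, and the `bandwidth` function are hypothetical and do not describe the collector actually used in EFPF; the sketch only assumes that a container exposes cumulative received and transmitted byte counts.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass
class Sample:
    """One reading of a container's cumulative network counters (hypothetical shape)."""
    timestamp: float  # seconds since some fixed reference point
    rx_bytes: int     # total bytes received so far
    tx_bytes: int     # total bytes transmitted so far


def bandwidth(prev: Sample, curr: Sample) -> Tuple[float, float]:
    """Return (receive, transmit) rates in bytes per second between two samples."""
    elapsed = curr.timestamp - prev.timestamp
    if elapsed <= 0:
        raise ValueError("samples must be in increasing time order")
    rx_rate = (curr.rx_bytes - prev.rx_bytes) / elapsed
    tx_rate = (curr.tx_bytes - prev.tx_bytes) / elapsed
    return rx_rate, tx_rate
```

Working with differences between cumulative counters, rather than instantaneous readings, keeps the result independent of how often the samples are taken.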
Alerting
Alerting lets administrators and maintainers learn about changes in service status without manually watching the status visualizations. Administrators can create different alerting rules and receive notifications through different channels.
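The idea of rules and notification channels can be sketched as follows. This is a minimal, illustrative example rather than the alerting implementation used in EFPF: the `AlertRule` type, the `evaluate_rules` function, and the metric and channel names are all assumptions made for the sake of the example. Each rule names a metric, a threshold, and the channels to notify when the threshold is exceeded.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class AlertRule:
    """A threshold rule on a single metric (hypothetical shape)."""
    name: str
    metric: str          # key to look up in the metrics dictionary
    threshold: float     # the rule fires when the value is above this
    channels: List[str]  # names of the channels to notify


def evaluate_rules(
    rules: List[AlertRule],
    metrics: Dict[str, float],
    notifiers: Dict[str, Callable[[str], None]],
) -> List[str]:
    """Check every rule against current metrics and notify its channels.

    Returns the names of the rules that fired.
    """
    fired = []
    for rule in rules:
        value = metrics.get(rule.metric)
        # Skip rules whose metric is missing or within the threshold.
        if value is None or value <= rule.threshold:
            continue
        message = f"[{rule.name}] {rule.metric}={value} exceeds {rule.threshold}"
        for channel in rule.channels:
            send = notifiers.get(channel)
            if send is not None:  # unknown channel names are ignored
                send(message)
        fired.append(rule.name)
    return fired
```

A caller would supply one function per channel, for example one that sends an email and another that posts to a chat room, and run `evaluate_rules` each time fresh metrics arrive.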
Distributed Logging
Logging gives insight into the running applications and their behaviour. Service providers can use the logs to diagnose the causes of malfunctions and to get an overview of client behaviour.