Troubleshooting
The docs cover troubleshooting of prometheus & grafana issues on server side. i.e if any service is down then how you can check configs and logs on server.
Basics Debugging
There are 3 services running on prometheus server. i.e prometheus, grafana, node exporter.
Configuration Files
- Prometheus:
/etc/prometheus/prometheus.yml- Grafana:
/etc/grafana/grafana.ini- Node exporter:
/etc/systemd/system/node_exporter.service
To check the status of services use below commands.
- Prometheus:
sudo systemctl status prometheus.service- Grafana:
sudo systemctl status grafana-server.service- Node exporter:
sudo systemctl status node_exporter.service
To check the logs of services use below commands.
- Prometheus:
journalctl -u prometheus -n100 -f- Grafana:
tail -f /var/log/grafana/grafana.log- Node exporter:
journalctl -u node_exporter -n100 -f
Common Scenario
- If you see any issues like prometheus or granfana is not working.
- Then login into the peomtheus server & check logs and service status of prometheus and grafana.
- Then check disk,mem,cpu utilization as well.
- If prometheus is down due to high disk utilisation, then consult with team/lead first for appropriate action.