Tag Archives: monitoring

Prometheus: Alertmanager’s alerts receivers and routing based on severity level and tags

26 March 2019

We have three working environments – Dev, Stage, Production. Also, there are a bunch of alerts with different severities – info, warning и critical. For example: … – name: SSLexpiry.rules rules: – alert: SSLCertExpiring30days expr: probe_ssl_earliest_cert_expiry{job=”blackbox”} – time() < 86400 * 30 for: 10m labels: severity: info annotations: summary: “SSL certificate warning” description: “SSL certificate… Read More »

Prometheus: Alertmanager – send alerts to a “/dev/null”

26 March 2019

In addition to the Prometheus: Alertmanager’s alerts receivers and routing based on severity level and tags post. Have an Alertmanager config with routes. The task is – send all alerts from a Dev-environment into a “/dev/null”. To do this – create an empty receiver: … receivers: – name: ‘blackhole’ – name: ‘default’ slack_configs: – send_resolved:… Read More »

Monit: email alerting on an SSH logins

18 March 2019

The task is to send an email alert when SSH-login was made from a not whitelisted IPs. Will use Monit here. Install it: [simterm] root@jenkins-dev:/home/admin# apt update && apt -y install monit [/simterm] Configure email settings: set localhost (we have a local eximhere), email’s format and email’s receiver. Edit the /etc/monit/monitrc file: … set mailserver localhost… Read More »

Prometheus: RTFM blog monitoring set up with Ansible – Grafana, Loki, and promtail

10 March 2019

After implementing the Loki system on my job’s project – I decided to add it for myself, so see my RTFM blog server’s logs. Also – want to add the node_exporter and alertmanager, to be notified about high disk usage. In this post, I’ll describe the Prometheus, node_exporter, Grafana, Loki, and promtail set up process… Read More »

Prometheus: blackbox-exporter probe_http_status_code == 0 and its debug

6 March 2019

Today I decided to upgrade Grafana to already released version 6.0 and all other Docker images as well. Upgrade was successful – Loki eventually started displaying previously missed log-file names and other tags, just – immediately I got a bunch of CRITICAL alerts in our Slack from the blackbox-exporter which is used to check every… Read More »

Grafana Labs: Loki – distributed system, labels and filters

7 February 2019

The previous post – Grafana Labs: Loki – logs collecting and monitoring system. There Loki, promtail, and Grafana were configured on the same host in one Docker Compose stack. Now I want to try some distributed setup: Grafana will work on a one host Loki – on the other one Promtail – will collect logs from… Read More »

Grafana Labs: Loki – logs collecting and monitoring system

6 February 2019

Grafana 6.0 was still in Beta on Feb 2019, when the new feature was introduced – Loki, a log aggregation system available via another new ability – Explore. It’s similar to well-known ELK/EFK stack but more simple to set up and use and is intended to be used mostly with clouds and systems like Prometheus… Read More »

M	T	W	T	F	S	S
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30	31