Category Archives: Prometheus

Prometheus, a Cloud Native Computing Foundation project, is a systems and service monitoring system. It collects metrics from configured targets at given intervals, evaluates rule expressions, displays the results, and can trigger alerts if some condition is observed to be true.

PostgreSQL: PostgreSQL Operator for Kubernetes, and its Prometheus monitoring
0 (0)

23 September 2022

So, we’ve launched Druid, see Apache Druid: Overview, Running in Kubernetes, and Monitoring with Prometheus . So far, a local Apache Derby database is used as the default storage for metadata . Next, we will switch Druid to PostgreSQL, and later we will remove ZooKeeper from the cluster setup. To begin with, let’s start a PostgreSQL cluster in Kubernetes, add… Read More: PostgreSQL: PostgreSQL Operator for Kubernetes, and its Prometheus monitoring0 (0) »

Apache Druid: overview, running in Kubernetes and monitoring with Prometheus
0 (0)

18 September 2022

Apache Druid is a columnar database, focused on working with large amounts of data, combining the features and benefits of Time-Series Database, Data Warehouse, and a search engine. The general task is to set up monitoring of the Druid cluster in Kubernetes, so at first, we will see what it is in general and how… Read More: Apache Druid: overview, running in Kubernetes and monitoring with Prometheus0… »

AWS: Simple Email Service Bounce rate and monitoring with and Prometheus
0 (0)

14 July 2021

Recently, AWS blocked our AWS Simple Email Service because of its low bounce rate. This can be checked in the AWS SES > Reputation Dashboard, our account currently has Under review status: After we’ve connected AWS Tech Support, they enabled it back, but we must solve the issue asap, and have to monitor AWS SES… Read More: AWS: Simple Email Service Bounce rate and monitoring with and… »

Kubernetes: namespace hangs in Terminating and metrics-server non-obviousness
0 (0)

1 April 2021

Faced with a very interesting thing during removal of a Kubernetes Namespace. After a kubectl delete namespace NAMESPACE is executed, the namespace hangs in the Terminating state, and any attempt to forcibly remove it didn’t help. First, let’s see how such a force-removal can be done, and then will check the real cause and a… Read More: Kubernetes: namespace hangs in Terminating and metrics-server non-obviousness0 (0) »

Prometheus: Alertmanager Web UI alerts Silence
0 (0)

26 January 2021

Active alerts sending frequency via Alertmanager is configured via the repeat_interval in the /etc/alertmanager/config.yml file. We have this interval set to 15 minutes, and as result, we have notifications about alerts in our Slack each fifteen minutes. Still, some alerts are such a “known issues”, when we already started the investigation or fixing it, but… Read More: Prometheus: Alertmanager Web UI alerts Silence0 (0) »

Kubernetes: a cluster’s monitoring with the Prometheus Operator
0 (0)

13 August 2020

Continuing with the Kubernetes: monitoring with Prometheus – exporters, a Service Discovery, and its roles, where we configured Prometheus manually to see how it’s working – now, let’s try to use Prometheus Operator installed via Helm chart. So, the task is spin up a Prometheus server and all necessary exporter in an AWS Elastic Kubernetes… Read More: Kubernetes: a cluster’s monitoring with the Prometheus Operator0 (0) »

Kubernetes: HorizontalPodAutoscaler – an overview with examples
0 (0)

12 August 2020

Kubernetes HorizontalPodAutoscaler automatically scales Kubernetes Pods under ReplicationController, Deployment, or ReplicaSet controllers basing on its CPU, memory, or other metrics. It was shortly discussed in the Kubernetes: running metrics-server in AWS EKS for a Kubernetes Pod AutoScaler post, now let’s go deeper to check all options available for scaling. For HPA you can use three… Read More: Kubernetes: HorizontalPodAutoscaler – an overview with examples0 (0) »

Prometheus: yet-another-cloudwatch-exporter – collecting AWS CloudWatch metrics
0 (0)

23 July 2020

Currently, to collect metrics from the AWS CloudWatch we are using AWS’s own cloudwatch-exporter, see the Prometheus: CloudWatch exporter — сбор метрик из AWS и графики в Grafana post (in Rus), but it has a few gaps: it’s written in Java, so uses CPU/memory of the monitoring host doesn’t scrapes AWS tags from resources uses… Read More: Prometheus: yet-another-cloudwatch-exporter – collecting AWS CloudWatch metrics0 (0) »

Kubernetes: monitoring with Prometheus – exporters, a Service Discovery, and its roles
0 (0)

26 April 2020

The next task with our Kubernetes cluster is to set up its monitoring with Prometheus. This task is complicated by the fact, that there is the whole bunch of resources needs to be monitored: from the infrastructure side – ЕС2 WokerNodes instances, their CPU, memory, network, disks, etc key services of Kubernetes itself – its… Read More: Kubernetes: monitoring with Prometheus – exporters, a Service Discovery, and… »

Grafana: Loki – the LogQL’s Prometheus-like counters, aggregation functions and dnsmasq’s requests graphs
0 (0)

17 November 2019

The last time I configured Loki for logs collecting and monitoring was in February 2019 – almost a year ago, see the Grafana Labs: Loki – logs collecting and monitoring system post, when Loki was in its Beta state. Now we faced with outgoing traffic issues in our Production environments and can’t find who guilty for… Read More: Grafana: Loki – the LogQL’s Prometheus-like counters, aggregation functions and… »