Category Archives: Monitoring

Hardware, services and network monitoring systems

Logz.io: collection logs from Kubernetes – fluentd vs filebeat
0 (0)

1 February 2021

We are using Logz.io to collect our Kubernetes cluster logs (also, there is a local Loki instance). Logs are collected and processed by a Fluentd pod on every WorkerNode which are deployed from a DaemonSet in its default configuration, see the documentation here – logzio-k8s. The problem we faced is that those pods are consuming… Read More: Logz.io: collection logs from Kubernetes – fluentd vs filebeat0 (0) »

Prometheus: Alertmanager Web UI alerts Silence
0 (0)

26 January 2021

Active alerts sending frequency via Alertmanager is configured via the repeat_interval in the /etc/alertmanager/config.yml file. We have this interval set to 15 minutes, and as result, we have notifications about alerts in our Slack each fifteen minutes. Still, some alerts are such a “known issues”, when we already started the investigation or fixing it, but… Read More: Prometheus: Alertmanager Web UI alerts Silence0 (0) »

Linux: LEMP set up – NGINX, PHP, MySQL, SSL, monitoring, logs, and a WordPress blog migration
0 (0)

6 November 2020

Finally got time to migrate the RTFM.CO.UA blog to a new server with Debian 10. This time manually, without any automation will set up a LEMP stack Wrote a similar at 2016 – Debian: установка LEMP — NGINX + PHP-FPM + MariaDB (Rus), but in time the post is more complete of the process and… Read More: Linux: LEMP set up – NGINX, PHP, MySQL, SSL, monitoring,… »

AWS Elastic Kubernetes Service: load-testing and high-load tuning – problems and solutions
0 (0)

4 September 2020

Actually, this post was planned as a short note about using NodeAffinity for Kubernetes Pod: But then, as often happens, after starting writing about one thing, I faced another, and then another one, and as a result – I made this long-read post about Kubernetes load-testing. So, I’ve started about NodeAffinity, but then wondered how… Read More: AWS Elastic Kubernetes Service: load-testing and high-load tuning – problems… »

Kubernetes: a cluster’s monitoring with the Prometheus Operator
0 (0)

13 August 2020

Continuing with the Kubernetes: monitoring with Prometheus – exporters, a Service Discovery, and its roles, where we configured Prometheus manually to see how it’s working – now, let’s try to use Prometheus Operator installed via Helm chart. So, the task is spin up a Prometheus server and all necessary exporter in an AWS Elastic Kubernetes… Read More: Kubernetes: a cluster’s monitoring with the Prometheus Operator0 (0) »

Kubernetes: HorizontalPodAutoscaler – an overview with examples
0 (0)

12 August 2020

Kubernetes HorizontalPodAutoscaler automatically scales Kubernetes Pods under ReplicationController, Deployment, or ReplicaSet controllers basing on its CPU, memory, or other metrics. It was shortly discussed in the Kubernetes: running metrics-server in AWS EKS for a Kubernetes Pod AutoScaler post, now let’s go deeper to check all options available for scaling. For HPA you can use three… Read More: Kubernetes: HorizontalPodAutoscaler – an overview with examples0 (0) »

Prometheus: yet-another-cloudwatch-exporter – collecting AWS CloudWatch metrics
0 (0)

23 July 2020

Currently, to collect metrics from the AWS CloudWatch we are using AWS’s own cloudwatch-exporter, see the Prometheus: CloudWatch exporter — сбор метрик из AWS и графики в Grafana post (in Rus), but it has a few gaps: it’s written in Java, so uses CPU/memory of the monitoring host doesn’t scrapes AWS tags from resources uses… Read More: Prometheus: yet-another-cloudwatch-exporter – collecting AWS CloudWatch metrics0 (0) »

Kubernetes: monitoring with Prometheus – exporters, a Service Discovery, and its roles
0 (0)

26 April 2020

The next task with our Kubernetes cluster is to set up its monitoring with Prometheus. This task is complicated by the fact, that there is the whole bunch of resources needs to be monitored: from the infrastructure side – ЕС2 WokerNodes instances, their CPU, memory, network, disks, etc key services of Kubernetes itself – its… Read More: Kubernetes: monitoring with Prometheus – exporters, a Service Discovery, and… »

Redis: “psync scheduled to be closed ASAP for overcoming of output buffer limits” and the client-output-buffer-limit
0 (0)

26 February 2020

We have a Redis-cluster with Master-slave replication and Sentinel, see the Redis: replication, part 2 – Master-Slave replication, and Redis Sentinel, Redis: fork – Cannot allocate memory, Linux, virtual memory and vm.overcommit_memory, and Redis: main configuration parameters and performance tuning overview posts. The system worked great until we started using it much more actively. Redis… Read More: Redis: “psync scheduled to be closed ASAP for overcoming of… »

Kubernetes: running metrics-server in AWS EKS for a Kubernetes Pod AutoScaler
0 (0)

15 February 2020

Assuming, we already have an AWS EKS cluster with worker nodes. In this post – we will connect to a newly created cluster, will create a test deployment with an HPA – Kubernetes Horizontal Pod AutoScaler and will try to get information about resources usage using kubectl top. Kubernetes cluster Create a test cluster using… Read More: Kubernetes: running metrics-server in AWS EKS for a Kubernetes Pod… »

M	T	W	T	F	S	S
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30	31