Category Archives: Monitoring

Hardware, services and network monitoring systems

Loki: collecting logs from CloudWatch Logs using Lambda Promtail
0 (0)

20 May 2023

Collecting logs in Grafana Loki with Kubernetes is very simple – we just launch Promtail in DaemonSet, configure it to read all data from /var/logs – and that’s it (in fact, we don’t specify anything at all – everything works out of the box from the Helm chart). But what about CloudWatch Logs? On my new project,… Read More »

Loading

Prometheus: running Pushgateway on Kubernetes with Helm and Terraform
0 (0)

28 April 2023

We have a lot of AWS Lambda functions in the project, and developers want to be able to send metrics to our Prometheus to add their own alerts and graphs in Grafana. For this, the functions use the Prometheus library, which allows these metrics to be created (see Prometheus: Building a Custom Prometheus Exporter in… Read More »

Loading

GitLab: monitoring – Prometheus, metrics, and Grafana dashboard
0 (0)

12 March 2023

So, let’s continue our journey with migrating GitLab to Kubernetes. See previous parts: GitLab: Components, Architecture, Infrastructure, and Launching from the Helm Chart in Minikube GitLab: Helm chart of values, dependencies, and deployment in Kubernetes with AWS S3 GitLab: міграція даних з GitLab cloud та процес backup-restore у self-hosted версії в Kubernetes In general, everything is working,… Read More »

Loading

Grafana Loki: alerts from the Loki Ruler and labels from logs
0 (0)

12 March 2023

For general information on Grafana Loki, see the Grafana Loki: architecture and running in Kubernetes with AWS S3 storage and boltdb-shipper. Among other services that make up Loki, there is a separate service called ruler that is responsible for working with alerts that can be generated directly from logs. The idea is very simple: create… Read More »

Loading

Prometheus: Building a Custom Prometheus Exporter in Python
0 (0)

25 February 2023

Prometheus has many ready-to-use exporters, but sometimes you may need to collect your own metrics. For this, Prometheus provides client libraries that we can use to generate metrics with the necessary labels. Such an exporter can be included directly in the code of your application, or it can be run as a separate service that will poll one… Read More »

Loading

Kubernetes: Cluster Cost Monitoring – Kubernetes Resource Report and Kubecost
0 (0)

29 January 2023

The very useful thing is to monitor how efficiently the cluster is being used, especially if applications are deployed by developers who do not delve into requests much and set inflated values ​​”in reserve”. A reserve, of course, is needed – but simply requesting resources is a bad idea. For example, you have a WorkerNode… Read More »

Loading

Grafana Loki: alerts from Ruler and labels from logs
0 (0)

7 January 2023

For general information about Grafana Loki, see the Grafana Loki: architecture and running in Kubernetes with AWS S3 storage and boltdb-shipper post. Among other components of the Loki, there is a separate service called ruler that is responsible for working with alerts that can be generated directly from logs. The idea is very simple: create… Read More »

Loading

Grafana Loki: architecture and running in Kubernetes with AWS S3 storage and boltdb-shipper
0 (0)

7 January 2023

The last time I worked with Loki when it was still in Beta, and it looked much simpler then than it does now. In the new project, there is no logging system at all, and since we all love the Grafana stack, we also decided to use Loki for logging. Although to be honest, I… Read More »

Loading

Prometheus: Kubernetes endpoints monitoring with blackbox-exporter
0 (0)

11 December 2022

The blackbox-exporter is an exporter that can monitor various endpoints – URLs on the Internet, your LoadBalancers in AWS, or Services in a Kubernetes cluster, such as MySQL or PostgreSQL databases. Blackbox Exporter can give you HTTP response time statistics, response codes, information on SSL certificates, etc. What are we going to do in this… Read More »

Loading