Tag Archives: monitoring

Kubernetes: vertical Pods scaling with Vertical Pod Autoscaler
0 (0)

1 May 2023

In addition to the Horizontal Pod Autoscaler (HPA), which creates additional pods if the existing ones start using more CPU/Memory than configured in the HPA limits, there is also the Vertical Pod Autoscaler (VPA), which works according to a different scheme: instead of horizontal scaling, i.e. increasing the number of Pods, it changes resources.requests of a Pod, which… Read More: Kubernetes: vertical Pods scaling with Vertical Pod Autoscaler0 (0) »

Loading

Prometheus: running Pushgateway on Kubernetes with Helm and Terraform
0 (0)

28 April 2023

We have a lot of AWS Lambda functions in the project, and developers want to be able to send metrics to our Prometheus to add their own alerts and graphs in Grafana. For this, the functions use the Prometheus library, which allows these metrics to be created (see Prometheus: Building a Custom Prometheus Exporter in… Read More: Prometheus: running Pushgateway on Kubernetes with Helm and Terraform0 (0) »

Loading

GitLab: gitlab-shell timeouts, and /metrics Connection refused
0 (0)

25 April 2023

After running our self-hosted GitLab in production, we faced a bug: during git clone/pull/push operations, the request sometimes hung for 1-2 minutes. It looked like some kind of “floating” bug, that is, it could normally work 5 times, and then hangs once. The issues gitlab-shell timeouts For example, one time git clone works well: [simterm]… Read More: GitLab: gitlab-shell timeouts, and /metrics Connection refused0 (0) »

Loading

GitLab: monitoring – Prometheus, metrics, and Grafana dashboard
0 (0)

12 March 2023

So, let’s continue our journey with migrating GitLab to Kubernetes. See previous parts: GitLab: Components, Architecture, Infrastructure, and Launching from the Helm Chart in Minikube GitLab: Helm chart of values, dependencies, and deployment in Kubernetes with AWS S3 GitLab: міграція даних з GitLab cloud та процес backup-restore у self-hosted версії в Kubernetes In general, everything is working,… Read More: GitLab: monitoring – Prometheus, metrics, and Grafana dashboard0 (0) »

Loading

Grafana Loki: LogQL for logs and creating metrics for alerts
0 (0)

12 March 2023

Good – we learned how to launch Loki – Grafana Loki: architecture and running in Kubernetes with AWS S3 storage and bolted-shipper, we also figured out how to configure alerts – Grafana Loki: alerts from the Loki Ruler and labels from logs. Now it’s time to figure out what we can do in Loki using its… Read More: Grafana Loki: LogQL for logs and creating metrics for alerts0… »

Loading

Grafana Loki: alerts from the Loki Ruler and labels from logs
0 (0)

12 March 2023

For general information on Grafana Loki, see the Grafana Loki: architecture and running in Kubernetes with AWS S3 storage and boltdb-shipper. Among other services that make up Loki, there is a separate service called ruler that is responsible for working with alerts that can be generated directly from logs. The idea is very simple: create… Read More: Grafana Loki: alerts from the Loki Ruler and labels from… »

Loading

Prometheus: Building a Custom Prometheus Exporter in Python
0 (0)

25 February 2023

Prometheus has many ready-to-use exporters, but sometimes you may need to collect your own metrics. For this, Prometheus provides client libraries that we can use to generate metrics with the necessary labels. Such an exporter can be included directly in the code of your application, or it can be run as a separate service that will poll one… Read More: Prometheus: Building a Custom Prometheus Exporter in Python0 (0) »

Loading

Kubernetes: Cluster Cost Monitoring – Kubernetes Resource Report and Kubecost
0 (0)

29 January 2023

The very useful thing is to monitor how efficiently the cluster is being used, especially if applications are deployed by developers who do not delve into requests much and set inflated values ​​”in reserve”. A reserve, of course, is needed – but simply requesting resources is a bad idea. For example, you have a WorkerNode… Read More: Kubernetes: Cluster Cost Monitoring – Kubernetes Resource Report and Kubecost0… »

Loading

Elastic Stack: an overview and ELK installation on Ubuntu 20.04
0 (0)

22 February 2022

The last time I’ve worked with the ELK stack about 7 years ago, see the ELK: установка Elasticsearch+Logstash+Kibana на CentOS. Currently, we are using Logz.io, but its costs going higher and higher, so we started looking at the self-hosted ELK solution to be running on our AWS Elastic Kubernetes Service clusters. So, the task, for… Read More: Elastic Stack: an overview and ELK installation on Ubuntu 20.040… »

Loading

AWS: Trusted Advisor, part 2 – CloudWatch Alarms and Slack notifications
0 (0)

29 November 2021

Continuing with the theme of the AWS Trusted Advisor service, let’s see how can we configure notifications about new findings. The first part: AWS: Trusted Advisor, part 1 – its features overview. To have notifications, we can use Trusted Advisor metrics from the CloudWatch, see their list on the Trusted Advisor metrics and dimensions page.… Read More: AWS: Trusted Advisor, part 2 – CloudWatch Alarms and Slack… »

Loading