Monthly Archives: July 2020

Kubernetes: manually restart a Cron Job

27 July 2020
 

We have a Kubernetes Cron Job which failed on its last run. Let’s look for the root cause and then see how to restart such a failed job. List current jobs: Check the pods of the bttrm-apps-backend-reccuring-payment-cron Cron Job: The 1595844000-jzhrl pod failed; check its logs: Here is the actual cause of the issue: Failed to… Read More »

Prometheus: yet-another-cloudwatch-exporter – collecting AWS CloudWatch metrics

23 July 2020
 

Currently, to collect metrics from AWS CloudWatch we are using AWS’s own cloudwatch-exporter (see the post Prometheus: CloudWatch exporter – collecting metrics from AWS and graphs in Grafana, in Russian), but it has a few gaps: it’s written in Java, so it uses CPU/memory of the monitoring host; it doesn’t scrape AWS tags from resources; uses… Read More »