Site Reliability Engineer - FinOps
Some real use cases that we did in the past:
- Analysis and scripting automation to match the resource demand with the requests of the services in the Kubernetes clusters;
- Refactory of mission-critic workflows with low scalability;
- Assess and propose architecture changes cross-company to promote simplicity and reduce resource waste;
You will:
- Design, analyze and maintain reports and dashboards about cloud usage;
- Divise ways of organizing and presenting resource consumption and cost metrics to the engineering teams;
- Identify and correct performance problems alongside engineers;
- Colabore with SRE, Data and Engineer Teams to understand the cloud usage and improvements;
What we are looking for:
- Experience with Cloud environment costs;
- Knowledge to query, analyze and summarize data;
- Knowledge of SQL databases;
- Knowledge of Kubernetes architecture (Pods, Containers, Namespace, etc)
- Experience in infrastructure as code (Terraform, Crossplane and/or Pulumi).
You will stand out if:
- Experience with AWS Athena & AWS Cost Explorer
- Knowledge of advanced Kubernetes architecture(resource allocation, scheduler, auto scaler, etc);
- Knowledge of Presto or Trino;
- Knowledge of programming (we like Python and Golang);
- Experience with observability stack (Prometheus, Grafana, OpenTelemetry)
Empresa: BairesDev
Trabalhe de Casa Arquiteto Python / Ref. 0071P
Contratação: Integral
title
Empresa: Grupo Primo
Front-end Engineer Pleno
Contratação: Integral
title