Site Reliability Engineer (SRE)
Descrição da Vaga
**Site Reliability Engineer (SRE)** ----------------------------------- **Location:** Brazil, Colombia, Guatemala, Dominican Republic, Ecuador, Honduras, Mexico, El Salvador, Puerto Rico, Nicaragua **Industry:** Technology \& IT ### **About the Role** We are looking for a **Site Reliability Engineer (SRE)** to build, scale, and maintain a secure, reliable, and fully automated infrastructure platform. This role partners closely with engineering, data, and security teams to ensure production systems are resilient, observable, and deployed with zero manual intervention. You will play a key role in infrastructure consolidation initiatives, automation strategy, and compliance efforts, including **SOC 2 Type 2**, while laying the groundwork for long\-term operational excellence. ### **Key Responsibilities** **Infrastructure \& Automation** * Design, implement, and maintain Infrastructure as Code (IaC) using **Terraform** for AWS and Snowflake * Enable consistent and auditable cloud environments (dev, staging, production, UAT, sandbox) * Automate infrastructure provisioning and deployments using **GitOps** practices **CI/CD \& Deployment** * Build and maintain end\-to\-end **CI/CD pipelines** with zero manual deployment steps * Drive 100% automated deployments for production environments * Integrate automated testing, security scanning, and compliance checks **Observability \& Monitoring** * Implement monitoring, logging, and alerting using modern observability tools * Define and manage **SLIs, SLOs, and error budgets** * Build dashboards for system health, performance, and cost visibility **Reliability \& Disaster Recovery** * Design and maintain disaster recovery strategies for internal and customer\-facing systems * Participate in and improve regular DR drills **Data Infrastructure** * Collaborate with data engineering to migrate Snowflake pipelines to **dbt** * Implement data quality testing frameworks * Manage Snowflake databases, permissions, and environment isolation **Incident Management** * Participate in on\-call rotation * Lead incident response for infrastructure and platform issues * Conduct blameless post\-mortems and drive reliability improvements ### **Additional Responsibilities** * Maintain clear and detailed infrastructure documentation (architecture, runbooks, procedures) * Mentor engineers on infrastructure and reliability best practices * Support security initiatives related to **SOC 2 Type 2**, including IAM and secrets management * Stay current through self\-education, writing, and conference participation ### **Required Skills \& Qualifications** **Infrastructure as Code** * Expert\-level experience with **Terraform** or equivalent tools * Experience managing multiple environments and cloud accounts **Cloud Platforms** * Strong hands\-on experience with **AWS** services * Working knowledge of **Snowflake** is a strong plus **CI/CD** * Experience with **GitHub Actions** or similar tools **Programming** * Strong **Python** skills for automation * Comfortable with **Bash** **Observability** * Experience with Datadog, Prometheus/Grafana, ELK, CloudWatch, or similar **Systems \& Networking** * Strong Linux, networking, and security fundamentals **SRE Practices** * Experience with SLOs, error budgets, and blameless post\-mortems **Soft Skills** * Strong English communication skills * Ability to work in ambiguous environments ### **Additional Requirements** * Willingness to participate in **on\-call rotation** * Availability for occasional **after\-hours maintenance**, scheduled in advance
Vaga originalmente publicada em: indeed
Receba vagas como esta no seu email
Crie um alerta gratuito e seja o primeiro a saber de novas oportunidades
Alertas que entendem o que você quer
Não receba qualquer vaga. Receba apenas as que combinam exatamente com o que você busca.
Filtro:
Você recebe tudo isso:
Filtro:
Você recebe apenas:
Zero ruído. Só vagas relevantes para você.
Outros exemplos de filtros precisos:
Filtros Combinados
Combine linguagem + framework + nível + localização. Seja tão específico quanto quiser.
Email Diário
Receba um resumo diário apenas com vagas que passam nos seus filtros. Sem spam.
Kanban Visual
Organize suas candidaturas em um quadro Kanban. Acompanhe cada processo seletivo.
Planos simples, sem surpresas
Comece grátis e faça upgrade quando quiser
Premium
- Tudo do plano gratuito
- Vagas salvas ilimitadas
- Quadros Kanban ilimitados
- Alertas de vagas por email
- Suporte prioritário
Pronto para encontrar sua vaga ideal?
Junte-se a milhares de desenvolvedores que já usam o Job For Dev
Encontre as melhores oportunidades para desenvolvedores no Job For Dev