Chief Site Reliability Engineer
Job Description
We are looking for a **Chief Site Reliability Engineer** to oversee the maintenance and enhancement of enterprise applications and their infrastructure through advanced DevOps methodologies. In this role, you will lead the application of CI/CD pipelines, infrastructure automation with Terraform, and Kubernetes cluster management to ensure robust and secure cloud environments. Join us to drive reliability and operational excellence at scale.

**Responsibilities**

* Maintain and improve enterprise applications and infrastructure by applying DevOps best practices
* Design and manage CI/CD pipelines to enable continuous software deployment
* Automate infrastructure provisioning and management via Terraform
* Administer Kubernetes clusters with a focus on security and high availability
* Monitor system health and performance, implementing enhancements to boost reliability
* Collaborate with software development teams to refine deployment and automation workflows
* Address operational requests and coordinate maintenance activities to uphold system stability
* Enforce security standards in infrastructure and application deployment processes
* Resolve complex issues related to cloud infrastructure and application performance
* Work with cross-functional teams to facilitate enterprise-scale software releases
* Document system configurations, operational procedures, and troubleshooting instructions
* Assess and integrate new tools and technologies to optimize infrastructure operations

**Requirements**

* Advanced proficiency in Python with at least 7 years of experience
* Proven expertise with Amazon Web Services and Microsoft Azure, including API, authentication, and serverless components
* Comprehensive knowledge of cloud networking, Kubernetes cluster management, security, IAM, and automation
* Strong grasp of CI/CD concepts, source control, containerization, and infrastructure as code using Terraform
* Experience enabling and enhancing Infrastructure as a Service (IaaS) solutions
* Background in enterprise-level software development and release management
* In-depth understanding of automation principles related to CI/CD and IaaS
* Exceptional analytical and problem-solving skills for complex issues
* Ability to manage operational requests and maintenance incidents effectively
* Proficient English communication skills, both written and verbal (B2+)

**We offer**

* International projects with top brands
* Work with global teams of highly skilled, diverse peers
* Healthcare benefits
* Employee financial programs
* Paid time off and sick leave
* Upskilling, reskilling and certification courses
* Unlimited access to the LinkedIn Learning library and 22,000+ courses
* Global career opportunities
* Volunteer and community involvement opportunities
* EPAM Employee Groups
* Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn
Job originally posted on: LinkedIn