Senior Site Reliability Engineer

EPAM Systems
Senior
Remote 🌐
Published on November 20, 2025

Job Description

Join our rapidly evolving Enterprise Technology team as a **Senior Site Reliability Engineer**, where you will maintain various enterprise applications and their infrastructure. You will apply DevOps practices, tools, and engineering capabilities to deliver robust solutions that benefit the company long-term. If you are passionate about engineering excellence and infrastructure reliability, we encourage you to apply.

**Responsibilities**

* Maintain and enhance enterprise applications and infrastructure using DevOps methodologies
* Implement and manage CI/CD pipelines for continuous software delivery
* Develop and automate infrastructure provisioning and management using Terraform
* Administer and secure Kubernetes clusters ensuring high availability
* Monitor system performance and implement improvements to ensure reliability
* Collaborate with development teams to enhance deployment processes and automation
* Handle operational requests and maintenance events to ensure system stability
* Apply security best practices in infrastructure and application deployment
* Troubleshoot complex issues related to cloud infrastructure and application performance
* Coordinate with cross-functional teams to support enterprise-scale software releases
* Document system configurations, procedures, and troubleshooting guides
* Evaluate and implement new tools and technologies to optimize infrastructure operations

**Requirements**

* Expert knowledge of the Python programming language with 3+ years experience
* Demonstrable experience with Amazon Web Services and Microsoft Azure including API, authentication, and serverless
* Experience with infrastructure facets including cloud networking, Kubernetes cluster administration, security, IAM, and configuration automation
* Deep understanding of CI/CD, source control, containers, and infrastructure management using Terraform
* Experience in IaaS enablement and enhancement
* Enterprise-scale software development and release management experience
* Strong understanding of automation principles around CI/CD and IaaS
* Excellent complex problem-solving and analytical capabilities
* Ability to handle operational requests and maintenance events effectively
* Strong written and verbal English communication skills (B2+)

Job originally posted on: Indeed
