Senior DevOps Engineer
Job Description
We are optimizing Kubernetes-based GPU orchestration and Linux compute environments, and a Senior DevOps Engineer will help automate and harden operations at scale. You will manage Kubernetes administration and Volcano scheduling, enforce resource quotas, and deliver automation in Python and UNIX Shell scripting for research workloads. Apply today to join our delivery team.

**Responsibilities**

* Set up, configure, and maintain GPU-enabled Kubernetes clusters and standalone Linux compute environments to ensure efficient scheduling and high performance
* Implement and manage Volcano scheduling workflows, including queue setup, Pod execution, GPU allocation, and namespace quota enforcement
* Administer Kubernetes platforms end-to-end, covering namespaces, RBAC, resource quotas, and workload isolation strategies
* Write and maintain Python and Shell scripts to automate job submission, resource provisioning, and system reporting
* Work with orchestration, optimization, and observability teams to improve scheduling efficiency, capacity utilization, and researcher workflows
* Monitor infrastructure health and resource utilization and provide data for optimization and reporting requirements
* Suggest and implement improvements to infrastructure, tooling, and automation workflows to enhance performance, scalability, and usability
* Ensure operational processes support a seamless and efficient experience for researchers across varied AI and computational workloads

**Requirements**

* Minimum 3 years of professional experience in DevOps or infrastructure engineering in complex, large-scale environments
* Expert knowledge of Kubernetes administration and orchestration, including namespaces, Pod scheduling/distribution, PVC, NFS, and resource quota management
* Hands-on experience with Volcano scheduler for GPU workload execution, including queue configuration and workload prioritization integrated with Kubernetes
* Proven experience operating GPU cluster environments in Kubernetes and also on standalone Linux compute nodes
* Advanced Python scripting skills for infrastructure automation, plus proficiency in UNIX Shell scripting such as Bash
* Strong Linux administration skills, including troubleshooting, performance tuning, and configuration management
* Solid understanding of infrastructure automation and orchestration principles and tooling
* Fluent English communication skills (spoken and written) for direct client interaction

**Nice to have**

* Helm experience for Kubernetes application package management
* Familiarity with observability tooling, especially Prometheus, Grafana and Loki
* Experience with Infrastructure as Code tools such as Terraform
* Exposure to multi-cloud Kubernetes environments including Amazon EKS and Google GKE
* Knowledge of Azure Networking, including VPN, ExpressRoute and network security
* Familiarity with AI-assisted coding tools such as GitHub Copilot, ChatGPT and Claude
* Experience with hybrid (cloud and on-premises) scheduling and resource optimization
Originally posted on: Indeed