Senior Software Engineer – LLM Evaluation (Remote)

Nexus Consulting
Lead
Remote 🌐
Published on February 19, 2026

Job Description

* **Title:** Senior Software Engineer – LLM Evaluation (Remote)
* **Engagement:** Hourly contract (independent contractor)
* **Location:** Remote

**About the Opportunity**

One of our global AI research clients is building advanced evaluation and training datasets to improve large language models on realistic software engineering tasks. This project focuses on creating verifiable software engineering challenges derived from public repository histories using a structured, human-in-the-loop approach. The goal is to expand dataset coverage across programming languages, complexity levels, and real-world development scenarios.

**Role Overview**

We are seeking experienced, tech lead–level software engineers who are comfortable working with high-quality public GitHub repositories (500+ stars). This role combines hands-on engineering work with AI model evaluation, contributing directly to how AI systems interact with real-world codebases.

**What You’ll Do**

* Analyze and triage GitHub issues across widely used open-source repositories
* Set up and configure repositories, including Dockerization and development environment automation
* Evaluate unit test coverage, quality, and reliability
* Run, modify, and debug real-world codebases locally to assess AI model performance in bug-fixing and implementation tasks
* Collaborate with AI researchers to identify challenging repositories and issue types for LLM evaluation
* Contribute to designing structured, verifiable software engineering tasks
* Potentially lead and mentor junior engineers on repository validation projects

**Required Skills**

* 5+ years of professional software engineering experience
* Strong expertise in at least one of the following: Python, JavaScript, Java, Go, Rust, C/C++, C#, or Ruby
* Deep understanding of software architecture, debugging, and code quality standards
* Proficiency with Git, Docker, and development pipeline setup
* Ability to navigate and evaluate complex, production-grade codebases
* Experience contributing to or reviewing open-source projects is a plus

**Nice to Have**

* Experience participating in AI/LLM evaluation or research initiatives
* Background in building developer tools, automation systems, or code verification agents
* Experience leading small engineering teams

**Engagement Details**

* Contractor assignment (no medical or paid leave)
* 20 hours per week with partial PST overlap
* Duration: 3 months
* Expected start date: Next week
* Fully remote

This role offers a unique opportunity to combine deep software engineering expertise with frontier AI research, directly influencing how large language models understand and solve real-world coding problems.

**APPLY NOW!**

Job originally posted on: LinkedIn
