Senior Software Engineer – LLM Evaluation (Remote)

Nexus Consulting
Sênior
Remoto 🌐
Publicado em 19 de fevereiro de 2026

Descrição da Vaga

* **Title:** Senior Software Engineer – LLM Evaluation (Remote) * **Engagement:** Hourly contract (independent contractor) * **Location:** Remote **About the Opportunity** One of our global AI research clients is developing advanced evaluation and benchmarking datasets to improve the performance of large language models in real\-world software engineering scenarios. This role focuses on assessing AI\-generated code and strengthening model reliability across production\-grade engineering workflows. **Role Overview** As a Senior Software Engineer supporting AI model evaluation, you will contribute to building high\-quality datasets used for training and benchmarking large language models. You will work closely with researchers to curate code examples, provide precise technical solutions, and refine AI\-generated outputs across multiple programming languages. This role blends hands\-on software engineering expertise with structured AI evaluation and research collaboration. **Key Responsibilities** * Curate and develop realistic software engineering tasks across languages such as Python, JavaScript (including React), C/C\+\+, Java, Rust, and Go * Review, evaluate, and refine AI\-generated code for efficiency, scalability, correctness, and maintainability * Collaborate with cross\-functional research teams to enhance AI\-driven coding solutions against industry performance benchmarks * Design verification mechanisms to automatically validate software engineering solutions * Analyze stages of the software development lifecycle (architecture design, API design, prototyping, production deployment, monitoring, and maintenance) and evaluate model performance across these stages * Build internal tools or agents to detect code quality issues and error patterns **Requirements** * Several years of professional software engineering experience * At least 2 years of continuous full\-time experience at a product\-focused technology company * Strong expertise in building and deploying scalable, production\-grade applications * Deep understanding of software architecture, debugging, performance optimization, and code review standards * Experience working with modern development workflows and tooling * Strong written and verbal communication skills for documenting structured evaluation feedback **Engagement Details** * Flexible engagement: minimum 10 hours per week, up to 40 hours per week * Partial overlap with Pacific Time required * Contractor engagement (no medical or paid leave benefits) * Initial duration: 1 month, with potential extension based on performance and project needs **APPLY NOW !**

Vaga originalmente publicada em: linkedin

Receba vagas como esta no seu email

Crie um alerta gratuito e seja o primeiro a saber de novas oportunidades

Criar Alerta Gratuito

Alertas que entendem o que você quer

Não receba qualquer vaga. Receba apenas as que combinam exatamente com o que você busca.

Alerta genérico

Filtro:

Python

Você recebe tudo isso:

Vaga de Python + Django
Vaga de Python + Flask
Vaga de Python + ETL/Data
Vaga de Python + Machine Learning
...e muito ruído no seu email
Alerta inteligente

Filtro:

Python+FastAPI

Você recebe apenas:

Desenvolvedor Python + FastAPI
Backend Engineer (FastAPI)
API Developer - Python/FastAPI

Zero ruído. Só vagas relevantes para você.

Outros exemplos de filtros precisos:

JavaScript+React+Remoto
Java+Spring Boot+Sênior
Go+Kubernetes

Filtros Combinados

Combine linguagem + framework + nível + localização. Seja tão específico quanto quiser.

Email Diário

Receba um resumo diário apenas com vagas que passam nos seus filtros. Sem spam.

Kanban Visual

Organize suas candidaturas em um quadro Kanban. Acompanhe cada processo seletivo.

Planos simples, sem surpresas

Comece grátis e faça upgrade quando quiser

Gratuito

R$ 0para sempre
  • Busca de vagas ilimitada
  • Salvar até 10 vagas
  • 1 quadro Kanban
Criar Conta Grátis
Popular

Premium

R$ 9,90/mês
  • Tudo do plano gratuito
  • Vagas salvas ilimitadas
  • Quadros Kanban ilimitados
  • Alertas de vagas por email
  • Suporte prioritário
3 dias grátis, sem cartão

Pronto para encontrar sua vaga ideal?

Junte-se a milhares de desenvolvedores que já usam o Job For Dev

Encontre as melhores oportunidades para desenvolvedores no Job For Dev