Freelance AI Evaluation Engineer (Python/Full-Stack)

Mindrift

Apply Now
Portugal
$60,000 - $60,000 / year
full-time
mid
Posted March 14, 2026
via himalayas

About This Role

Create challenging coding test cases for AI systems, review and refine production codebases, and analyze AI failures. Work on part-time, non-permanent projects for leading tech companies. Requirements • Degree in Computer Science, Software Engineering, or related fields • 5+ years in software development, primarily Python (pytest, async/await, subprocess, file operations) • Background in Full-Stack development, with an equal focus on building React-based interfaces and robust Back-end systems • Experience writing tests (functional, integration - not just running them) • Docker containers (running evaluations locally in containers) • CI/CD understanding (GitHub Actions as a user: triggers, labels, reading results) • English proficiency - B2 Benefits • Flexible work schedule • Opportunity to work on challenging projects with leading tech companies • Potential earnings of up to $30 per hour equivalent Originally posted on Himalayas

Ready to Apply?

Click the button below to visit the company's application page.

Apply for this Position