דרושים AI Quality Evaluator (Freelance) בכל הארץ

דרושים

סמנכ"ל טכנולוגיות CTO \

AI Quality Evaluator (Freelance)

הזמן שלך חשוב? הגדר סוכן חכם

AI Quality Evaluator (Freelance)

נכון לתאריך

29/11/2025

כל הארץ

Toloka

Company Intro

At Toloka AI we create data that powers leading GenAI models and innovations. We work with frontier labs, big tech, renowned AI startups, enterprises and non-profit research organizations worldwide. We use a combination of Experts + Crowd + Tech Platform to teach AI models to reason and evaluate their efficacy and safety. We have experts in more than 50 different domains-from doctors and lawyers to physicists and engineers-and boast one of the most diverse global crowds, representing over 100 countries and speaking 40+ languages. We are a well-funded startup with an enviable portfolio of clients including Anthropic, Amazon, Microsoft, Poolside, Recraft, and Shopify.

Recently, we secured strategic investment led by Bezos Expeditions with participation from Mikhail Parakhin, CTO of Shopify and board advisor to leading GenAI companies, who now serves as our Chairman of the Board. Our remote-first team is globally distributed around the world: USA, UK, the Netherlands, Israel, Czech Republic, Serbia, and more. We are headquartered in Amsterdam.

About the role:

We are seeking an analytical and technically-minded professional to:

Evaluate AI outputs and processes
Ensure quality, accuracy, and reliability
Identify logical errors, risks, and structural inconsistencies
Provide actionable insights and recommendations to the team

Ideal candidates:

Consultants, auditors, analysts, data researchers, or business/technical analysts with strong reasoning skills
Professionals curious about AI, process improvement, and quality evaluation
Problem-solvers who enjoy analyzing complex systems, logic, and scenarios

Key Responsibilities:

Lead evaluation of AI outputs and related processes
Review tasks against expected/ideal scenarios; identify gaps and risks
Provide structured, actionable recommendations to engineers, domain experts, and managers
Maintain and improve evaluation guidelines, checklists, SOPs
Suggest new approaches, tools, and processes to enhance AI evaluation

Experience & Background:

Scenario validation, data analysis, auditing, or consulting experience
Analytical work in research, technical/business analysis, or risk evaluation

Knowledge & Skills:

Strong analytical and critical thinking
Attention to detail, reliability, and an ownership mindset
Technical understanding: JSON/YAML, basic Git/GitHub
Clear English (B2+) for communication and documentation
Independent, proactive mindset

Nice to Have:

Scenario-based testing, annotation workflows, AI/LLM evaluation
Experience in cross-functional teams

What we offer:

Freelance collaboration
Flexible and full remote work schedule
Hourly rate: 30–60 $/hour
Collaborative and supportive team environment

סמנכ"ל טכנולוגיות CTO

אנליסט נתונים Data Analyst

פרילנס

CTO (Chief Technology Officer)

משרות דומות שיכולות לעניין אותך

Map Evaluator

נכון לתאריך

17/12/2025

תל אביב

For thousands of years, maps have provided humans with the knowledge they need to make decisions. As a Maps Evaluator, you will have the opportunity t...

AI Agent Evaluation Scenario Designer (Freelance)

נכון לתאריך

29/11/2025

כל הארץ

Please submit your resume in English and indicate your level of English.

At [Mindrift](https://mindrift.ai/), innovation meets opportunity....

קרא עוד

AI Project Annotator

נכון לתאריך

20/11/2025

כל הארץ

**Job Description