Mploy - דרושים
Mploy - דרושים

דרושים AI Quality Evaluator (Freelance) בכל הארץ

 \ 

AI Quality Evaluator (Freelance)

 נכון לתאריך

 

29/11/2025

 כל הארץ

 Toloka

Company Intro

At Toloka AI we create data that powers leading GenAI models and innovations. We work with frontier labs, big tech, renowned AI startups, enterprises and non-profit research organizations worldwide. We use a combination of Experts + Crowd + Tech Platform to teach AI models to reason and evaluate their efficacy and safety. We have experts in more than 50 different domains-from doctors and lawyers to physicists and engineers-and boast one of the most diverse global crowds, representing over 100 countries and speaking 40+ languages. We are a well-funded startup with an enviable portfolio of clients including Anthropic, Amazon, Microsoft, Poolside, Recraft, and Shopify.

Recently, we secured strategic investment led by Bezos Expeditions with participation from Mikhail Parakhin, CTO of Shopify and board advisor to leading GenAI companies, who now serves as our Chairman of the Board. Our remote-first team is globally distributed around the world: USA, UK, the Netherlands, Israel, Czech Republic, Serbia, and more. We are headquartered in Amsterdam.

About the role:

We are seeking an analytical and technically-minded professional to:

  • Evaluate AI outputs and processes
  • Ensure quality, accuracy, and reliability
  • Identify logical errors, risks, and structural inconsistencies
  • Provide actionable insights and recommendations to the team

Ideal candidates:

  • Consultants, auditors, analysts, data researchers, or business/technical analysts with strong reasoning skills
  • Professionals curious about AI, process improvement, and quality evaluation
  • Problem-solvers who enjoy analyzing complex systems, logic, and scenarios

Key Responsibilities:

  • Lead evaluation of AI outputs and related processes
  • Review tasks against expected/ideal scenarios; identify gaps and risks
  • Provide structured, actionable recommendations to engineers, domain experts, and managers
  • Maintain and improve evaluation guidelines, checklists, SOPs
  • Suggest new approaches, tools, and processes to enhance AI evaluation

Experience & Background:

  • Scenario validation, data analysis, auditing, or consulting experience
  • Analytical work in research, technical/business analysis, or risk evaluation

Knowledge & Skills:

  • Strong analytical and critical thinking
  • Attention to detail, reliability, and an ownership mindset
  • Technical understanding: JSON/YAML, basic Git/GitHub
  • Clear English (B2+) for communication and documentation
  • Independent, proactive mindset

Nice to Have:

  • Scenario-based testing, annotation workflows, AI/LLM evaluation
  • Experience in cross-functional teams

What we offer:

  • Freelance collaboration
  • Flexible and full remote work schedule
  • Hourly rate: 30–60 $/hour
  • Collaborative and supportive team environment

משרות דומות שיכולות לעניין אותך

 נכון לתאריך

 

17/12/2025

 תל אביב

For thousands of years, maps have provided humans with the knowledge they need to make decisions. As a Maps Evaluator, you will have the opportunity t...  

read more

 נכון לתאריך

 

29/11/2025

 כל הארץ

Please submit your resume in English and indicate your level of English.

At [Mindrift](https://mindrift.ai/), innovation meets opportunity....  

קרא עוד

 נכון לתאריך

 

20/11/2025

 כל הארץ

**Job Description

******_This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibilit...  

read more

 נכון לתאריך

 

09/12/2025

 כל הארץ

  • Role: Artificial Intelligence Intern (Remote)
  • Location: Remote

* **Pay**: Competitive monthly compensation up to $10,000, based on r...  

read more

 נכון לתאריך

 

03/12/2025

 כל הארץ

Role: Research Evaluation Specialist

Location: Remote

We are hiring expert reviewers to contribute to a cutting-edge AI research initiat...  

read more

 נכון לתאריך

 

27/11/2025

 כל הארץ

**Job Description

**About the Role

Braintrust AI is looking for individuals with a strong background in ****Actuarial Science**** to help tr...  

read more

 נכון לתאריך

 

03/12/2025

 כל הארץ

  • Role: Audio Generalist Evaluator Expert
  • Location: Remote
  • Pay: $5,000–$7,000 per month

About the Role:

We are hiring ...  

read more

 נכון לתאריך

 

12/11/2025

 כל הארץ

Help Shape the Future of AI — From Anywhere

AI is revolutionising how we interact with technology. From news feeds to navigation, machine learning ...  

read more

 נכון לתאריך

 

27/11/2025

 כל הארץ

**Job Description

**About the Role

Braintrust AI is looking for individuals with a strong background in ****Chemistry (and Python)**** to he...  

read more
הצג משרות דומות נוספות...

Mploy אצלכם בוואטסאפ

✨ רוצים להתעדכן בכל המשרות הכי שוות ישר לנייד?

הצטרפו לקבוצות הוואטסאפ שלנו וקבלו את כל ההצעות המתאימות – בלי לחפש, ובלי לפספס. מחכים לכם! 📱😊