Mploy - Jobs

AI Task Evaluator jobs, nationwide


AI Task Evaluator

 

14/11/2025

 Nationwide

 MyRemoteTeam Inc

Apply here 👉 AI Agent Evaluation Analyst Application Form


We’re Hiring: AI Task Evaluator – TAU Framework (Remote | Contract)

We’re collaborating with a leading global AI company on the TAU (Tool–Agent–User) framework, an advanced benchmark designed to evaluate how AI agents perform in realistic, multi-step environments.

As part of this project, you’ll help test and refine AI reasoning by reviewing simulated real-world interactions — where an AI Agent uses tools to complete user requests while following business rules, policies, and logic.

🧠 Role Overview

You’ll analyze and annotate AI-agent conversations and task trajectories to determine if:

  • The agent’s reasoning and tool usage are logical and consistent.
  • The policies (privacy, accuracy, authorization) are respected.
  • The final outcome matches the correct “golden path.”
  • The conversation flow is realistic, clear, and aligned with the user’s intent.

This role blends quality assurance, research, and logic-based evaluation — ideal for people who love breaking down processes and improving system intelligence.

🔍 Key Responsibilities

  • Review agent–user interactions and identify logical gaps or policy violations.
  • Validate tool sequences and end results against golden sets.
  • Flag inconsistencies, missing steps, or unrealistic actions.
  • Annotate errors and reasoning issues clearly and concisely.
  • Suggest edge cases or task improvements to enhance coverage and realism.

What We’re Looking For

  • Excellent analytical and critical-thinking skills.
  • Strong attention to detail and logical consistency.
  • Ability to understand structured workflows (familiarity with reading JSON/YAML is a plus).
  • Clear English communication and documentation skills.
  • Background in QA, consulting, linguistics, research, or systems analysis preferred.

