Mploy - דרושים

דרושים Senior AI Agent Evaluation Analyst בכל הארץ

 \ 

Senior AI Agent Evaluation Analyst

 

14/11/2025

 כל הארץ

 MyRemoteTeam Inc

About the Role

We are hiring a Senior AI Agent Evaluation Analyst to help benchmark and enhance the reasoning, reliability, and policy adherence of advanced AI agents.

You’ll evaluate multi-step reasoning trajectories, analyze structured agent data, and ensure models behave consistently and ethically across realistic scenarios.

Key Responsibilities

  • Evaluate AI agents’ reasoning paths, tool-use accuracy, and decision logic in complex tasks.
  • Identify inconsistencies, hallucinations, or policy violations within multi-step reasoning.
  • Analyze structured data (JSON/YAML) to verify agent actions and tool calls.
  • Benchmark large language models (GPT, Claude, Gemini, etc.) for reasoning accuracy and behavioral consistency.
  • Document findings, improve evaluation frameworks, and contribute to quality standards.
  • Collaborate with Research, Policy, and AI Safety teams to refine evaluation methodologies.

Required Qualifications

  • 7+ years of professional experience in AI Agent Evaluation, LLM Assessment, AI Safety, or Red Teaming.
  • Proven expertise in multi-step reasoning evaluation, policy logic, and trajectory analysis.
  • Familiarity with LangChain, RAG, Prompt Engineering, and structured data formats (JSON/YAML).
  • Strong analytical writing, logical reasoning, and documentation skills in English.
  • Hands-on experience with LLMs such as GPT, Claude, or Gemini.
  • Excellent attention to detail and ability to evaluate abstract, ambiguous reasoning tasks.

Preferred Skills

  • Understanding of RLHF (Reinforcement Learning from Human Feedback) and model alignment.
  • Knowledge of AI Policy, Safety Evaluation, and Red Teaming frameworks.
  • Experience developing evaluation rubrics or QA frameworks for AI systems.
  • Exposure to projects with OpenAI, Anthropic, Google, Appen, TELUS, or Scale AI.

Benefits

  • 100% remote role with flexible hours.
  • Competitive compensation with performance-based incentives.
  • Opportunity to shape next-generation agentic AI systems and reasoning benchmarks.
  • Work with global experts at the intersection of AI evaluation and safety research.

E: | לפנייה למשרה יש להגיש מועמדות |

📩 Think you’ve got the brain for it?

Apply here 👉 AI Agent Evaluation Analyst Application Form

משרות דומות שיכולות לעניין אותך

 

11/11/2025

 כל הארץ

Role Overview:

We seek a Senior QA Engineer to lead the design, testing, validation, and automation of QA strategies across our Data & AI program...

read more
 

08/11/2025

 כל הארץ

****Job description

**At , innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the fu...

read more
 

30/10/2025

 כפר סבא

We are looking for a Senior AI Software Engineer to join our R&D group and help design and implement our next-generation **AI-driven data and inte...

read more
 

08/11/2025

 כל הארץ

  • Role: Software Engineer
  • Location: Remote

About the Role:

We are seeking exceptional software engineers to collaborate on a g...

read more
 

19/10/2025

 תל אביב

As a Red Team Specialist focused on Generative AI Models, you will play a critical role in enhancing the security and integrity of our cutting-edge AI...

read more
 

08/11/2025

 כל הארץ

  • Role: Audio Generalist Evaluator Expert
  • Location: Remote
  • Pay: $5,000–$7,000 per month

About the Role:

We are hiring ...

read more
 

23/10/2025

 תל אביב

Sr. Data Scientist

About the Company

Cybereason is on a mission to reverse the adversary advantage by empowering defenders with ingenuity an...

read more
 

07/11/2025

 כל הארץ

**Job Description

**We are seeking an experienced Agentic AI Engineer with a robust background in software engineering, machine learning, and adv...

read more
 

05/11/2025

 כל הארץ

Job Title: Business Consulting Expert - AI Training (MSc. or PhD)

Job Type: Full-time

Location: Remote

Job Summary:

Jo...

read more
הצג משרות דומות נוספות...

Mploy אצלכם בוואטסאפ

✨ רוצים להתעדכן בכל המשרות הכי שוות ישר לנייד?

הצטרפו לקבוצות הוואטסאפ שלנו וקבלו את כל ההצעות המתאימות – בלי לחפש, ובלי לפספס. מחכים לכם! 📱😊