Mploy - דרושים
Mploy - דרושים

דרושים GenAI engineer - Evaluation בתל אביב

 \ 

GenAI engineer - Evaluation

 נכון לתאריך

 

19/12/2025

 תל אביב

 Dream

At Dream, we redefine cyber defense vision by combining AI and human expertise to create products that protect nations and critical infrastructure. This is more than a job; It’s a Dream job. Dream is where we tackle real-world challenges, redefine AI and security, and make the digital world safer. Let’s build something extraordinary together.

Dream's AI cybersecurity platform applies a new, out-of-the-ordinary, multi-layered approach, covering endless and evolving security challenges across the entire infrastructure of the most critical and sensitive networks. Central to our Dream's proprietary Cyber Language Models are innovative technologies that provide contextual intelligence for the future of cybersecurity.

At Dream, our talented team, driven by passion, expertise, and innovative minds, inspires us daily. We are not just dreamers, we are dream-makers.

The Dream Job:

In this role, you'll be responsible for designing and implementing evaluation, validation and optimization of GenAI systems. You will define, design and develop LLMs as judges to evaluate task and system outputs across multiple applications, create datasets for benchmarking and evaluation and help design robust and scalable evaluation pipelines for both onine and offline GenAI systems.

The Dream-Maker Responsibilities:

  • Design, develop and apply state-of-the-art techniques for evaluating and validating AI agents and/or workflows.
  • Develop and implement LLM-as-a-Judge (or similar) for different tasks and roles for GenAI systems and tools.
  • Design and implement evaluation pipelines and benchmark datasets for evaluating model quality, relevance and system consistency for various applications.
  • Optimize and maintain judge LLMs to evaluate outputs for different use cases such as chatbots, RAG systems, cybersecurity experts and investigators.
  • Define evaluation KPIs and metrics for both models, systems and tools.
  • Validate and optimize datasets for various use cases.
  • Ensure the reliability, efficiency, and scalability of evaluation tools and pipelines for both online and offline use cases.
  • Work closely with AI/ML engineers to make evaluations a part of the production pipelines of GenAI applications.
  • Collaborate with cross-functional teams including product, research and data science.
  • Stay up to date with the latest developments in AI, machine learning, focusing on LLMs, exploring how emerging technologies can be applied to improve our evaluation and validation pipelines.

The Dream Skill Set:

  • Advanced knowledge and experience in NLP and use of LLMs for GenAI applications in production at scale.
  • Strong experience in designing end-to-end R&D plans for GenAI including evaluation and validation lifecycle and benchmarking.
  • Strong proficiency in Python
  • Solid understanding of Data Science and Machine Learning lifecycle and best practices evaluating and validating AI systems at scale.
  • Excellent problem-solving abilities, coupled with a creative and strategic mindset.
  • Proven ability to work effectively in a team setting.

Advantages:

  • Experience with EDD (evaluation driven development) for GenAI applications.
  • Familiarity with cybersecurity applications of GenAI.
  • Advanced skills in performance optimization for high throughput systems.

**Tech Stack:

**Python, Langchain, Langgraph (or other agentic frameworks), Langfuse/LangSmith (or other observability and tracing tools), HuggingFace, Mlflow, MongoDB

Never Stop Dreaming...:

If you think this role doesn't fully match your skills but are eager to grow and break glass ceilings, we’d love to hear from you!

משרות דומות שיכולות לעניין אותך

 נכון לתאריך

 

04/12/2025

 תל אביב

At I-Next Data, the AI innovation center of Tel Aviv Sourasky (Ichilov) Medical Center, we build and deploy production-ready LLM applications that int...  

read more

 נכון לתאריך

 

03/12/2025

 תל אביב

ActiveFence is a leading platform for Trust & Safety teams worldwide, leveraging cutting‑edge AI and world‑class expertise to protect users from the w...  

read more

 נכון לתאריך

 

04/12/2025

 תל אביב

**Job Description

**As a Backend Engineer for the Channels team, you'll play a crucial role in developing and maintaining the technology that dri...  

read more

 נכון לתאריך

 

25/11/2025

 תל אביב

Job Description

About Us

HiBob helps modern, mid-size businesses transform the way they manage people, giving HR and managers all they need to c...  

read more

 נכון לתאריך

 

31/12/2025

 תל אביב

At Dream, we redefine cyber defense vision by combining AI and human expertise to create products that protect nations and critical infrastructure. Th...  

read more

 נכון לתאריך

 

02/12/2025

 תל אביב

At Dream, we redefine cyber defense vision by combining AI and human expertise to create products that protect nations and critical infrastructure. Th...  

read more

 נכון לתאריך

 

28/11/2025

 תל אביב

At Dream, we redefine cyber defense vision by combining AI and human expertise to create products that protect nations and critical infrastructure. Th...  

read more

 נכון לתאריך

 

19/12/2025

 תל אביב

**The Role

**We are currently hiring a highly skilled and motivated Generative AI Engineer for developing an LLM application. As a Generative AI ...  

read more

 נכון לתאריך

 

17/11/2025

 תל אביב

We're Vega! One of the fastest growing start-ups in Cybersecurity - redefining the limits of Security Analytics and Operations.

We've raised a $6...  

read more
הצג משרות דומות נוספות...

Mploy אצלכם בוואטסאפ

✨ רוצים להתעדכן בכל המשרות הכי שוות ישר לנייד?

הצטרפו לקבוצות הוואטסאפ שלנו וקבלו את כל ההצעות המתאימות – בלי לחפש, ובלי לפספס. מחכים לכם! 📱😊