Mploy - דרושים

דרושים TensorRT-LLM Software Development Engineer ברעננה

 \ 

TensorRT-LLM Software Development Engineer

 נכון לתאריך

 

28/11/2025

 רעננה

 NVIDIA

We are now looking for a TensorRT-LLM Software Development Engineer!

NVIDIA is hiring software engineers for its TensorRT-LLM team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning-powered AI, enabling breakthroughs in areas like LLM, ChatGPT and Generative AI that have put DL at the “iPhone moment” for AI. Join the team which is building the inferencing software which is foundational to product lines within NVIDIA and across the industry! The ability to work on a fast-paced delivery-focused team is required and excellent interpersonal skills are a must.

What You'll Be Doing

  • Craft and develop robust inference software that can be scaled to multiple platforms for functionality and performance
  • Performance analysis, optimization, and tuning for Large Language Models (LLMs)
  • Conduct unit tests and performance tests for different stages of the inference pipeline.
  • Closely follow academic developments in the field of artificial intelligence and feature update TensorRT-LLM
  • Write safe, scalable, modular, and high-quality (C++/Python) code for our core backend software for LLM inference.
  • Collaborate across the company to guide the direction of deep learning inference, working with software, research and product teams

What We Need To See

  • Bachelors, Masters or higher degree in Computer Engineering, Computer Science, Applied Mathematics or related computing focused degree (or equivalent experience).
  • 5+ years of relevant software development experience.
  • Excellent Python programming skills, software design, and software engineering skills
  • Awareness of the latest developments in LLM architectures and LLM inference techniques
  • Experience working with deep learning frameworks like PyTorch and HuggingFace
  • Proactive and able to work without supervision
  • Excellent written and oral communication skills in English

Ways To Stand Out From The Crowd

  • Prior experience with a LLM inference framework (TensorRT-LLM, SGLang, vLLM, etc.) or a DL compiler in inference, deployment, algorithms, or implementation
  • Prior experience with performance modeling, profiling, debug, and code optimization of a DL/HPC/high-performance application
  • Excellent C/C++ programming and software design skills, including debugging, performance analysis, and test design.
  • Architectural knowledge of CPU and GPU
  • GPU programming experience (CUDA or OpenCL)

JR2008357

משרות דומות שיכולות לעניין אותך

 נכון לתאריך

 

22/10/2025

 רעננה

NVIDIA is seeking a sharp, innovative, and hands-on Architect to help shape the future of LLM inference at scale. Join our dynamic E2E Architecture gr...  

read more

 נכון לתאריך

 

19/11/2025

 רעננה

Our team propels generative AI forward by building and deploying agentic systems that integrate innovative LLMs with domain tools to expedite HW and S...  

read more

 נכון לתאריך

 

15/11/2025

 רעננה

NVIDIA is building state-of-the-art accelerated computing platforms that know no boundaries. Our next-generation Infiniband, NVLink, and Ethernet syst...  

read more

 נכון לתאריך

 

12/11/2025

 רעננה

Job ID: 205238

Required Travel : Minimal

**Managerial - No

****Location: Israel- RAANANA (Amdocs Site)

****Who are we?

...  

read more

 נכון לתאריך

 

23/10/2025

 רעננה

At NiCE, we don’t limit our challenges. We challenge our limits. Always. We’re ambitious. We’re game changers. And we play to win. We set the highest ...  

read more

 נכון לתאריך

 

08/11/2025

 רעננה

NVIDIA is looking for an experienced SW Engineer with desire and ability to contribute and lead cutting edge Network Management System of most powerfu...  

read more

 נכון לתאריך

 

31/10/2025

 רעננה

NVIDIA is searching for a highly motivated, excellent Senior Software Engineer for design and verification to join the software tools group. You will ...  

read more

 נכון לתאריך

 

25/11/2025

 רעננה

NVIDIA is looking for an excellent Software Engineer to join the network management team. The team develops software responsible for configuring netwo...  

read more

 נכון לתאריך

 

05/11/2025

 רעננה

Intelligent machines powered by Artificial Intelligence computers that can learn, reason and interact with people are no longer science fiction. GPU D...  

read more
הצג משרות דומות נוספות...

Mploy אצלכם בוואטסאפ

✨ רוצים להתעדכן בכל המשרות הכי שוות ישר לנייד?

הצטרפו לקבוצות הוואטסאפ שלנו וקבלו את כל ההצעות המתאימות – בלי לחפש, ובלי לפספס. מחכים לכם! 📱😊