Overview
We are partnered with a global semiconductor leader seeking a Machine Learning Engineer to join their NPU Architecture Team.
This is a chance to shape the future of energy-efficient ML technology used in billions of devices worldwide.
What you'll be doing
Develop and enhance ML compilers using PyTorch and C++ for advanced NPU architecture
Design and implement quantization techniques to improve model efficiency and accuracy
Optimize ML workloads for high performance and low power consumption
Collaborate with strategic customers to deliver tailored ML solutions
Partner with architecture and software teams to integrate new ML technologies
Research, prototype, and implement compiler and system-level optimizations
What we're looking for
Strong background in PyTorch and C++ programming
Experience in ML compiler development, quantization, and workload analysis
Familiarity with TensorFlow or ONNX a plus
Solid understanding of hardware-aware ML solutions and performance optimization
Excellent problem-solving, collaboration, and communication skills
Preferred
Experience deploying ML models on NPUs, GPUs, or TPUs
Understanding of system-level architecture and low-level programming
Research or publications in ML-related fields
If this sounds interesting and you\'d like to learn more, please email your CV to
By applying to this role you understand that we may collect your personal data and store and process it on our systems.
For more information please see our Privacy Notice (
#J-18808-Ljbffr