Machine Learning Engineer – PyTorch / C++ / NPU Architecture
We are partnered with a global semiconductor leader seeking a Machine Learning Engineer to join their NPU Architecture Team. This is a chance to shape the future of energy-efficient ML technology used in billions of devices worldwide.
What you'll be doing:
* Develop and enhance ML compilers using PyTorch and C++ for advanced NPU architecture
* Design and implement quantization techniques to improve model efficiency and accuracy
* Optimize ML workloads for high performance and low power consumption
* Collaborate with strategic customers to deliver tailored ML solutions
* Partner with architecture and software teams to integrate new ML technologies
* Research, prototype, and implement compiler and system-level optimizations
What we're looking for:
* Strong background in PyTorch and C++ programming
* Experience in ML compiler development, quantization, and workload analysis
* Familiarity with TensorFlow or ONNX a plus
* Solid understanding of hardware-aware ML solutions and performance optimization
* Excellent problem-solving, collaboration, and communication skills
Preferred:
* Experience deploying ML models on NPUs, GPUs, or TPUs
* Understanding of system-level architecture and low-level programming
* Research or publications in ML-related fields
If this sounds interesting and you'd like to learn more, click the link below to apply or email me with a copy of your CV on
-
By applying to this role you understand that we may collect your personal data and store and process it on our systems. For more information please see our Privacy Notice (https://eu-)