We are seeking a skilled Machine Learning Engineer to join our team.
This role involves developing and optimizing machine learning compilers in PyTorch and C++ that target Neural Processing Unit (NPU) architectures.
Key Responsibilities:
* Create and enhance ML compilers using PyTorch and C++.
* Design and implement advanced quantization techniques to improve model efficiency and accuracy.
* Optimize ML workloads for deployment across diverse devices, ensuring top-tier performance and energy efficiency.
* Collaborate with customers to deliver impactful solutions.
* Research, prototype, and implement innovative solutions in ML compiler design and system-level optimizations.
Qualifications:
* Bachelor's or advanced degree in Computer Science, Electrical Engineering, Machine Learning, or a related discipline.
* Strong expertise in PyTorch and C++ programming.
* Experience with ML workload analysis, compiler development, and quantization techniques.
* Familiarity with other deep learning frameworks and model formats, such as TensorFlow or ONNX, is a plus.
* Proven track record of solving complex performance and efficiency challenges in hardware-aware ML solutions.
You will join a team that drives advancements in machine learning technology, contributing to customized solutions while helping shape the future of energy-efficient machine learning.