Job Description:
We are seeking an experienced Machine Learning Engineer to join our team. The ideal candidate will have a strong background in PyTorch and C++ programming, with expertise in ML workload analysis, compiler development, and quantization techniques.
Key Responsibilities:
* Develop and enhance ML compilers using PyTorch and C++ tailored for Qualcomm's Neural Processing Unit (NPU) architecture.
* Design and implement advanced quantization techniques to improve model efficiency and accuracy.
* Optimize ML workloads for deployment across diverse devices, ensuring top-tier performance and energy efficiency.
The successful candidate will collaborate directly with strategic customers to address specific challenges and deliver tailored ML solutions. Experience with deep learning frameworks such as TensorFlow or ONNX is a plus. A proven track record of solving complex performance and efficiency challenges in hardware-aware ML solutions is essential.
Benefits:,