Job Overview
We are seeking a highly skilled AI/ML Software Engineer to join our innovative community of customers, partners, and contributors. This individual will play a key role in delivering high-performing solutions using open source technologies.
The selected candidate will collaborate with cross-functional teams to define customer needs and use cases, create robust AI/ML software tools, and contribute to a healthy open source community. Key responsibilities include:
* Collaborating with Staff Engineers, Engineering, Product Management, and User Experience to define customer needs and use cases.
* Creating robust AI/ML software tools to enable AI Application development and contributing to a healthy open source community.
* Developing and implementing comprehensive unit, integration, and end-to-end tests to guarantee the reliability and performance in the upstream project, maintaining CI/CD workflows in GitHub, and ensuring downstream quality.
* Participating in AI-assisted code reviews, utilizing tools that provide real-time feedback, identify potential bugs, security vulnerabilities, and adherence to coding standards, contributing to a more thorough and efficient review process.
* Proactively utilizing AI-assisted development tools (e.g., GitHub Copilot, Cursor, Claude Code) for code generation, auto-completion, and intelligent suggestions to accelerate development cycles and enhance code quality.
* Creating and maintaining clear, concise upstream technical documentation including API references and user guides and collaborating with internal tech writers to create robust downstream documentation.
* Evaluating and integrating the latest advancements in AI/ML technologies and toolkits to improve existing systems and develop new innovative solutions.
Requirements
The ideal candidate should have:
* At least 10 years of advanced Python development experience as a Software Engineer in Open Source communities with experience in AI/ML.
* Advanced knowledge designing robust and scalable APIs used in highly scaled and performant Distributed Systems.
* Experience with AI and Machine Learning platforms, tools, and frameworks, such as LlamaStack, LangChain, PyTorch, LLaMA.cpp, vLLM, LangGraph, and Kubeflow.
* Advanced knowledge creating automation for GitHub, using GitHub Actions or related continuous integration tools.
* Experience developing, deploying, or maintaining On-prem or Cloud Infrastructure.
* Advanced knowledge developing unit, functional, and end-to-end (E2E) test cases and automation.
* Ability to quickly learn and use new tools and technologies.