At Red Hat, our commitment to open source innovation extends beyond our products – it’s embedded in how we work and grow. Red Hatters embrace change – especially in our fast‑moving technological landscape – and have a strong growth mindset. That’s why we encourage our teams to proactively, thoughtfully, and ethically use AI to simplify their workflows, cut complexity, and boost efficiency. This empowers our associates to focus on higher‑impact work, creating smart, more innovative solutions that solve our customers’ most pressing challenges. Red Hat’s Global Engineering team is looking for a Principal Software Engineer to join the Agentic and AI Engineering Tools team to develop new APIs, tooling, providers and improve overall code quality in the upstream Llama Stack project. In this role, you’ll contribute directly to Red Hat’s rapidly growing AI/ML family of products and be responsible for the design, development, and refinement of software adding features that enable Llama Stack to achieve Enterprise readiness.
What you will do
Collaborate with Staff Engineers, Engineering, Product Management, and User Experience to define customer needs, use cases, and ensure product and testing fulfills those requirements
Collaborate with Quality Engineers to develop and implement comprehensive unit, integration, and E2E tests to guarantee the reliability and performance in the upstream project, maintaining CI/CD workflows in GitHub, and ensuring downstream quality
Participate in AI‑assisted code reviews, utilizing tools that provide real‑time feedback, identify potential bugs, security vulnerabilities, and adherence to coding standards, contributing to a more thorough and efficient review process
Leverage AI‑driven testing frameworks to automate test case generation, optimize test execution, identify potential edge cases, and enhance overall test coverage and reliability
Create and maintain clear, concise upstream technical documentation including API references and user guides and collaborate with our internal tech writers to create robust downstream documentation
What you will bring
10 years of advanced Python and 5+ years Bash/Shell development experience
The previous experience in testing, automation and quality engineering is mandatory
Advanced knowledge designing robust and scalable software used in highly scaled and performant Distributed Systems
Experience with building agentic/RAG applications, MCP integrations, or developing with LLMs
Advanced knowledge creating automation for GitHub, using GitHub Actions or related continuous integration tools
Advanced knowledge developing unit, functional, and end‑to‑end (E2E) test cases and automation, including frameworks such as PyTest
Knowledge of Kubernetes/OpenShift and operational knowledge building/deploying containers
Experience developing, deploying or maintaining On‑prem or Cloud Infrastructure
Ability to quickly learn and use new tools and technologies
Nice to Haves
Experience with Security, Observability, Performance or Scale
Understanding of DevOps methodology, scrum, and/or Jira
Experience with AI and Machine Learning platforms, tools, and frameworks, such as LlamaStack, LangChain, PyTorch, LLaMA.cpp, vLLM, LangGraph, and Kubeflow
Equal Opportunity Policy (EEO)
Red Hat is proud to be an equal opportunity workplace and an affirmative action employer. We review applications for employment without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, citizenship, age, veteran status, genetic information, physical or mental disability, medical condition, marital status, or any other basis prohibited by law.
#J-18808-Ljbffr