Job Summary
We are seeking a highly skilled and experienced AI & Automation Engineer develop AI-driven automation and next generation AI tools. This role requires a strong understanding of data center network support and NOC operations, AI engineering, and a passion for developing innovative solutions to complex business challenges.
Responsibilities:
* Contribute toward AI roadmap for AI-driven automation aligning with strategic company and client goals.
* Collaborate with engineering and data teams to design and architect scalable, robust, and innovative AI solutions (e.g., automated network diagnostics, bot recommendation systems, AI agents NOC operations).
* Act as the Solution Owner in an Agile/Scrum environment, managing the product backlog, writing detailed user stories, defining acceptance criteria, and prioritizing features.
* Lead the evaluation, fine-tuning, and integration of open-source Large Language Models (LLMs) like Llama series & other LLMs.
* Develop and own key components of the automation framework, including PowerShell and Python runbook executors.
* Develop and train machine learning models for tasks such as anomaly detection, predictive maintenance, and capacity planning in IT environments.
* Work with large datasets of IT operational data, performing data cleaning, feature engineering, and data analysis to improve model accuracy and performance.
* Contribute to the development of internal automation tools and frameworks.
* Deliver continuous service improvements by proactively identifying opportunities for process enhancements.
Proficiently develop AI/Gen AI point solutions to meet business needs.
* Troubleshoot and resolve issues related to automation systems and AI models.
* Engage with customers, IT operators, Network Engineers, and internal stakeholders to gather requirements, validate solutions, and ensure product-market fit.
* Serve as the Subject Matter Expert (SME) on AIOps, IT Process Automation (ITPA), and Runbook Automation (RBA), providing technical guidance and insights.
* Document automation processes, code, and models clearly and concisely.
* Actively contribute to team results and work towards achieving team goals and objectives.
* Undertake designated skill/knowledge development within the organization, including training of next-level team members.
Required Skills & Competencies:
* 4+ years of experience in IT services, with at least 3 years in:
* IT Service Automation – Orchestration, Scripting, & Process Assessment.
* AI Engineering – Developing and deploying AI Agents and cutting-edge GenAI & ML solutions to address complex business challenges.
* Expertise in integrating various tools and building analytics/insights.
* Proficiency in at least two scripting languages such as PowerShell, Python, or Shell Script.
* Strong foundational knowledge across core IT domains, with a specific emphasis on Network & Network Services, including understanding network topologies, protocols, and common operational issues.
* Demonstrable experience implementing solutions using state-of-the-art LLMs, with hands-on experience with both open-source models (e.g., Llama series) and proprietary models (e.g., GPT-4).
* Proficiency with a major deep learning framework, preferably PyTorch, for model experimentation and fine-tuning.
* Proficiency in a high-level programming language (Java, Python) and experience with containerization technologies (Docker, Kubernetes) for deploying AI models.
* Solid understanding and practical experience with cloud platforms such as Azure or GCP, including knowledge of their networking services (e.g., VNets, VPCs).
* Experience with AI/ML libraries and frameworks.
* A strong understanding of ITIL process on one or more service lifecycle or service capability modules.
* Knowledge in one or more System administrative activities like monitoring, service requests, incident management, change management, & maintenance.
* Excellent communication skills with an ability to explain concepts and solutions clearly and concisely.
* Experience in training/grooming people.
* Excellent analytical and problem-solving skills.