Description:
The DevOps Engineer will be responsible for designing, implementing, and maintaining software solutions that manage server racks and data center infrastructure for AI accelerator products. This role involves close collaboration with internal software teams, ODMs, ISVs, and cross-functional engineering groups to deliver scalable, automated, and secure rack management systems.
Please note this role will require 5 days per week onsite in our Cork office*
Key Responsibilities
- Develop and maintain CI/CD pipelines for rack software deployment and updates.
- Automate provisioning, monitoring, and configuration of rack-scale infrastructure.
- Integrate rack management software with orchestration platforms (e.g., Foreman, OpenRMC).
- Implement APIs and automation tools to interface with rack hardware, sensors, and management systems.
- Troubleshoot and resolve issues related to rack software and integration.
- Collaborate with hardware engineers, ODMs, and ISVs to define software requirements and ensure compatibility.
- Support firmware and OS deployment across server racks in bare-metal environments.
Preferred Qualifications
- 5+ years of experience in DevOps, infrastructure automation, or hardware-related software development.
- Proficiency in Python, C++, and shell scripting.
- Experience with Redfish, IPMI, SNMP, and other hardware management protocols.
- Familiarity with DCIM tools (e.g., Foreman, Ansible).
- Strong understanding of networking, server hardware, and Linux/Windows OS environments.
- Excellent problem-solving, communication, and documentation skills.
- Experience with OpenRMC and rack-scale firmware integration.
- Knowledge of container orchestration (Kubernetes, Docker).
- Exposure to cloud platforms (AWS, Azure, GCP) and hybrid deployments.
- Background in AI/ML infrastructure or accelerator-based systems.
- Experience with monitoring tools (Prometheus, Grafana, ELK Stack).