Site Reliability Engineer

 

Description:

At F5, we strive to bring a better digital world to life. Our teams empower organizations across the globe to create, secure, and run applications that enhance how we experience our evolving digital world. We are passionate about cybersecurity, from protecting consumers from fraud to enabling companies to focus on innovation.


Everything we do centers around people. That means we obsess over how to make the lives of our customers, and their customers, better. And it means we prioritize a diverse F5 community where each individual can thrive.

Join the team developing the software that powers half the internet!

NGINX, now part of F5, is the product group behind the popular open-source NGINX web server project. In addition to NGINX Open Source, we create open source and commercial technologies that provide developers and enterprises alike with the traffic management solutions they need. NGINX traffic management technologies include the load balancer/reverse proxy and web server we’re known for around the world, as well as API gateway, app server, WAF and DoS protection, ingress controller, and service mesh.

Our teams are highly collaborative and innovative. We focus on solving the real problems our customers face when they run large-scale applications. If you want to be challenged and have the freedom to learn from your mistakes, NGINX is the team for you.

Position Summary
In this position, you will play a key role in building and maintaining automation tools, services and processes for operations support, incident handling, monitoring and alerts and automation to support our world-class SaaS products. You will champion efforts to improve support, security, reliability, and efficiency in these environments, as well as explore and lead efforts towards new strategies and architectures for CI/CD pipeline services, infrastructure, and tooling. When necessary, you are comfortable wearing a developer hat to build a solution. You are passionate about automation and tools.

Primary Responsibilities

  • Work with engineering team to design, build, maintain and support SaaS-based services. This includes performing first-level of triage for issues reported from multiple sources (monitoring and alerting systems, customer bugs, support issues).

  • Ensure that the SaaS services and associated infrastructure maintain required levels of security, availability, reliability, scalability, and performance to meet SLAs.

  • Build incident management, operational monitoring, and alerting capabilities to proactively report, troubleshoot, and fix problems.

  • Assist in achieving and maintaining industry security audit certifications (ISO, SOC2, HIPAA, etc.).

  • Build automation around the infrastructure and services used in product development, testing, and CI/CD pipelines.

  • Research modern technology areas, innovations, and ideas.

Knowledge, Skills, and Experience

  • Experience setting up and using incident and on-call management systems like PagerDuty.

  • Experience setting up and building tools to collect and visualize data (logs, metrics, alerts), building dashboards, alerting, and monitoring systems.

  • Experience with deploying secure infrastructure and services in one or more cloud environments such as Azure or AWS.

  • Experience with configuration management and deployment automation tools, such as Terraform, Ansible, Packer, etc.

  • Proficiency in scripting languages such as Python and Bash.

  • Strong understanding and experience working with CI/CD pipeline frameworks (Gitlab-CI, GitHub Actions, Jenkins, etc.).

  • Experience with container (Docker) and orchestration systems (Kubernetes).

  • Solid understanding of Linux OS + systems administration skills

  • Good understanding of networking fundamentals: TCP/IP, HTTP, DNS, load balancing, firewalling, etc.

  • Excellent analytical and trouble-shooting skills.

  • Dynamic collaborator who thrives in diverse, geographically distributed locales.

  • Team player that demonstrates diplomacy, promotion of sound ideas & concepts, paired with the desire to help others grow their skills.

  • Strong verbal and written communication skills.

  • Experience with NGINX technologies a strong plus.

 

Organization f5
Industry IT / Telecom / Software
Occupational Category Site Reliability Engineer
Job Location Cork,Ireland
Shift Type Morning
Job Type Full Time
Gender No Preference
Career Level Intermediate
Experience 3 Years
Posted at 2022-11-03 11:17 am
Expires on Expired