Site Reliability Engineer



Primary Responsibilities


- **Handling Major Incidents:**

- Manage Critical Issue Response System (CIRS) for major incidents.

- Provide frequent updates on CES-based CIRS until the issue is stabilized.

- Perform deep-dive troubleshooting on applications.


- **Preventive Actions and Requests:**

- Identify and create preventive action items for CIRS.

- Handle CIRS-based requests, including DFs, feature toggles, and deployments.

- Follow up on major production incidents to ensure resolution.


- **Monitoring and Planned Activities:**

- Use monitoring tools such as Dynatrace and Kibana.

- Drive and monitor planned activities.

- Write new monitoring scripts and enhance the existing monitoring scope.


- **Customer Escalations and Application Issues:**

- Handle customer escalations efficiently.

- Investigate application issues in depth to identify root causes.

- Create Splunk alerts based on CIRS learnings.

- Troubleshoot and coordinate customer escalations raised by Support and Engineering teams.


- **Ad-hoc Requests:**

- Address ad-hoc requests from CES teams.

Industry: Engineering
Occupational Category: Site Reliability Engineer
Job Location: Dublin, Ireland
Shift Type: Morning
Job Type: Full Time
Gender: No Preference
Career Level: Experienced Professional
Experience: 5 Years
Posted at: 2024-07-09 5:32 pm
Expires on: 2024-08-23