Summary
The Incident Response Analyst is a mid level role responsible for monitoring the performance of Cosm’s infrastructure and systems to ensure site reliability and operational continuity across live entertainment venues and live broadcast environments. This role operates in a real time, high pressure setting where rapid incident detection, triage, and coordination are critical.
A core responsibility is leading clear, structured verbal communication during highvisibility incidents, ensuring all stakeholders remain aligned on impact, current state, and next steps in real time.
Responsibilities
Monitor infrastructure, services, and applications to identify and resolve issues before they impact end users
Lead real time communication during major incidents, delivering clear, concise, and structured verbal updates on incident bridges
Rapidly establish and communicate situational awareness including impact, scope, timeline, and actions in progress
Own the current state narrative during incidents, ensuring alignment acrossOps Center, leadership, engineering, and field teams
Drive structured communication flow during incidents to maintain clarity, ownership, and decision making
Act as a primary point of contact for high impact incidents and escalations
Coordinate cross functional teams to drive timely mitigation and resolution
Ensure accurate incident tracking, documentation, and timeline management throughout the incident lifecycle
Lead clear end of shift handoffs, summarizing incident status, risks, and ongoing investigations
Partner with engineering on remediation and follow up actions
Generate incident reports and provide operational and trend analysis to stakeholders
Coordinate planned maintenance, upgrades, and outages across teams
Mentor junior team members and provide feedback to improve incident communication practices
Contribute to improving Ops Center tools, processes, and documentation
Collaborate with field services and customer support teams to improve overall reliability and customer experience
Experience
Bachelor’s degree in Computer Science, Information Technology, or related field
5+ years of experience in an Ops Center, incident management, or similar role
Proven experience leading real time verbal communication during high severity incidents, delivering clear and structured updates on incident bridges
Experience coordinating cross functional teams during major incidents
Proficiency with incident management and monitoring tools (e.g., Grafana, ServiceNow)
Strong analytical, communication, and problem solving skills
Ability to operate decisively in high pressure, real time environments
Experience managing multiple concurrent incidents
Familiarity with ITIL or similar incident management frameworks
Previous experience in a 24/7 operations environment
Experience communicating directly with customers or clients (bonus)
Work Environment
This is a live event role requiring flexibility to work irregular shifts, including early mornings, nights, weekends, and holidays based on operational needs.