Job Description
Job Summary
This position will be focused on the governance and oversight of the Business Resiliency Program, which includes traditional disaster recovery lifecycle activities, operational and technology resilience, and business continuity.Job Description
Required Qualifications & Skills
- Education: Bachelor’s degree in Information Technology, Computer Science, or a related field.
- Experience: 2–5 years of experience in IT disaster recovery, business continuity, or system administration.
- Technical Knowledge: Understanding of network architecture, cloud-based solutions (AWS, Azure), data backup solutions, and virtualization.
- Tools: Familiarity with business continuity management software (e.g., ServiceNow) and recovery concepts.
- Soft Skills: Strong analytical, project management, and communication skills; ability to work under pressure during crises
- Experience Desired: Load testing: Experience supporting load testing activities to validate system performance under expected and peak usage conditions.
Certifications desired:
Certified Business Continuity Professional (CBCP)
- Certified Information Systems Auditor (CISA)
- DRII Certifications (Disaster Recovery Institute International)
- ITIL Foundation
Key Responsibilities
- Strengthen the team’s cloud disaster recovery readiness by guiding IT teams grounded in hybrid/cloud DR principles and tools
- Risk Assessment & Mitigation: Conduct Business Impact Analysis (BIA) and risk assessments to identify vulnerabilities, critical systems, and potential threats.
- Testing & Exercises: Assist other team members to plan, coordinate, and execute testing activities. This includes developing test scenarios, validating recovery time objectives, and documenting test outcomes.
- Infrastructure & Backup Monitoring: Guide teams with best practices and adopted standards on data replication, backup strategies, and system health to ensure they meet defined Recovery Time Objectives (RTO) and Recovery Point Objectives (RPO).
- Incident Response: Participate in or lead response efforts during IT outages to restore functionality, acting as a liaison between technical teams and business units.
- Compliance & Reporting: Ensure plans comply with company standards and prepare reports on DR readiness for management and auditors.
Typical Daily Activities
- Providing support, direction to IT teams for recovery documentation, such as server, network, and application recovery guides.
- Meeting with application owners to discuss & plan DR exercises.
- Analyzing previous incidents to recommend and implement improvements.
