Senior Systems Engineer - Observability & Infra Automation (Central Infrasecurity)
Job Description
Role & Responsibilities
- Manage the Elastic SaaS platform: availability, security (RBAC), ILM policies, performance tuning, autoscaling, upgrades, DR testing, patching, and cost optimization via tiered storage
- Operate Logstash servers, Elastic Agents/policies, and supporting infrastructure such as Nutanix hypervisors and F5 load balancers
- Maintain observability pipelines (data ingestion, parsing, normalization) along with dashboards, alerting rules, and Kibana visualizations
- Integrate the observability stack with automation platforms (e.g., Ansible), ServiceNow for incident workflows, and AIOps tooling for event correlation and self-healing remediation
- Troubleshoot ingestion issues, conduct root cause analysis, and collaborate on resiliency metrics and SLOs
- Collaborate with development and operations teams to instrument applications and infrastructure for better visibility
- Document processes, configurations, and best practices to ensure knowledge sharing and continuity