Job Summary:
Squarepoint is seeking a Platform Specialist to join our global Platform Compute (PLC) team. This role is ideal for experienced engineers with a strong software development background who are passionate about building scalable, resilient infrastructure systems. You will architect, develop, and optimize compute platforms that support our high-performance, data-intensive workloads. You'll work closely with global engineering teams to design and implement infrastructure-as-code, observability pipelines, and self-healing systems. This is a hands-on engineering role with a strong emphasis on automation, performance tuning, and developer enablement.
Main Duties & Responsibilities:
- Architect and evolve scalable compute platforms using modern infrastructure engineering practices.
- Design and implement automation frameworks for provisioning, configuration management, and lifecycle operations.
- Develop internal tooling and APIs to abstract infrastructure complexity and improve productivity.
- Take part in bulk server provisioning which may include using remote hands to complete tasks.
- Drive observability initiatives by building and integrating telemetry pipelines (metrics, logs, traces).
- Collaborate with software engineering teams to ensure infrastructure supports application scalability, reliability, and security.
- Mentor junior engineers and contribute to engineering best practices across the team.
- Participate in on-call, incident response and postmortems, driving long-term improvements through automation and architectural changes.
Qualifications/Skills Desired:
- Expert-level Linux systems administration (RHEL/CentOS/Ubuntu)
- Configuration management (e.g., Ansible, Chef, SaltStack)
- Proficient in Python, Go, or Rust for infrastructure tooling
- Experience with AWS, GCP, or Azure (compute, storage, networking, IAM)
- Infrastructure-as-Code (e.g., Terraform, Pulumi)
- Monitoring and alerting (e.g., Prometheus, Grafana, Datadog, Zabbix)
- Logging and tracing (e.g., ELK stack, Fluentd, OpenTelemetry, Jaeger
- Identity and access management (e.g., LDAP, Kerberos, OAuth2)
- Cloud-native services (e.g., S3, EBS, GKE, EKS, Cloud Functions)
- Git-based workflows and version control
- Agile methodologies and DevOps culture
- Test-driven infrastructure development and automated testing frameworks
- Bachelor's or Master's degree in Computer Science, Engineering, or a related field
Nice to have:
- CI/CD pipelines (e.g., GitLab CI, Jenkins, ArgoCD)
- Hybrid cloud architecture and workload migration strategies
- Virtualization technologies (e.g., KVM, VMware, Hyper-V)
- Contributions to open-source infrastructure projects or internal developer platforms
- Containerization and orchestration (e.g., Docker, Kubernetes, Helm)
- Secure system design and hardening (e.g., SELinux, AppArmor, CIS benchmarks)
- Experience with Netbox using IPAM and custom modules