Staff Platform Engineer (LATAM)
Job Description
Remote Work from (Brazil, Peru, Chile, Argentina)
- Conduct performance testing and database health & tuning.
- Maintain and enhance our Kubernetes clusters.
- Improve monitoring systems and update alerts.
- Automate deployments and developer environments.
- Explore and implement CI/CD options.
- Maintain developer environments and CI pipelines.
- Provide on-call support to ensure system reliability and performance. (On-call rotations are designed to be fair and prevent burnout).
- Perform backend feature work and integrations using Python/Django
- Database Tuning/Monitoring (e.g adding/removing indexes, partitioning, sharding)
- Introducing/Training the usage of technologies used for scaling (e.g DynamoDB, Redis, ElasticSearch)
Tech stack
Frontend - ReactJS, Redux, RTK Query, Backend - Django, DRF, Postgres, DevOps - Docker, Github Actions, Terraform, AWS, Kubernetes, Helm, Redis.
Employee benefits:
Remote-first company (Remote work)
Competitive salary with employee stock options, bonus and access to a 401K
Medical, dental and vision coverage
Healthcare, 401K, flexible PTO.
Who You Are
Backend Engineer Expertise: (All must have experience)
You have experience as Staff Engineer.
Python/Django.
Python + DB Performance optimization.
API / SFTP Integrations.
Feature Development experience.
DevOps/SRE Expertise:
Will need to be comfortable with AWS, Maintaining EKS (Kubernetes on AWS).
DevOps/SRE Skills:
Docker, Kubernetes, Helm, Terraform, Github Actions, AWS, Postgres, Redis.
They don't need to have DevOps/SRE title, but from their resume they should have some of the DevOps keywords listed right above.
A lot of early startups don't have dedicated SREs but a senior BE architect/engineer is usually responsible for the DevOps work.
Comfortable with:
Scalable & Maintainable Backend Feature Development [Experience is Required]
Scalable & Maintainable API/SFTP Integrations [Experience is Required]
Monitoring and troubleshooting Postgres DB (adding/removing indexes, partitioning/sharding)
Maintaining developer environments.
Managing CI pipelines and runners.
Setting up and managing alerting systems.
Providing on-call support and guidance to other developers to triage issues.
Impact and Expectations:
Contribute to key performance indicators (KPIs) such as system uptime, performance metrics, and compliance standards.
Make a significant impact within the first 6-12 months by leading initiatives to improve infrastructure and operational practices.
Interview process:
1.Intro Call with Hiring Manager.
2.Optional Take Home Exam.
3.Remote Onsite (2 Technical Interviews) (2.5 hours)
4. Final Round (Either Technical Interview / Work Trial)