SRE/Software Engineer

We are seeking a Senior Site Reliability Engineer (SRE) to join a team that works on a complex distributed architecture spanning physical machines, virtualized on-prem hosts, and cloud computing.

Type: Remote, Full-time

Job ID: JR - 52010

Technologies:
Linux
Python
K8S
IaC tools
Cloud/On-Prem Computing
Locations:
Argentina
Brazil
Mexico

Job Description

The customer develops and deploys systematic financial strategies across a variety of asset classes and global markets. We seek to produce high-quality predictive signals (alphas) through our proprietary research platform to employ financial strategies focused on exploiting market inefficiencies. Our teams work collaboratively to drive the production of alphas and financial strategies – the foundation of a sustainable, global investment platform.

The role is to help set up centralized DevOps and help existing teams adopt more centralized best practices. The ideal candidate will have the ability to manage complexity and tackle problems across multiple stack layers as a part of a small team championing operational excellence.

Our environment is relaxed yet intellectually intense. Our teams are lean and agile, which means rapid prototyping of products with immediate user feedback. We seek people who think in code, aspire to solve undiscovered computer science challenges, and are motivated by being around like-minded people. In fact, approximately 500 of our 600 employees globally code daily.

Requirements:

  • Linux systems: Deep expertise and hands-on experience working with Linux-based systems, with a focus on optimization and troubleshooting;
  • Python proficiency: Strong OOP and Python knowledge with hands-on experience in automation, scripting, and system management;
  • Containerization & orchestration: In-depth knowledge of container orchestration technologies such as Kubernetes (K8S). Experience with other cluster management tools like Slurm is a plus;
  • Infrastructure as Code (IaC): Hands-on experience with tools like Helm, Terraform, and Ansible;
  • Container technologies: Strong knowledge of containerization technologies (e.g., Docker and Podman) to ensure reliable and consistent deployments;
  • CI/CD pipelines: Experience building CI/CD pipelines on Git-based platforms, with GitLab preferred and GitHub also relevant, to ensure smooth and rapid delivery cycles;
  • Monitoring & logging: Experience with monitoring and logging solutions such as Prometheus, Grafana, and the ELK stack to provide comprehensive insights into system performance and health;
  • Databases: Understanding of relational and NoSQL databases, their performance tuning, and management in distributed systems (e.g., PostgreSQL, DynamoDB, Cassandra);
  • Agile development: Familiarity with Agile development methodologies, with a focus on continuous improvement and collaboration;
  • Cloud experience: Exposure to cloud technologies such as AWS or Google Cloud (GCP) is a strong plus;
  • Collaboration & communication: A team-first attitude with excellent verbal and written communication skills in English, able to work collaboratively with peers across the organization.

Responsibilities:

  • Architecture and automation: Design and deploy as-a-service solutions using open-source software to automate system management, scaling, and monitoring;
  • System optimization: Develop tools to streamline deployment, monitoring, and incident management for large-scale, distributed environments;
  • Collaboration across teams: Work with Development and Operations teams to design and implement software solutions that enhance service reliability;
  • Monitoring & incident response: Set up, configure, and maintain monitoring and alerting systems to ensure real-time visibility into our systems;
  • Participate in on-call rotations;
  • Contribute to the ongoing DevOps/agile transformation;
  • Leverage container orchestration tools (Kubernetes);
  • Use cloud infrastructure (AWS, GCP, Azure, etc.) and IaC tools (Helm, Ansible, Terraform) to ensure fast, safe and reliable deployments.

What We Offer:

  • USD monthly payment;
  • 100% remote opportunity;
  • 10 business days of paid vacation per year (can be taken after 6 months at Customertimes);
  • Up to 10 national holidays (either US or country of residence);
  • 5 personal days off (can be taken after 3 months at Customertimes);
  • Travel expenses are covered when applicable;
  • Referral program;
  • Paid certification program;
  • Tailored personal development plan (PDP);
  • Access to a language learning platform.
