Site Reliability Engineer

  • Australia
  • Sydney
  • Permanent
  • + ESOP

Our client:

Our client is one of the fastest-growing AI startups globally, building a new category of interactive entertainment. With 20M+ users in year one, a profitable and bootstrapped model, and a high-performing team, they’ve scaled rapidly and are now pushing toward hundreds of millions of users.

This is a rare opportunity to join early and help scale a product aiming to redefine how people engage with digital content.

The role:

This is the first dedicated Site Reliability Engineer, responsible for owning platform reliability, performance, and scalability. You’ll play a critical role in shaping infrastructure decisions, improving system resilience, and ensuring the platform can scale seamlessly as usage accelerates.

Key responsibilities:

  • Own reliability, uptime, and performance across core systems
  • Improve uptime and reduce RTO across critical services
  • Orchestrate and optimise GPU clusters handling high-volume AI workloads
  • Build and enforce observability across metrics, tracing, and alerting
  • Define and maintain SLOs across the platform
  • Optimise AWS infrastructure for performance and cost efficiency
  • Lead incident response and drive root cause resolution

Skills and experience:

  • 5+ years operating production systems at scale
  • Strong AWS experience (infra-as-code, high-scale compute, Kubernetes/ECS)
  • Deep experience in observability, monitoring, and incident response
  • Strong CI/CD and deployment pipeline knowledge
  • Ability to write code and solve root causes, not just symptoms
  • Familiarity with modern stacks (TypeScript, React, Postgres, AWS etc.)
  • Thrives in high-growth environments

Benefits and additional information:

  • Extremly competitive salary with meaningful upside
  • Company card for food, coffee, tools, and productivity
  • Daily team lunch and dinner
  • Unlimited workspace budget
  • High ownership, high impact from day one

How to apply:

If you’re looking to own reliability at scale and build systems from the ground up in a high-growth AI environment, apply now or reach out to Sienna at Talent International for a confidential discussion.

Apply now

Submit your details and attach your resume below. Hint: make sure all relevant experience is included in your CV and keep your message to the hiring team short and sweet - 2000 characters or less is perfect.