Site Reliability Engineer (Intermediate)
Join us as an Intermediate Site Reliability Engineer helping build reliable, scalable cloud infrastructure. You’ll work alongside senior engineers to own projects, deepen platform skills, and support teams operating large distributed systems.
You’ll focus on one of three streams: Kubernetes, Observability, or Developer Experience.
What you'll be doing
- Improve infrastructure reliability, scale, and security across cloud-native systems.
- Deliver features and upgrades through infrastructure-as-code.
- Collaborate with product teams on debugging, migrations, and operational readiness.
- Support incident response, capacity planning, and performance improvements.
- Automate repeatable workflows to reduce operational load across engineering.
Stream Focus Areas
Kubernetes Platform
You’ll help operate and evolve shared Kubernetes platforms used by many product teams.
Typical work:
- Maintain and upgrade clusters, networking, ArgoCD, and IaC patterns.
- Build or extend reusable infra modules (XRDs, Helm, Terraform) to standardize onboarding.
- Partner with teams to plan and execute migrations safely
- Handle inbound maintenance, patching, and legacy stack stability work.
Ideal tools: Kubernetes, Terraform, ArgoCD, Atlantis, Helm, AWS/GCP, Postgres/Redis basics.
Observability Platform
You’ll help deliver a modern telemetry platform powering metrics, logs, and traces for engineering teams.
Typical work:
- Build and operate OTEL-based telemetry pipelines across environments.
- Support migrations to VictoriaMetrics and maintain data accuracy during transitions.
- Improve SLOs, alerting strategies, and reliability of observability systems.
- Contribute to IaC automation for observability deployments.
Ideal tools: OTEL, Prometheus, VictoriaMetrics, VM Alert, Grafana, Terraform, GitHub Actions.
Developer Experience / CI/CD
You’ll help maintain and strengthen the CI/CD ecosystem powering builds, tests, and deployments.
Typical work:
- Maintain pipelines, update dependencies, and improve the reliability of GitHub Actions.
- Migrate workloads away from legacy tooling to a new Tailscale / OIDC-based platform.
- Triage support requests, follow runbooks, and assist product teams during migrations.
- Reduce operational load by standardizing patterns and supporting migrations.
Ideal tools: GitHub Actions, Docker, Tailscale, Terraform, and container registry best practices.
Your Background
- 3 - 5 years of experience as an SRE. Minimum 1+ years as a software engineer.
- Keen to deepen your software engineering skills and play a bigger role in how our systems are built and operated.
- Comfortable writing and debugging code in Go, Python, or a similar language.
- Curious about platform reliability, excited to learn deeper system internals over time.
- Communicate clearly with engineers across teams and time zones.
- Focus on automation, reproducibility, and practical reliability over “heroics.”
- Bring some experience in cloud infrastructure and want to grow into owning larger systems.
About Us
At Bit Complete, we craft software solutions that make a difference, backed by tech veterans from YouTube, Slack and Thumbtack. With a team of 30 engineers, we tackle tough client challenges and run experiments through side projects.
We’re growing but staying true to our roots. Our focus is on creating a sustainable, profitable company that lets us do what we love while taking on projects that are challenging, interesting, and avoid harming the world. If you’re looking for work that you can genuinely care about, with a team that truly has your back, you’re in the right place. Learn more about our culture and how we see ourselves in the software services industry.
Benefits
- Work-life balance and the set-up to do your best work: We believe in work that fits into your life, not the other way around. Enjoy four weeks of paid vacation, flexible hours, a MacBook Pro, $75/month internet reimbursement, and a $700/year stipend for your home office setup.
- No VC strings attached: We're profitable, bootstrapped, and committed to sharing that success with our team. Expect generous profit-sharing bonuses tied to the company’s performance.
- Comprehensive group benefits: Including drugs, paramedical practitioners, dental, vision care, virtual health care, virtual mental health care, and travel insurance.
- Top-up for parental leave
Compensation
CAD $117,610 - $158,240 annually.
Our ranges include base salary and conservative bonus target.
Interested?
We're excited about working with you, so get in touch! Submit your application here.
The world of work today is overflowing with systems, processes, tools, and assumptions that are flawed and that can push directly against our ability to express what is unique about each of us in the work we do every day. We believe people from diverse backgrounds, with different identities and experiences, make our company better. No matter your background, we'd love to hear from you! Alignment with our values is just as important as experience. Also, please let us know if there are ways we can make our interview process better for you - we're always happy to listen and accommodate where possible.