Site Reliability Engineer (Senior)
Join us as a Senior Site Reliability Engineer to design, build, and scale the infrastructure behind large-scale, cloud-native systems. You’ll partner with experienced engineers to combine your understanding of distributed systems, security, and scalability with your expertise in Kubernetes and modern cloud platforms. Your work will help ensure our services are reliable, performant, and secure.
This is a role for senior, self-sufficient engineers who enjoy solving ambiguous problems, owning systems end-to-end, and mentoring peers as we push the limits of our product. As a senior member of the team, you will be responsible for whole projects with strategic value, bring clarity to areas of deeper complexity, and empower a team to do its best work. Your scope of work affects multiple projects or streams of work. In return, you can expect latitude in how you run projects and design systems, along with direct support, guidance, and coaching from Bit Complete’s Partners.
What You'll Be Doing
- System Design: Design, implement, and operate highly available, high-performance, and scalable systems in cloud-native environments.
- Kubernetes: Build and scale Kubernetes clusters and containerized workloads from the ground up.
- Infrastructure-as-Code: Develop and maintain infrastructure-as-code using tools like Terraform and Pulumi.
- Databases & Services: Maintain and optimize databases and backend services (PostgreSQL, DynamoDB, Redis) to meet evolving needs.
- Scalability & Observability: Lead initiatives on stateless architectures, CI/CD pipelines, and observability (Prometheus, Grafana, OpenTSDB, Envoy) to enhance scalability, maintainability, and reliability.
- Leadership & Mentorship: Provide guidance and mentorship to the infrastructure team, fostering a culture of collaboration and continuous improvement.
Your Background
- Programming Experience: Experience coding in at least one modern language such as Python, Go, or JavaScript. We value technology generalists who are curious and comfortable learning new areas of the stack, and we’ll support you as you grow into new technologies.
- SRE Expertise: A background in Site Reliability Engineering or a similar role, with hands-on experience scaling systems, contributing technical leadership, and designing distributed, cloud-native architectures.
- Cloud & Containers: Experience with cloud infrastructure and container orchestration, including AWS or GCP, Docker, Kubernetes, and Infrastructure-as-Code tools such as Terraform or Pulumi.
- Databases & Observability: Knowledge of databases and observability technologies such as PostgreSQL, DynamoDB, Redis, Prometheus, Grafana, OpenTSDB, plus solid Linux, debugging, and troubleshooting skills.
- Communication: Strong communication skills with the ability to work effectively with both technical and non-technical stakeholders.
About Us
At Bit Complete, we craft software solutions that make a difference, backed by tech veterans from YouTube, Slack and Thumbtack. With a team of 30 engineers, we tackle tough client challenges and run experiments through side projects.
We’re growing but staying true to our roots. Our focus is on creating a sustainable, profitable company that lets us do what we love while taking on projects that are challenging and interesting and that avoid harming the world. If you’re looking for work that you can genuinely care about, with a team that truly has your back, you’re in the right place. Learn more about our culture and how we see ourselves in the software services industry.
Benefits
- Work-life balance and the setup to do your best work: We believe in work that fits into your life, not the other way around. Enjoy four weeks of paid vacation, flexible hours, a MacBook Pro, $75/month internet reimbursement, and a $700/year stipend for your home office setup.
- No VC strings attached: We're profitable, bootstrapped, and committed to sharing that success with our team. Expect generous profit-sharing bonuses tied to the company’s performance.
- Comprehensive group benefits: Including drugs, paramedical practitioners, dental, vision care, virtual health care, virtual mental health care, and travel insurance.
- Top-up for parental leave
Compensation
CAD $148,988 - $249,260 annually.
Our ranges include base salary and a conservative bonus target.
Interested?
We're excited about working with you, so get in touch! Submit your application here.
The world of work today is overflowing with systems, processes, tools, and assumptions that are flawed and that can push directly against our ability to express what is unique about each of us in the work we do every day. We believe people from diverse backgrounds, with different identities and experiences, make our company better. No matter your background, we'd love to hear from you! Alignment with our values is just as important as experience. Also, please let us know if there are ways we can make our interview process better for you - we're always happy to listen and accommodate where possible.