Sally Roth
Principal Site Reliability Engineer
resume.sallyroth.dev
Professional Summary
Seasoned Site Reliability Engineer with 16+ years of experience in DevOps, SRE, and infrastructure/platform engineering. Proven track record designing and maintaining large-scale, highly available cloud systems for companies including GoDaddy, Ripple, Oracle, Auth0, and GitHub. Expert in AWS and multi-cloud environments, automation (Terraform, Kubernetes, CI/CD pipelines), and observability tools (Prometheus, Grafana, Datadog) to ensure reliable and secure services.
Skills
- Cloud & Platforms: AWS (EC2, S3, Fargate, CloudWatch), Google Cloud Platform, Microsoft Azure
- Containers & Orchestration: Docker, Kubernetes
- Infrastructure as Code: Terraform, CloudFormation, Puppet, Chef, SaltStack
- CI/CD & Automation: Jenkins, GitHub Actions, GitLab CI, Slack (ChatOps), Capistrano, Hubot
- Monitoring & Observability: Prometheus, Grafana, ELK Stack, Loki, Datadog, Sumo Logic, Splunk
- Programming & Scripting: Python, Ruby (Rails), Go, Bash/Shell
- Databases: MySQL, MongoDB
- AI & Developer Tools: Claude Code (custom skills, agents, hooks, MCP servers), Prompt Engineering & AI-Assisted Development, OpenAI Codex CLI, Cursor, Windsurf
- OS & Tools: Linux (Ubuntu, CentOS), Git, Vault, Okta, LDAP
Work Experience
GoDaddy — Principal Site Reliability Engineer
Nov 2022 – Present
- Develop and maintain highly available Infrastructure as Code for GoDaddy's high-volume SMTP API platform.
- Leverage a hybrid cloud environment with AWS services (Fargate, Kinesis) and on-premise resources.
- Implement and enhance CI/CD pipelines using GitHub Actions.
- Collaborate with development teams to embed robust security and observability.
Ripple — Staff Technical Operations Engineer
Sep 2019 – Nov 2022
- Led multi-cloud observability platform (AWS/GCP) using Kubernetes, Prometheus, Grafana, Loki.
- Consulted development teams on instrumentation best practices.
- Consolidated SaaS monitoring tools into a unified observability stack.
- Led major RBAC redesign for customer-facing infrastructure.
Oracle Corporation — Site Reliability Engineer
May 2017 – Sep 2019
- Built developer infrastructure tools using Python, Ruby, Chef on CentOS.
- Designed internal PaaS with Prometheus, ELK stack, GitLab.
- Operations at massive scale (tens of thousands of nodes).
Auth0 — Production Engineer
Jun 2016 – May 2017
- ChatOps-driven CI/CD on AWS/Azure. MongoDB clusters with Terraform/SaltStack.
- Monitoring with Datadog and Kibana. 24/7 on-call rotations.
GoDaddy — Systems Engineer
Feb 2015 – Jun 2016
- ChatOps CI/CD pipeline for Email Marketing product.
- Puppet automation. Deployment pipeline for 4,000+ legacy MySQL instances.
GitHub, Inc. — Email Infrastructure Engineer
Oct 2013 – Dec 2014
- Scalable email infrastructure with Puppet, Rails, Bash.
- Bulk email delivery (PowerMTA, Postfix). Customer issue resolution via Splunk.
Mad Mimi, LLC — Chief of Email Infrastructure & Delivery
Nov 2009 – Oct 2013
- Led email deliverability and anti-abuse. Managed team of two.
- High-volume email infrastructure (Rails, PowerMTA, Postfix).
Education
B.S.E., Electrical Engineering concentration, Minor in Mathematics — Walla Walla University, 2009
Speaking
DevOps DC, PuppetConf 2015–2017, ScaleConf Colombia, RubyFuza (South Africa), SendGrid Training events
Community
Founded Engineers Without Borders chapter (WWU). Founded Phoenix Puppet Users Group. Active open source contributor.