Sally Roth

Principal Site Reliability Engineer

resume.sallyroth.dev

Professional Summary

Seasoned Site Reliability Engineer with 16+ years of experience in DevOps, SRE, and infrastructure/platform engineering. Proven track record designing and maintaining large-scale, highly available cloud systems for companies including GoDaddy, Ripple, Oracle, Auth0, and GitHub. Expert in AWS and multi-cloud environments, automation (Terraform, Kubernetes, CI/CD pipelines), and observability tooling (Prometheus, Grafana, Datadog), applied to keep services reliable and secure.

Skills

Cloud & Platforms: AWS (EC2, S3, Fargate, CloudWatch), Google Cloud Platform, Microsoft Azure
Containers & Orchestration: Docker, Kubernetes
Infrastructure as Code: Terraform, CloudFormation, Puppet, Chef, SaltStack
CI/CD & Automation: Jenkins, GitHub Actions, GitLab CI, Slack (ChatOps), Capistrano, Hubot
Monitoring & Observability: Prometheus, Grafana, ELK Stack, Loki, Datadog, Sumo Logic, Splunk
Programming & Scripting: Python, Ruby (Rails), Go, Bash/Shell
Databases: MySQL, MongoDB
AI & Developer Tools: Claude Code (custom skills, agents, hooks, MCP servers), Prompt Engineering & AI-Assisted Development, OpenAI Codex CLI, Cursor, Windsurf
OS & Tools: Linux (Ubuntu, CentOS), Git, Vault, Okta, LDAP

Work Experience

GoDaddy — Principal Site Reliability Engineer

Nov 2022 – Present

Develop and maintain highly available Infrastructure as Code for GoDaddy's high-volume SMTP API platform.
Leverage a hybrid cloud environment with AWS services (Fargate, Kinesis) and on-premises resources.
Implement and enhance CI/CD pipelines using GitHub Actions.
Collaborate with development teams to embed robust security and observability.

Ripple — Staff Technical Operations Engineer

Sep 2019 – Nov 2022

Led multi-cloud observability platform (AWS/GCP) using Kubernetes, Prometheus, Grafana, and Loki.
Consulted development teams on instrumentation best practices.
Consolidated SaaS monitoring tools into a unified observability stack.
Led major RBAC redesign for customer-facing infrastructure.

Oracle Corporation — Site Reliability Engineer

May 2017 – Sep 2019

Built developer infrastructure tools using Python, Ruby, and Chef on CentOS.
Designed internal PaaS with Prometheus, ELK Stack, and GitLab.
Operated at massive scale (tens of thousands of nodes).

Auth0 — Production Engineer

Jun 2016 – May 2017

ChatOps-driven CI/CD on AWS/Azure. MongoDB clusters with Terraform/SaltStack.
Monitoring with Datadog and Kibana. 24/7 on-call rotations.

GoDaddy — Systems Engineer

Feb 2015 – Jun 2016

ChatOps CI/CD pipeline for Email Marketing product.
Puppet automation. Deployment pipeline for 4,000+ legacy MySQL instances.

GitHub, Inc. — Email Infrastructure Engineer

Oct 2013 – Dec 2014

Scalable email infrastructure with Puppet, Rails, Bash.
Bulk email delivery (PowerMTA, Postfix). Customer issue resolution via Splunk.

Mad Mimi, LLC — Chief of Email Infrastructure & Delivery

Nov 2009 – Oct 2013

Led email deliverability and anti-abuse. Managed team of two.
High-volume email infrastructure (Rails, PowerMTA, Postfix).

Education

B.S.E., Electrical Engineering concentration, Minor in Mathematics — Walla Walla University, 2009

Speaking

DevOps DC, PuppetConf 2015–2017, ScaleConf Colombia, RubyFuza (South Africa), SendGrid Training events

Community

Founded Engineers Without Borders chapter (WWU). Founded Phoenix Puppet Users Group. Active open source contributor.