Senior Site Reliability Engineer
2 weeks ago
Establish and refine deployment patterns and tooling for use by feature development teams, targeting Kubernetes / EKS, ECS, and Lambda, running on AWS.
Write Terraform, yaml, and scripts to fully automate the deployment of services and environments.
Continuously improve the observability of our tech stack by evolving the use of logs, metrics, tracing, dashboards, alerts, and integration of metrics with CD pipelines.
Partner with other engineers to troubleshoot issues in production and other environments, identifying opportunities to improve automation.
Migrate legacy technologies and environments to EKS or ECS on fully automated environments.
Troubleshoot issues in production and other environments, opportunistically improve observability and automation.