We use cookies. Find out more about it here. By continuing to browse this site you are agreeing to our use of cookies.
#alert
Back to search results
New

Site Reliability Engineer

Empower Retirement
$87,400.00 - $123,400.00
life insurance, parental leave, paid time off, tuition reimbursement, 401(k)
8525 East Orchard Road (Show on map)
Mar 04, 2026

Our vision for the future is based on the idea that transforming financial lives starts by giving our people the freedom to transform their own. We have a flexible work environment, and fluid career paths. We not only encourage but celebrate internal mobility. We also recognize the importance of purpose, well-being, and work-life balance. Within Empower and our communities, we work hard to create a welcoming and inclusive environment, and our associates dedicate thousands of hours to volunteering for causes that matter most to them.

Chart your own path and grow your career while helping more customers achieve financial freedom. Empower Yourself.

***Applicants must be authorized to work for any employer in the U.S. We are unable to sponsor or take over sponsorship of an employment visa at this time, including CPT/OPT.***

We are seeking a Site Reliability Engineer (SRE) to own the reliability, availability, and operational excellence of our AWS-based data platform. This role is focused on applying core SRE principles - production engineering, incident management, root cause elimination, observability, automation, and capacity planning - to large-scale data infrastructure supporting EMR, EMR Serverless, Redshift, DynamoDB, and S3.

You will treat data pipelines and analytics platforms as production systems, designing and enforcing SLAs/SLOs for uptime, performance, scalability, and data freshness. You will lead incident response, perform deep root cause analysis, implement durable fixes, and eliminate toil through automation and infrastructure-as-code.

What you will do:

  • Own and improve the reliability, stability, scalability, and performance of our core data platforms and services

  • Provide operational support for large-scale, distributed data systems, ensuring high availability and strong SLAs

  • Partner closely with full-stack, data, and platform engineering teams to deliver continuous improvements

  • Operate and support EMR and EMR Serverless (Python/Spark) workloads and data pipelines

  • Support and optimize Amazon Redshift and DynamoDB in high-throughput, production environments

  • Design, build, and evolve monitoring, alerting, and observability frameworks with a focus on symptoms, not just outages

  • Lead incident response, troubleshooting production issues across the full stack and coordinating with internal and external stakeholders

  • Perform root cause analysis (RCA) and readiness reviews; turn findings into durable fixes and automation

  • Create and maintain runbooks, SOPs, and operational documentation

  • Collaborate with engineering teams to optimize performance, reliability, and cost

  • Participate in an on-call rotation to respond to incidents impacting customer-facing systems

  • Recommend and influence the use of AWS managed services and architectural patterns

  • Continuously evaluate system performance, capacity, and cost to scale efficiently

What you will bring:

  • 4-6 years of experience building or operating systems across multiple architecture domains: application, data, integration, infrastructure, and security

  • 4+ years of hands-on AWS experience, with strong production exposure to several of the following:

  • Redshift, DynamoDB, EMR, EMR Serverless, EC2, S3

  • Lambda, Step Functions, EventBridge, RDS, IAM

  • Proven experience operating data platforms such as data lakes and data warehouses in production

  • Strong SQL skills and experience working with modern databases (e.g., Redshift, DynamoDB, Postgres, MySQL, Oracle)

  • 4+ years of Python experience, including scripting, automation, or data workloads

  • Experience with CloudWatch, infrastructure monitoring, and alerting

  • Hands-on experience with incident management, uptime SLAs, and customer-impacting systems

  • Strong understanding of Git-based workflows (GitHub, Git Flow, or similar)

  • Experience working in Agile environments (Scrum / Kanban) using tools such as Jira and Confluence

  • Bachelor's in Computer Science, Information Systems, Data/Analytics, or related; equivalent practical experience welcomed.

What will set you apart:

  • Experience with Terraform or other Infrastructure-as-Code tools

  • Exposure to Snowflake or experience supporting analytics platforms beyond Redshift

  • Experience in financial services or other highly regulated environments

  • Knowledge of DevOps and CI/CD best practices

  • Familiarity with observability tools such as Splunk, AppDynamics, or advanced CloudWatch usage

  • Comfortable working across Linux/Unix environments

  • Strong communication skills during incident response with both technical and non-technical stakeholders

  • Security-minded approach to building secure, reliable, and durable systems

  • Willingness to support occasional off-hours or weekend incidents as part of on-call responsibilities

  • Streaming/event pipelines (Kafka/Kinesis), CDC patterns, and backfill strategies.

  • Experience with OpenLineage/Marquez and catalog integrations (Collibra/Alation/Purview).

  • Prior FinOps or capacity-planning ownership for data platforms.

  • Familiarity with BI semantic layers and contract enforcement at consumption (Looker/Power BI/Tableau).

Work conditions

Participate in an on-call rotation; occasional change windows outside business hours to support safe releases and resiliency drills.

This job description is not intended to be an exhaustive list of all duties, responsibilities and qualifications of the job. The employer has the right to revise this job description at any time. You will be evaluated in part based on your performance of the responsibilities and/or tasks listed in this job description. You may be required perform other duties that are not included on this job description. The job description is not a contract for employment, and either you or the employer may terminate employment at any time, for any reason.

What we offer you

We offer an array of diverse and inclusive benefits regardless of where you are in your career. We believe that providing our employees with the means to lead healthy balanced lives results in the best possible work performance.

  • Medical, dental, vision and life insurance
  • Retirement savings - 401(k) plan with generous company matching contributions (up to 6%), financial advisory services, potential company discretionary contribution, and a broad investment lineup
  • Tuition reimbursement up to $5,250/year
  • Business-casual environment that includes the option to wear jeans
  • Generous paid time off upon hire - including a paid time off program plus ten paid company holidays and three floating holidays each calendar year
  • Paid volunteer time - 16 hours per calendar year
  • Leave of absence programs - including paid parental leave, paid short- and long-term disability, and Family and Medical Leave (FMLA)
  • Business Resource Groups (BRGs) - BRGs facilitate inclusion and collaboration across our business internally and throughout the communities where we live, work and play. BRGs are open to all.

Base Salary Range

$87,400.00 - $123,400.00

The salary range above shows the typical minimum to maximum base salary range for this position in the location listed. Non-sales positions have the opportunity to participate in a bonus program. Sales positions are eligible for sales incentives, and in some instances a bonus plan, whereby total compensation may far exceed base salary depending on individual performance. Actual compensation offered may vary from posted hiring range based upon geographic location, work experience, education, licensure requirements and/or skill level and will be finalized at the time of offer.

Equal opportunity employer * Drug-free workplace

We are an equal opportunity employer with a commitment to diversity. All individuals, regardless of personal characteristics, are encouraged to apply. All qualified applicants will receive consideration for employment without regard to age (40 and over), race, color, national origin, ancestry, sex, sexual orientation, gender, gender identity, gender expression, marital status, pregnancy, religion, physical or mental disability, military or veteran status, genetic information, or any other status protected by applicable state or local law.

***For remote and hybrid positions you will be required to provide reliable high-speed internet with a wired connection as well as a place in your home to work with limited disruption. You must have reliable connectivity from an internet service provider that is fiber, cable or DSL internet. Other necessary computer equipment, will be provided. You may be required to work in the office if you do not have an adequate home work environment and the required internet connection.***

Job Posting End Date at 12:01 am on:

03-05-2026

Want the latest money news and views shaping how we live, work and play? Sign up for Empower's free newsletter and check out The Currency.

Applied = 0

(web-6bcf49d48d-ksmjz)