Utah IT Jobs

Mobile utah department of workforce services Logo

Job Information

Ellie Mae Sr Manager, SRE 210341 in Provo, Utah

Sr Manager, SRE 210341

Provo, UT /

Technology & Operations – Cloud Operations /

Full time

Ellie Mae is the leading cloud-based platform provider for the mortgage finance industry. Ellie Mae’s technology solutions enable lenders to originate more loans, reduce origination costs, and reduce the time to close, all while ensuring the highest levels of compliance, quality and efficiency. Visit‪ EllieMae.comto learn more.

The SRE manager drives strategic and tactical management of Production Incidents, improve production Quality of Service and metrics for Availability, Scalability, Performance, etc. This is a fantastic opportunity to work and collaborate closely with our Tech Support, Software Engineering, Architecture, Infrastructure, DBA, SRE, DevOps and Cloud Platform teams at Ellie Mae. Partner with our Site Reliability Engineers (SRE) who are responsible for ensuring Ellie Mae services are highly available, reliable, secure and scalable.

Responsibilities & Objectives

  • Build and manage a team of highly efficient and motivated SRE engineers disturbed globally.

  • Partner with other SREs and lead by example - Contributor more than a delegator.

  • Mentor the team and help the elevate their skills to develop a strong high performing team with an open culture.

  • Set Strategic and Operational goals for team, and work with team to deliver on goals. (SMART Goals)

  • Work with Cloud Platform and Operations leaders to develop narratives, backlog grooming, epic planning and overall sprint planning processes

  • Employ deep troubleshooting skills to improve the availability, performance, and security of Ellie Mae/IMT Services.

  • Ideation and Hands-On expertise on Coding and Automation of Applications on Cloud Platform

  • Implement automated tests, automated deployments, and operational tools

  • Collaborate with Product and Support teams to plan and deploy product releases

  • Work with Engineering leadership to build shared services that meet the requirements and need of the platform and application teams

  • Implementation of true Observability & VALET based Monitoring.

  • Define non-functional requirements as part of the product lifecycle to influence the new

  • designs, standards, and methods for scalable, highly available distributed systems

  • Contribute to product development / engineering as needed to ensure Quality of Service

  • of Highly Available services.

  • Good to Have :)

  • You can manage stress and be calm when racing against time

  • You have read google SRE book few times :)

  • You love to find problems and have mentality to get restless until it is resolved

  • Your team members love to move with you

Requirements

  • 5 years of experience building and managing an SRE / Operations team managing customer facing production environments.

  • 10+ years of Systems/Applications automation in 24x7 Production Services environments and being a Technology Leader

  • BS in Computer Science, Computer Engineering, Math, or equivalent professional experience

  • Fluency with one or more current generation scripting language used by DevOps professionals (Python, Perl, PHP, Ruby) + Java Development and/or .NET

  • Excellent troubleshooter, utilizing a systematic problem-solving approach

  • Demonstrated experience in designing, analyzing, and diagnosing large-scale distributed

  • systems + Windows Server and/or Linux systems internals (system libraries, file systems, client-server protocols)

  • Experience with elastically scalable, fault tolerance and other cloud architecture patterns

  • Experience operating on AWS (both PaaS and IaaS offerings)

  • Experience in both Windows (2k8R2+) and Linux (centos) + Security triage & forensic analysis

  • Experience with Continuous Integration and Continuous Delivery concepts, including Infrastructure as code utilizing tools like Terraform, Cloudformation and Chef/SaltStack

  • Expert in Containerization concepts like Docker, and PaaS services on AWS.

Good to Have :)

  • You can manage stress and be calm when racing against time

  • You have read google SRE book few times :)

  • You love to find problems and have mentality to get restless until it is resolved

  • Your team members love to move with you

#LI-TM1

Ellie Mae is an equal opportunity and affirmative action employer. Women, minorities, people with disabilities, and veterans are encouraged to apply.

We do not accept resumes from headhunters, placement agencies, or other suppliers that have not signed a formal agreement with us.

DirectEmployers