Utah IT Jobs

Mobile utah department of workforce services Logo

Job Information

NICE Systems, Inc. Site Reliability Engineer in SALT LAKE CITY, Utah

Site Reliability Engineer Location: Salt Lake City, UT Site Reliability Engineer will work with the Trust Office team and is responsible for leading efforts across the organization within dynamic platform and service performance availability controls. Our platform runs with at an incredibly high percentage uptime and is mission critical for our business operations and customer success. Be a critical member of our team by managing our platform's and product service's continuous availability to keep our environment reliable and resilient.Provides technical expertise in all aspects computing and networking uptime calculations, performance assurances, and elastic redundant systems. This position will engineer, create, coordinate projects, document, present, and processes platform and product data to ensure high infrastructure uptime and services performance availability.This role is responsible for the infrastructure availability management, resiliency calculations, network architecture design, software engineering in cloud environments, configuration management, automation development, and statistical modeling.As a Site Reliability Engineer, a Typical Day Might Include the Following: *Develop reliability and resilient calculable models to sustain enterprise infrastructure availability. *Oversee and manage platform and product uptime reporting program. *Work with system and net operations and product teams to develop and deploy statistical models to establish monitoring and alerting strategy. *Develop and ensure the scalability, performance, and resiliency of multiple hosting platforms and product lines. *Build, test, and deploy multi-levels of automation and replication controls to seamlessly deliver fault-tolerant availability of datacenter and cloud environments. *Restore healthy operation of platform functionality, applications, and services through sustainable incident response operations. *Design and implement server provisioning and processing scripts for secure, reliant, and continuous data-flow. *Conduct and manage reliability and system performance testing; work closely with risk management teams to conduct reliability, resilient regression, and replication testing. *Analyze, troubleshoot, and solve product performance concerns to provision continuous and sustained customer-to-product availability. *Maintain the highest level of personal certification, integrity and objectivity, following the company Code of Ethics and Nice inContact policies and procedures at all times. To Land This Gig You'll Need: *Bachelor's degree in Business Information Systems, Economics, Statistics, Computer Engineering, Computer Science, Information Systems Security or related field or equivalent work experience required. *5+ years of site reliability engineering experience. *3+ years in or with cloud information systems replication and business continuity operations. *Experience building fault-tolerant system and application performance calculations across cloud environments. *Extensive experience with Chef, Docker, and Kubernetes. *Extensive experience coordinating with multi-disciplinary engineering teams throughout incident response operations. *Excellent skills in risk assessment processes, policy development, proposals, work statements, product evaluations, and delivery of software. *Experience in scripting languages/tools such as PowerShell, Python, Bash, and JavaScript. *Demonstrable skills in innovating with intent to improve reliability and efficiency. *Experience analyzing and documenting post-mortem results; strong written skills are a must to deliver effective communication content throughout the company. *Strong software development and deployment skills to automate continuous availability cloud requirements. *Demonstrates a strong ability to follow best practices within software development lifecycles, to include integrated security and configuration testing through continuous implementation frameworks. *Ex