Site Reliability Engineer (SRE) Job at innovitusa, Des Moines, IA

ZlZEYmlyenBHZUoyRW1TcUhVdVVEekQ5N2c9PQ==
  • innovitusa
  • Des Moines, IA

Job Description

Hiring: W2 Candidates Only
Visa: Open to any visa type with valid work authorization in the USA

Summary
A Site Reliability Engineer (SRE) is responsible for ensuring the reliability scalability and performance of software systems and infrastructure. This role bridges the gap between development and operations by applying software engineering principles to IT operations automating processes and monitoring system health to prevent downtime and improve system efficiency.

Key Responsibilities

  • Design implement and maintain reliable scalable and highly available infrastructure and services.
  • Monitor system performance availability and capacity; respond proactively to incidents and outages.
  • Develop and maintain automation tools for deployment monitoring and infrastructure management.
  • Collaborate with software engineers to design systems with reliability and maintainability in mind.
  • Troubleshoot debug and resolve complex production issues across multiple systems and services.
  • Implement and maintain CI/CD pipelines configuration management and version control best practices.
  • Conduct post-incident reviews identify root causes and implement corrective actions to prevent recurrence.
  • Define and enforce service-level objectives (SLOs) service-level indicators (SLIs) and service-level agreements (SLAs).
  • Optimize system performance cost and resource utilization through analysis and continuous improvement.
  • Document infrastructure operational procedures incident reports and monitoring configurations.
  • Mentor junior engineers and promote best practices for reliability automation and observability.
  • Stay current with emerging technologies and DevOps practices to improve operational excellence.

Qualifications

  • Bachelors degree in Computer Science Information Technology or a related field.
  • 3-6 years of experience in site reliability engineering DevOps or system administration.
  • Strong understanding of Linux/Unix systems networking and cloud platforms (AWS Azure GCP).
  • Proficiency in scripting and programming languages such as Python Bash Go or Java.
  • Experience with monitoring logging and observability tools (Prometheus Grafana ELK Stack).
  • Familiarity with containerization and orchestration tools (Docker Kubernetes).

Preferred Skills / Duties

  • Experience with Infrastructure as Code (Terraform Ansible CloudFormation).
  • Knowledge of CI/CD tools and pipelines (Jenkins GitLab CircleCI).
  • Understanding of distributed systems microservices architecture and high-availability systems.
  • Strong problem-solving analytical and communication skills.
  • Ability to implement security best practices in operational environments.
  • Experience in automating repetitive operational tasks and improving system reliability

Job Tags

Full time

Similar Jobs

NSNA

Prototype Specialist II Job at NSNA

 ...tier one supplier of instrument clusters and head-up display units for FCA US, General Motors, BMW, Honda, Harley Davidson, Suzuki, Polaris, Arctic Cat and other OEM's. Purpose of Job The Prototype Specialist is responsible for managing all prototype build... 

Nation Security

School Security Officer - Bilingual English/ Spanish Job at Nation Security

 ...Nation Security is seeking professional, dependable, and bilingual Security Officers (English/Spanish) to join our dedicated team. In...  ...secure environment for students, staff, and visitors within a private school setting. The ideal candidate is reliable, alert, and committed... 

Confidential

Womenswear Designers - Wovens Job at Confidential

 ...Hard sticks and needles, urine, thyroid testing Processing Detail oriented and good experience Computer skills Data Entry and processing Minimum 1.5 years of experience Summary: The main function of a phlebotomist is to assist in performing various... 

One World Global Services

Japanese: US-Based Interpreter Job at One World Global Services

 ...internal professional training. Communicate and report to your team leader. YOUR BACKGROUND AND EXPERIENCE: ~ Proficiency/Bilingual/Native level of English and target language. ~1+ years of interpreting experience (Desirable). ~ High emotional intelligence and... 

Friendship Automotive

Savannah Motorsports Team Member Job at Friendship Automotive

 ...develop your career with an award-winning, customer focused automotive group.We're looking for new TEAM MEMBERS at SAVANNAH MOTORSPORTS Responsibilities:* Assist with providing an excellent customer experience* Effectively communicate with Leadership*...