i4DM Logo

i4DM

Associate Site Reliability Engineer

Posted 2 Days Ago
Remote
Hiring Remotely in USA
Junior
Remote
Hiring Remotely in USA
Junior
Support senior SREs to maintain availability, performance, and reliability of VA enterprise platforms. Assist with monitoring, incident response, automation, CI/CD, cloud/container operations (AWS, containers), documentation, and security/compliance under Federal requirements while developing SRE skills.
The summary above was generated by AI
Description

About Our Team 

Our employees thrive in a culture that is fast-paced, collaborative, and ego-free, where innovation and teamwork are encouraged at every level. We provide Federal agencies with immediate access to highly skilled professionals who understand complex mission challenges and deliver efficient, scalable solutions. By continuously investing in talent, technology, and specialized capabilities, we maintain expert teams prepared to support evolving Federal missions through tailored technical solutions and modern service delivery approaches. 

We value diverse perspectives and strive to attract talent from all backgrounds. We are seeking professionals who are passionate about technology, mission success, and solving complex operational challenges with creativity and purpose. If you enjoy expanding your technical expertise while supporting impactful Federal initiatives, you will thrive within our organization. Veterans and military spouses are strongly encouraged to apply and bring their valuable experience to our team. 

About the Role 

We are seeking a motivated and detail-oriented Associate Site Reliability Engineer to support the Technical Director’s team in advancing site reliability engineering, cloud operations, automation, and resilient service delivery for VA enterprise healthcare platforms and applications. 

In this role, you will work closely with senior engineers, the Technical Director, platform and operations teams, and VA stakeholders to support the availability, performance, and operational stability of mission-critical enterprise environments. 

The Associate Site Reliability Engineer will help apply software engineering and operational best practices to improve monitoring, automation, incident response, and service reliability while building foundational experience in cloud and platform engineering within a Federal environment. 


RESPONSIBILITIES 

Site Reliability Engineering Support 

  • Support senior engineers and the Technical Director’s team in day-to-day Site Reliability Engineering activities across platform services and hosted applications. 
  • Assist with maintaining service reliability, availability, and performance by following established operational practices, runbooks, and team standards. 
  • Help gather and review operational metrics, alerts, and system health information to identify issues and support service improvements. 

Monitoring, Observability & Incident Support 

  • Assist with monitoring, logging, alerting, and dashboard maintenance to improve visibility into system health and application performance. 
  • Participate in incident response activities, service restoration efforts, and post-incident follow-up under the guidance of senior team members. 
  • Support documentation of incidents, recurring issues, and operational procedures to improve team readiness and response consistency. 

Automation, CI/CD & Platform Operations 

  • Contribute to simple automation tasks, scripts, and pipeline updates that reduce manual effort and improve operational consistency. 
  • Support CI/CD processes and environment maintenance for application and infrastructure delivery in development, test, and production environments. 
  • Assist with infrastructure and configuration changes using approved tools, templates, and team guidance. 

Cloud & Environment Support 

  • Support cloud and hosted environments in AWS and container-based platforms by performing routine operational tasks, checks, and updates. 
  • Help maintain system documentation, inventory, and configuration information for services and environments managed by the team. 
  • Assist with validation, testing, and operational readiness activities for new releases and environment changes. 

Security, Compliance & Team Collaboration 

  • Follow established security, access, and operational procedures that support Federal compliance and secure system administration. 
  • Collaborate with software, infrastructure, operations, and support teams to resolve issues and support reliable service delivery. 
  • Continuously develop technical skills in cloud engineering, observability, automation, and reliability practices through hands-on work and mentorship. 

TAG: #LI-I4DM

TAG: INDMJC

Requirements

QUALIFICATIONS 

  • Bachelor’s degree in Computer Science, Information Technology, Engineering, or a related technical field, or equivalent practical experience. 
  • 1–3 years of experience in Site Reliability Engineering, DevOps, systems administration, cloud operations, platform support, software engineering, or a related technical role. 
  • Foundational understanding of Linux systems, cloud infrastructure concepts, networking basics, and application support in enterprise environments. 
  • Exposure to scripting or programming using languages such as Python, Bash, PowerShell, or similar technologies. 
  • Familiarity with monitoring, logging, troubleshooting, and incident response concepts. 
  • Basic knowledge of CI/CD, version control, automation, or Infrastructure as Code concepts. 
  • Ability to follow technical procedures, learn new tools quickly, and work effectively in a collaborative team environment. 
  • Candidates must be eligible to obtain and maintain a Public Trust clearance. 

PREFERRED QUALIFICATIONS 

  • Internship, academic, lab, or hands-on experience with cloud platforms such as AWS, Azure, or Google Cloud. 
  • Familiarity with containers and orchestration technologies such as Docker, Kubernetes, EKS, or ECS. 
  • Exposure to observability and monitoring tools such as CloudWatch, Grafana, Prometheus, ELK, or Splunk. 
  • Experience with Git-based workflows, pipeline tooling, or automation through coursework, labs, or professional experience. 
  • Understanding of Federal security, compliance, or healthcare technology environments. 
  • Relevant foundational certifications such as AWS Certified Cloud Practitioner, AWS Certified Developer Associate, CompTIA Linux+, Security+, or HashiCorp Terraform Associate.

Similar Jobs

22 Days Ago
In-Office or Remote
Georgia, USA
Senior level
Senior level
Fintech • Information Technology • Software • Financial Services
Design, build, and maintain real-time, secure distributed systems and observability UIs/APIs. Implement CI/CD, containerized deployments (Docker/Kubernetes/OpenShift), integrate observability stack (Elasticsearch/Logstash/Grafana), and apply secure coding and API security standards to ensure reliability, performance, and incident automation. Collaborate in Agile teams and explore AI to improve resiliency.
Top Skills: Agentic AiCi/CdDockerElasticsearchGrafanaJava Spring BootKafkaKubernetesLogstashMariadbNode.jsOauth2OpenshiftReactSecrets Management
12 Minutes Ago
Remote or Hybrid
USA
125K-180K Annually
Expert/Leader
125K-180K Annually
Expert/Leader
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Manage a team of TPRM analysts to run the vendor risk lifecycle, improve tooling and automation (ServiceNow TPRM, AI), perform assessments and audits, develop TPRM policies aligned to frameworks (NIST/ISO/SOC 2), partner with procurement/legal/IT, track KPIs, and support audit and reporting to leadership.
Top Skills: Ai/Ml ToolsCloud EnvironmentsCrowdstrike ProductsFairIso 27001Nist 800-53Nist CsfSecure CodingServicenowServicenow TprmSigSoc 2
An Hour Ago
Remote or Hybrid
255K-445K Annually
Expert/Leader
255K-445K Annually
Expert/Leader
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Set technical direction for a multi-cloud, cloud-native platform: design control planes, multi-cluster topology, workload isolation, identity/trust fabrics, and reliability at scale. Solve ambiguous platform problems, build critical components (operators, control planes), influence architecture across orgs, and mentor senior engineers.
Top Skills: AksAWSAzureCniCrossplaneEksGCPGitopsGkeGoInfrastructure-As-CodeKata ContainersKubernetesMtlsObservability (MetricsOci BundlingOperator/Controller PatternOperatorsService MeshSlos)SpiffeSpireTracing

What you need to know about the Chicago Tech Scene

With vibrant neighborhoods, great food and more affordable housing than either coast, Chicago might be the most liveable major tech hub. It is the birthplace of modern commodities and futures trading, a national hub for logistics and commerce, and home to the American Medical Association and the American Bar Association. This diverse blend of industry influences has helped Chicago emerge as a major player in verticals like fintech, biotechnology, legal tech, e-commerce and logistics technology. It’s also a major hiring center for tech companies on both coasts.

Key Facts About Chicago Tech

  • Number of Tech Workers: 245,800; 5.2% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: McDonald’s, John Deere, Boeing, Morningstar
  • Key Industries: Artificial intelligence, biotechnology, fintech, software, logistics technology
  • Funding Landscape: $2.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Pritzker Group Venture Capital, Arch Venture Partners, MATH Venture Partners, Jump Capital, Hyde Park Venture Partners
  • Research Centers and Universities: Northwestern University, University of Chicago, University of Illinois Urbana-Champaign, Illinois Institute of Technology, Argonne National Laboratory, Fermi National Accelerator Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account