Site Reliability Engineer III
Job Description
Job Description
We are seeking a Site Reliability Engineer (SRE) to support a greenfield initiative within the Trade Compliance and Innovation team. This role will serve as the primary SRE for one squad within a geographically distributed team and will support a second squad as needed. The SRE will play a key role in enabling scalable, secure, and highly reliable infrastructure from development through production while partnering closely with development and QA teams.
This is a hands‑on role requiring strong DevOps and cloud infrastructure expertise, a software‑engineering mindset, and the ability to independently research, design, and implement solutions that improve system reliability and operational efficiency.
Key Responsibilities:
· Apply software engineering practices to IT operations to maintain scalable, secure, and highly available production environments.
· Act as a bridge between development and operations by applying engineering rigor to system administration and infrastructure management.
· Design, build, and support infrastructure across DEV, Test, and Production environments.
· Develop and maintain automation using code to analyze logs, monitor systems, test environments, and respond to incidents.
· Implement and manage Infrastructure as Code (IaC) using Terraform following organizational best practices.
· Support deployments of Java and Python‑based microservices, containerized workloads, and related cloud services.
· Implement and manage blue‑green deployments, scaling strategies (horizontal and vertical), resiliency, and security postures.
· Support Azure Container Apps (ACA) and Kubernetes platforms (AKS).
· Work with messaging systems, webhooks, Azure Functions, and distributed integrations.
· Support monitoring, logging, and observability using enterprise tools (e.g., ELK, Grafana).
· Partner closely with global Dev, QA, and SRE team members to resolve infrastructure and reliability issues.
· Research, learn, and apply new technologies and solutions as required.
Qualifications:
· 3–5+ years of experience in a Site Reliability Engineering, DevOps, or infrastructure‑focused role.
· Bachelor’s degree in Computer Science, Computer Engineering, Information Technology, or a related field.
· Strong hands‑on experience with Azure cloud infrastructure.
· Proven expertise with Infrastructure as Code (Terraform).
· Strong DevOps/SRE skillset with the ability to work independently and collaborate with other SREs and Dev teams.
· Experience supporting Java and Python microservices in cloud environments.
· Experience with CI/CD pipelines, specifically GitHub Actions/Pipelines.
· Strong understanding of NFRs, including performance, scalability, resiliency, and security.
· Proficiency in one or more programming/scripting languages such as Python, Go, Java, .NET, or Node.js.
· Experience with infrastructure monitoring and logging platforms.
· Strong problem‑solving, research, and multitasking capabilities.
· Clear communication skills to explain technical problems and solutions to diverse stakeholders.
· Ability to support infrastructure needs across multiple time zones with early CST availability.
Preferred Qualifications:
· Experience with Azure Kubernetes Service (AKS) and Azure Container Apps (ACA).
· Experience with messaging systems, event‑driven architectures, and webhooks.
· Hands‑on experience with Azure Functions.
· Experience deploying and managing Azure OpenAI services.
· Familiarity with ELK stack, Grafana, or similar observability tools.
· Azure certifications.
· Experience with MLOps, LLMOps, or AI‑focused infrastructure.
· Prior experience supporting globally distributed teams.
Recommended Jobs
Principal Customer Success Manager, Large Law
We offer a flexible working policy that supports a healthy balance between personal and professional well-being. This role requires in-office presence on Tuesdays & Thursdays to collaborate, connect,…
Caregiver - Elk Grove
Would you like to work for an Award Winning Caregiving Company? Come join us at Senior Helpers of Greater Chicago! We are Senior Helpers Store of the Year 2024 and the first national in-home care comp…
Technology Sales Specialist - SE region
GlassHouse Systems (GHS) is an enterprise systems, and managed services solutions provider that develops, designs and deploys solutions for leading enterprises in Canada and the US. For almost 33 yea…
Senior Business Analyst - Compliance Documentation
Job Description Job Description Job Title: Senior Business Analyst – Compliance Documentation Location: Chicago, IL Onsite: 5 days/week - 100% onsite Duration: 4/21/25 to 7/31/25 (st…
PJM Interconnection Project Manager
Job Responsibilities: Experience with land acquisitions, zoning, easements, and permitting Experience managing large electrical, gas, or water utility projects This position will be responsi…
Beauty Advisor
Sephora is seeking a part-time Beauty Advisor in Skokie, United States, to deliver personalized beauty experiences and drive sales results. The role requires previous retail or hospitality experience,…
Ford Diesel Technician
Ron Hopkins Ford, is looking to bring on experienced Service Technicians to join our team and to continue to grow and develop in their careers. We believe the customer comes first, and we're looking t…
Desktop Support Technician - ADSDJ
Job Description Job Description · Provide onsite support to Authorized Users with operational and technical support and to meet specified SLAs. · Troubleshoot, diagnose and resolve…
Executive Coordinator
Mission The mission of the Latino Policy Forum is to build the power, influence, and leadership of the Latino community through collective action to transform public policies that ensure the well…
Power Apps Developer (Hiring Immediately)
Job Family : SAAS/PAAS/Cloud Consulting Travel Required : Up to 25% Clearance Required : Ability to Obtain Secret What You Will Do : We are looking for a Power Platfor…