Site Reliability Engineering (SRE) Architect Job at Quantum Technologies. LLC, Atlanta, GA

TTF2VWNtUFdHbmhKeUw1bm1ISnY5aS94
  • Quantum Technologies. LLC
  • Atlanta, GA

Job Description

Overview Site Reliability Engineering (SRE) Architect Location: Atlanta, GA Duration: 12 Months+ Extension Hourly Rate: Depending on Experience (DOE) Work Authorization: As an SRE Architect, you will be a pivotal technical leader responsible for designing, building, and evolving the foundational systems and practices that ensure the reliability, scalability, performance, and efficiency of our critical services. Moving beyond day-to-day operations, you will focus on the strategic architectural direction of SRE function, defining standards, blueprints, and frameworks that enable development teams and fellow SRE operations team to build and operate highly resilient systems. Leverage deep expertise in software engineering, distributed systems, cloud infrastructure, and SRE principles to influence technology choices, establish best practices, and foster a proactive culture of reliability across the organization and much beyond observability pillar. Responsibilities Reliability Strategy & Design: Architect and design highly available, scalable, secure, and cost-effective infrastructure and application patterns on AWS Reliability Strategy & Design: Define and evangelize SRE best practices, standards, and blueprints for service design, deployment, monitoring, and operational readiness across the engineering organization Reliability Strategy & Design: Review current observability implementation to identify gaps and define steps to reach next level maturity of observability setup to provide deep insights into system health and behaviour Reliability Strategy & Design: With overall maturity lead the definition and implementation strategy for Service Level Indicators (SLIs), Service Level Objectives (SLOs), and Error Budgets for critical services Reliability Strategy & Design: Design solutions to systematically reduce operational toil through automation and improved system design Reliability Strategy & Design: Evaluate current SRE tools and automation frameworks (e.g., CI/CD pipelines, Infrastructure as Code modules, automated incident remediation, chaos engineering platforms) and suggest enhancement that will help overall enhancement of capability Reliability Strategy & Design: Evaluate, prototype, and recommend new technologies, tools, and methodologies to enhance system reliability, developer productivity, and operational efficiency Technical Leadership & Consultation: Act as a senior technical advisor and subject matter expert on reliability, scalability, and performance for development and platform teams Technical Leadership & Consultation: Provide architectural guidance during the design phase of new services and features to ensure reliability principles are embedded early (shift-left) Technical Leadership & Consultation: Mentor and coach other SREs and engineers, fostering technical excellence and adherence to SRE principles Technical Leadership & Consultation: Lead architectural reviews and production readiness assessments for critical systems Resilience: Lead blameless postmortems for significant incidents, ensuring root causes are identified and systemic architectural improvements are prioritized and implemented Resilience: Architect and advocate for resilience patterns (e.g., circuit breaking, rate limiting, graceful degradation, chaos engineering) within applications and infrastructure Required Qualifications Proven experience in an architectural role, designing solutions for reliability, scalability, and performance Deep understanding and practical application of SRE principles (SLIs/SLOs, error budgets, toil reduction, automation, incident management, postmortems) Expertise in cloud computing platforms (e.g., AWS) including infrastructure, networking, and security services Strong experience with containerization and orchestration technologies (Kubernetes, Docker, serverless computing) Solid experience designing and implementing observability solutions (e.g., Dynatrace, Prometheus, Grafana, ELK/EFK Stack, Jaeger, OpenTelemetry) Strong programming/scripting skills (e.g., Python, Go, Bash) for automation and tool development Excellent analytical, problem-solving, and strategic thinking skills. Strong communication, collaboration, and leadership skills with the ability to influence technical direction across teams Preferred Qualifications Experience designing and implementing chaos engineering practices and platforms QUANTUM TECHNOLOGIES LLC is an equal opportunity employer inclusive of female, minority, disability and veterans, (M/F/D/V). Hiring, promotion, transfer, compensation, benefits, discipline, termination and all other employment decisions are made without regard to race, color, religion, sex, sexual orientation, gender identity, age, disability, national origin, citizenship/immigration status, veteran status or any other protected status. QUANTUM TECHNOLOGIES LLC will not make any posting or employment decision that does not comply with applicable laws relating to labor and employment, equal opportunity, employment eligibility requirements or related matters. Nor will QUANTUM TECHNOLOGIES LLC require in a posting or otherwise U.S. citizenship or lawful permanent residency in the U.S. as a condition of employment except as necessary to comply with law, regulation, executive order, or federal, state, or local government contract #J-18808-Ljbffr Quantum Technologies. LLC

Job Tags

Hourly pay, Permanent employment, Contract work, Local area, Early shift

Similar Jobs

Sessions College for Professional Design

UI/UX Designer Job at Sessions College for Professional Design

UI/UX Designer is a cool new title for an in-demand Web design role.UI/UX Designer For better or worse, the terms UI and UX are increasingly...  ..., ensure they work, and make them an exciting, and memorable part of the customer experience.UI/UX Designers : Turning the user... 

North Cook Intermediate Service Center

Student Advocate Job at North Cook Intermediate Service Center

 ...officially certified as a "Great Place to Work!" Work Calendar - August 1 - June 30 annually (11 months) Work Schedule - Student Advocates will follow the teacher calendar plus August and June days. Student Advocates do not work in July Winter break off (paid... 

FedEx Group

Warehouse Fulfillment Specialist Job at FedEx Group

A leading logistics company in Houston is looking for a Part Time Handler. This non-driving role requires the movement of packages and documents...  ...6:00 AM to 9:00 AM, with a pay rate of $18.06 per hour. Join a team that values safety and efficiency.#J-18808-Ljbffr FedEx Group

G. BYRD Trucking, LLC.

Truck Owner Operator Job at G. BYRD Trucking, LLC.

Job Description Job Description Owner Ops- We are seeking Owner Operators to join our team! We Offer consistent Dry Van freight wherever you want to run!!-Regional Owner operators averaging $5800.00-6500.00 Gross per week, Home on the Weekends!!!-OTR Drivers...

Hadrian Automation

Manufacturing Engineer, Additive (DED) Job at Hadrian Automation

Hadrian - Manufacturing the FutureHadrian is building autonomous factories that help aerospace...  ...& Documentation: Maintain clear engineering documentation, process records, work instructions...  ...Coordination: Work closely with Additive Manufacturing, Manufacturing Engineering...