Site Reliability Engineer

Attention Job Applicants:

At Amgen, we are relentless in applying the highest ethical standards to our products, services and communications. Consistent with this, we expect all job applicants to act with honesty and integrity. Providing any false or misleading information, or omitting material information during the hiring process, may result in immediate disqualification from the hiring process or termination if already employed. We appreciate your cooperation in helping us uphold these standards.

Job ID: R-221564
Location: India - Hyderabad
Status: On Site
Date posted: Aug. 29, 2025
Job category: Information Systems

Join Amgen’s Mission of Serving Patients

At Amgen, if you feel like you’re part of something bigger, it’s because you are. Our shared mission—to serve patients living with serious illnesses—drives all that we do.

Since 1980, we’ve helped pioneer the world of biotech in our fight against the world’s toughest diseases. With our focus on four therapeutic areas (Oncology, Inflammation, General Medicine, and Rare Disease), we reach millions of patients each year. As a member of the Amgen team, you’ll help make a lasting impact on the lives of patients as we research, manufacture, and deliver innovative medicines to help people live longer, fuller, happier lives.

Our award-winning culture is collaborative, innovative, and science-based. If you have a passion for challenges and the opportunities that lie within them, you’ll thrive as part of the Amgen team. Join us and transform the lives of patients while transforming your career.

What you will do

Let’s do this. Let’s change the world. In this vital role you will be responsible for the reliability, stability, performance, scalability, and security of platforms that support Amgen’s digital products and engineering teams. This hands-on role focuses on supporting cloud-based infrastructure, automating operations, maintaining observability, and improving platform reliability through code.

You’ll work closely with senior engineers and cross-functional teams to support CI/CD workflows, container platforms, incident response, and enterprise tooling—all while adopting modern SRE principles and practices.

This role is ideal for engineers who have foundational site reliability experience and are looking to expand their skills in a cloud-native, enterprise-scale environment.

Roles & Responsibilities:

Infrastructure & Platform Support

Provision and manage cloud infrastructure using Infrastructure as Code (IaC)
Support container orchestration platforms, ensuring availability, access control, and resource management
Assist in configuring and maintaining CI/CD pipelines and environments

Monitoring & Incident Response

Set up and maintain observability tools to track system health and performance
Participate in alert tuning, incident resolution, and root cause analysis
Support integration of observability platforms with incident response workflows

Automation & Platform Operations

Automate routine platform tasks such as provisioning, patching, and configuration
Write scripts to improve platform reliability, reduce manual work, and enforce compliance
Participate in platform upgrades, maintenance windows, and service validation efforts

AI Enablement & Intelligence

Support the adoption of AI-assisted operational tools for log analysis, anomaly detection, and predictive alerts
Collaborate with senior engineers to evaluate AI/ML-based observability and automation platforms
Assist in integrating AI-driven insights into dashboards, alerts, or incident workflows
Stay current with emerging AI trends in infrastructure and site reliability, and contribute to tool evaluations and pilots

Collaboration & Enablement

Work with development, QA, and security teams to ensure reliable and secure deployments
Document operational procedures, playbooks, and system runbooks
Learn and support enterprise collaboration platforms and internal tooling
Participate in Agile and SAFe delivery processes (including sprint planning, stand-ups, retrospectives, and PI planning) to ensure security and platform reliability are embedded across development cycles

What we expect of you

We are all different, yet we all use our unique contributions to serve patients. The [vital attribute] professional we seek is a [type of person] with these qualifications.

Basic Qualifications:

Master's degree or Bachelor's degree and 5 to 9 years of experience in Computer Science, IT, or a related field
4 years of hands-on related experience in site reliability, DevOps, or platform engineering roles
Hands-on experience with cloud platforms, preferably AWS
Familiarity with Kubernetes or container orchestration technologies
Exposure to CI/CD practices and pipeline automation
Experience troubleshooting Linux systems, processes, and services

Preferred Qualifications:

Must-Have Skills:

Practical experience with cloud platforms (e.g., AWS, Azure, or GCP), including compute, networking, IAM, and storage services
Familiarity with container orchestration platforms (e.g., Kubernetes, Docker), including basic workload deployment and troubleshooting
Experience using Infrastructure as Code (IaC) tools such as Terraform or CloudFormation
Working knowledge of Linux administration, including system services, package management, and file system structures
Hands-on exposure to CI/CD platforms (e.g., GitLab CI, Jenkins, GitHub Actions) and pipeline troubleshooting
Proficiency in scripting or automation languages like Python, Bash, or Go
Exposure to observability tooling (e.g., Dynatrace, Prometheus, or Grafana) for monitoring and alerting
Familiarity with incident management practices and tools (e.g., runbooks, escalation workflows, basic alert tuning)
Version control skills using Git and understanding of branching strategies
Experience supporting or integrating enterprise collaboration platforms (e.g., Jira, Confluence, ServiceNow)
Interest in and basic understanding of AI/ML tools used in infrastructure and operations (e.g., anomaly detection, intelligent alerting, log analysis)

Good-to-Have Skills:

Experience using Infrastructure as Code (IaC) tools like Terraform or CloudFormation
Familiarity with IT incident response workflows and ticketing platforms
Knowledge of secrets management, configuration management tools (e.g., Ansible), or logging frameworks
Exposure to AI-assisted tooling (e.g., AIOps platforms, AI-enhanced alerting, anomaly detection)

Professional Certifications (Preferred)

Cloud DevOps Certification (AWS/Azure/GCP)
Certified Kubernetes Administrator (CKA) or Security Specialist (CKS)
CI/CD Platform Certification
ITIL Foundation or equivalent service management certification

Soft Skills:

Strong analytical and troubleshooting skills
Collaborative and proactive mindset
Effective communication and documentation practices
Curiosity and willingness to adopt new tools and methods, including AI integrations
Ability to manage time and prioritize tasks in dynamic environments

Shift Information: This position is an onsite role and may require working during later hours to align with business hours. Candidates must be willing and able to work outside of standard hours as required to meet business needs.

What you can expect of us

As we work to develop treatments that take care of others, we also work to care for your professional and personal growth and well-being. From our competitive benefits to our collaborative culture, we’ll support your journey every step of the way.

In addition to the base salary, Amgen offers competitive and comprehensive Total Rewards Plans that are aligned with local industry standards.

Apply now and make a lasting impact with the Amgen team.

careers.amgen.com

As an organization dedicated to improving the quality of life for people around the world, Amgen fosters an inclusive environment of diverse, ethical, committed and highly accomplished people who respect each other and live the Amgen values to continue advancing science to serve patients. Together, we compete in the fight against serious disease.

Amgen is an Equal Opportunity employer and will consider all qualified applicants for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, disability status, or any other basis protected by applicable law.

We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.
