Associate Data Engineer
Associate Data Engineer
India - Hyderabad APLICAR AHORAABOUT AMGEN
Amgen harnesses the best of biology and technology to fight the world’s toughest diseases, and make people’s lives easier, fuller and longer. We discover, develop, manufacture and deliver innovative medicines to help millions of patients. Amgen helped establish the biotechnology industry more than 40 years ago and remains on the cutting-edge of innovation, using technology and human genetic data to push beyond what’s known today.
ABOUT THE ROLE
Role Description:
The role is responsible for designing, building, maintaining, analyzing, and interpreting data to provide actionable insights that drive business decisions. This role involves working with large datasets, developing reports, supporting and executing data governance initiativesand visualizing data to ensure data is accessible, reliable, and efficiently managed. The ideal candidate has strong technical skills, experience with big data technologies, and a deep understanding of data architecture and ETL processes
Roles & Responsibilities:
Analyze complex datasets (pricing, rebates, provider contracting, market access) using Excel, SQL, and Databricks to create reports, dashboards, and analytics-ready datasets that drive decision-making.
Identify trends, anomalies, and opportunities within large-scale datasets, delivering actionable insights and strategic recommendations in partnership with business teams.
Clean, organize, and validate data for accuracy and consistency; build and maintain ETL/ELT workflows (Databricks, PySpark, SQL); apply data quality checks, dictionaries, and governance standards.
Collaborate with cross-functional teams (pricing, contracting, finance, product, and data science) to integrate data from multiple sources (cloud storage, APIs, SQL databases) and bridge technical and business needs.
Automate ingestion, preparation, and reporting pipelines with Databricks Jobs and scheduling tools, ensuring timely delivery of datasets and insights.
Optimize queries, transformations, and Spark jobs for scalability and performance, applying caching, partitioning, and indexing techniques.
Document data sources, methodologies, and business rules clearly; communicate insights through structured reports, visualizations, and presentations; adhere to governance, security, and privacy requirements.
Coordinate with global stakeholders across time zones to support reporting needs, troubleshoot issues, and align on priorities; contribute to agile practices such as sprint planning and estimations.
Basic Qualifications and Experience:
Bachelor’s / Master’s degree and 3 to 5 years of Computer Science, IT or related field experience
Functional Skills:
Must-Have Skills
Hands-on experience with big data technologies and platforms such as Databricks and Apache Spark (PySpark, SparkSQL), including workflow orchestration and performance tuning for large-scale data processing.
Proficiency in SQL and advanced Excel (pivot tables, lookups, advanced formulas) for data analysis, with experience generating clear, actionable reports and summaries.
Excellent problem-solving skills with the ability to work with large, complex datasets to identify trends, anomalies, and actionable insights.
Strong understanding of data governance frameworks, tools, and best practices, with knowledge of data protection regulations (e.g., GDPR, CCPA).
Good-to-Have Skills:
Experience with ETL tools such as Apache Spark, and familiarity with various Python packages related to data processing and machine learning model development.
Strong understanding of data modeling, data warehousing, and data integration concepts to support analytics readiness.
Knowledge of Python/R, Databricks, SageMaker, and cloud data platforms (AWS preferred, Snowflake exposure a plus).
Experience with data visualization tools (Power BI, Tableau, or equivalent) to present insights effectively to business stakeholders.
Familiarity with healthcare, pharmaceutical market access, pricing, rebates, or contracting datasets to provide context-driven analytics.
Ability to automate workflows using Databricks Jobs, notebooks, and scheduling tools, ensuring timely delivery of analytics and reports.
Professional Certifications (Preferred):
Certified Data Engineer / Data Analyst (preferred on Databricks or cloud environments)
Soft Skills:
Excellent critical-thinking and problem-solving skills
Strong communication and collaboration skills
Demonstrated awareness of how to function in a team setting
Demonstrated presentation skills
Shift Information:
This position requires you to work a later shift and may be assigned a second or third shift schedule. Candidates must be willing and able to work during evening or night shifts, as required based on business requirements.
EQUAL OPPORTUNITY STATEMENT
Amgen is an Equal Opportunity employer and will consider you without regard to your race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, or disability status.
We will ensure that individuals with disabilities are provided with reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request an accommodation.