You will join a team that fuses data engineering with cutting-edge science, using HPC and AWS to deliver reproducible workflows at scale. From first ingestion to consumption by scientists and AI models, you will set the standard for reliability, speed, and governance across our data foundation.
Do you thrive where learning is continuous and bold ideas are encouraged? You will have the freedom to experiment, the support to grow, and the opportunity to see your work influence breakthroughs as they take shape.
Accountabilities:
- Pipeline Engineering: Design, implement, and operate fit-for-purpose data pipelines for bioinformatics and scientific data, from ingestion to consumption.
- Workflow Orchestration: Build reproducible pipelines using frameworks such as Nextflow (preferred) or Snakemake; integrate with schedulers and HPC/cloud resources.
- Data Platforms: Develop data models, warehousing layers, and metadata/lineage; ensure data quality, reliability, and governance.
- Scalability and Performance: Optimize pipelines for throughput and cost across Unix/Linux HPC and cloud environments (AWS preferred); implement observability and reliability practices.
- Collaboration: Translate scientific and business requirements into technical designs; partner with CPSS stakeholders, R&D IT, and DS&AI to co-create solutions.
- Engineering Excellence: Establish and maintain version control, CI/CD, automated testing, code review, and design patterns to ensure maintainability and compliance.
- Enablement: Produce documentation and reusable components; mentor peers and promote best practices in data engineering and scientific computing.
Desirable Skills/Experience:
- Strong programming in Python and Bash for workflow development and scientific computing.
- Experience with containerization and packaging (Docker, Singularity, Conda) for reproducible pipelines.
- Familiarity with data warehousing and analytics platforms (e.g., Redshift, Snowflake, Databricks) and data catalog/lineage tools.
- Experience with observability and reliability tooling (Prometheus/Grafana, ELK, tracing) in HPC and cloud contexts.
- Knowledge of infrastructure as code and cloud orchestration (Terraform, CloudFormation, Kubernetes).
- Understanding of FAIR data principles and domain-specific bioinformatics formats and standards.
- Track record of mentoring engineers and enabling cross-functional teams with reusable components and documentation.
- Experience optimizing performance and cost on AWS, including spot strategies, autoscaling, and storage tiers.
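As a hedged illustration of the reproducibility theme running through the lists above, here is a minimal Python sketch of an idempotent, checksum-verified ingestion step. All names (`sha256sum`, `ingest`, the manifest layout) are hypothetical and chosen for this sketch, not drawn from any specific pipeline described in the role:

```python
import hashlib
import json
from pathlib import Path


def sha256sum(path: Path) -> str:
    """Stream a file through SHA-256 so large scientific files stay memory-safe."""
    h = hashlib.sha256()
    with path.open("rb") as fh:
        for chunk in iter(lambda: fh.read(1 << 20), b""):
            h.update(chunk)
    return h.hexdigest()


def ingest(src: Path, manifest: Path) -> bool:
    """Record the file's checksum in a JSON manifest; skip work already done.

    Returns True if the file was newly ingested, False if it was a no-op
    (same path, same checksum), which makes pipeline re-runs idempotent.
    """
    digest = sha256sum(src)
    entries = json.loads(manifest.read_text()) if manifest.exists() else {}
    if entries.get(str(src)) == digest:
        return False  # already ingested with identical content
    entries[str(src)] = digest
    manifest.write_text(json.dumps(entries, indent=2))
    return True
```

In a real orchestrated workflow this kind of guard would typically live behind the framework's own caching (Nextflow's `-resume`, for instance) rather than be hand-rolled, but the pattern captures what "reproducible from ingestion to consumption" means in practice.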
When we put unexpected teams in the same room, we unleash bold thinking with the power to inspire life-changing medicines. In-person working gives us the platform we need to connect, work at pace and challenge perceptions. That's why we work, on average, a minimum of three days per week from the office. But that doesn't mean we're not flexible. We balance the expectation of being in the office while respecting individual flexibility. Join us in our unique and ambitious world.
Why AstraZeneca:
Your engineering craft will fuel science at the crossroads of biology, data, and technology. You will collaborate with researchers, data scientists, and technologists to tackle complex diseases, using modern platforms and inclusive ways of working to turn uncertainty into insight. We value kindness alongside ambition, nurture resilience and curiosity, and pair the resources of a global leader with the agility to move at pace: from hands-on experimentation to shared learning and tangible impact for patients.