‹ Back
About us
Owkin is an AI company on a mission to solve the complexity of biology.
It is building the first Biology Super Intelligence (BASI) by combining powerful biological large language models, multimodal patient data, and agentic software. At the heart of this system is Owkin K, an AI copilot and its new LLM fine-tuned on biology called Owkin Zero, used by researchers, clinicians, and drug developers to better understand biology, validate scientific hypotheses, and deliver better diagnostics and therapies faster. Position is based in our London office or remotely in UK and Germany. Please submit your CV in EnglishAbout the role:You will be part of the Engineering team.
This role involves designing, building, and optimizing scalable ETL/ELT pipelines with Airflow to process complex datasets efficiently while ensuring reliability and performance.
You will organize and structure data systems, aligning them with business objectives, and demonstrate expertise in scientific and healthcare information systems to deliver data products tailored for machine learning and AI research. Clear reporting and meticulous attention to detail are essential, as is the ability to manage high-volume, complex workstreams while prioritizing multiple deadlines.
The role requires professional interpersonal skills to collaborate with diverse stakeholders in biotechnology and the ability to streamline production workflows for scientific processing and quality assurance. Organize and structure data systems at both macro and micro levels, designing and implementing data architectures that support business goalsOptimize data pipelines for performance, reliability, and scalabilityDesign, build, and maintain scalable ETL/ELT pipelines with Airflow to process large-scale, complex datasetsDemonstrate ability to delivery of of data products useful for machine learning and AI research and development (data models, metadata and semantics)Strong organizational skills to effectively manage high-volume, complex workstreams while prioritizing multiple deadlineDemonstrate knowledge of scientific and healthcare information systems and data sources and relevant software toolsDemonstrate ability to handle a variety of activities across operational delivery and development and initiativesDemonstrate professional interpersonal skills with ability to work both independently and collaboratively with a variety of stakeholders on complex biotechnology areas. Streamline the process of taking scientific processing and quality check in production, ensuring proper monitoring of the production workflows.
In particular, you will:Design and optimizing data pipelines using AirflowDevelop robust solutions in Python and SQLDesign, develop, and operate scalable ETL/ELT pipelines to process and transform datasets. Work with cross-functional teams, including data scientists, business developers, software engineers and bio medical researchers to deliver high-quality data solutions. Manage and monitor containerized data infrastructures with Docker and Kubernetes and other cloud platforms. Implement and enforce best practices for data governance, security, and compliance. Build, optimize and maintain data architectures, including data lakes, data warehouses, and analytical Insights Productionize the data processing pipelines, setting and enforcing standards and best practices across scientific teams to deliver high quality data in an efficient and scalable way. About youRequired
qualifications
/ experience:Master degree in computer sciences or specialization in Data Significant experience (5+ years) as a Data Engineer and have good knowledge of DataOps practices. Experience in Python and SQL and you have familiarity with RExperience in architectural design of complex data platformsProficient in the technologies like Airflow, AWS steps functions, PostgreSQL, Docker, Kubernetes, Grafana, Infrastructure as CodeAutonomous, meticulous, and enjoy teamworkSoftware development with a focus on code quality, simplicity, maintainabilityExperience in designing data architecture and building data productsExperience handling sensitive personal informationFluent in EnglishPreferred
qualifications
/bonus:Knowledge in healthcare or biology areasData quality tools: Great expectation, pydantic, pandera, SQLMesh etc, Debugging Refactoring skillsWhat we offerFlexible work organization Friendly and informal working environmentOpportunity to work with an international team with high technical and scientific backgroundsRecruitment Process SecurityPlease complete the form and submit your CV. Owkin is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, sex, gender, sexual orientation, age, color, religion, national origin, protected veteran status or on the basis of disability. Owkin is a great place to work.
As a coveted workplace we are, unfortunately, vulnerable to recruitment phishing scams.
We urge all job seekers and candidates to be wary of potential scams. Most of these have individuals posing as representatives of prominent companies, including Owkin, with the aim of obtaining personal, sensitive, or financial information from applicants.
These scams prey upon an individual’s desire to obtain a job and can sometimes “feel” like a genuine recruitment process. Some red flags are identified below. Should you encounter a recruitment process that claims to be for Owkin but is not consistent with the below, please do not provide any personal or financial information:Legitimate Owkin recruitment processes include communication with candidates through recognized professional networks, such as LinkedIn. Communication is always through an official Owkin email address (from the @owkin. com domain), over the phone or through our applicant tracking system (Greenhouse).
The Owkin talent team do use platforms such as LinkedIn and Job Teaser, however if you have any concern or doubt about this contact, please ask for them to send an email from @Owkin. com.
The Owkin talent team will not solicit personal data from candidates during the application phase including , but not limited to, date of birth, social security numbers, or bank account information;Legitimate Owkin interviews may be conducted over the phone, in person, or via an approved enterprise videoconferencing service (Google Meets).
They will not occur via Signal, Telegram or MessengerOwkin offers of employment are based on merit and only extended once a candidate has interviewed with members of the talent and hiring team. Offers will be extended both verbally and in written format.
If you think that you have been a victim of fraud, Check the identity of the talent team on LinkedInCheck our senior team on our website https://owkin. com/team/Check the existence of the position on our website: https://www. owkin. com/careers#current-opportunitiesNotify Owkin's recruitment unit at this address hiring@owkin. comcontact the following authorities:[FR] https://internet-signalement. gouv. fr/[UK] https://www. actionfraud. police. uk/reporting-fraud-and-cyber-crime[US] https://reportfraud. ftc. gov/.
Data Engineer
JOB SUMMARY
Roles
Skills & Technologies
Job details
About us
Owkin is an AI company on a mission to solve the complexity of biology.
It is building the first Biology Super Intelligence (BASI) by combining powerful biological large language models, multimodal patient data, and agentic software. At the heart of this system is Owkin K, an AI copilot and its new LLM fine-tuned on biology called Owkin Zero, used by researchers, clinicians, and drug developers to better understand biology, validate scientific hypotheses, and deliver better diagnostics and therapies faster. Position is based in our London office or remotely in UK and Germany. Please submit your CV in EnglishAbout the role:You will be part of the Engineering team.
This role involves designing, building, and optimizing scalable ETL/ELT pipelines with Airflow to process complex datasets efficiently while ensuring reliability and performance.
You will organize and structure data systems, aligning them with business objectives, and demonstrate expertise in scientific and healthcare information systems to deliver data products tailored for machine learning and AI research. Clear reporting and meticulous attention to detail are essential, as is the ability to manage high-volume, complex workstreams while prioritizing multiple deadlines.
The role requires professional interpersonal skills to collaborate with diverse stakeholders in biotechnology and the ability to streamline production workflows for scientific processing and quality assurance. Organize and structure data systems at both macro and micro levels, designing and implementing data architectures that support business goalsOptimize data pipelines for performance, reliability, and scalabilityDesign, build, and maintain scalable ETL/ELT pipelines with Airflow to process large-scale, complex datasetsDemonstrate ability to delivery of of data products useful for machine learning and AI research and development (data models, metadata and semantics)Strong organizational skills to effectively manage high-volume, complex workstreams while prioritizing multiple deadlineDemonstrate knowledge of scientific and healthcare information systems and data sources and relevant software toolsDemonstrate ability to handle a variety of activities across operational delivery and development and initiativesDemonstrate professional interpersonal skills with ability to work both independently and collaboratively with a variety of stakeholders on complex biotechnology areas. Streamline the process of taking scientific processing and quality check in production, ensuring proper monitoring of the production workflows.
In particular, you will:Design and optimizing data pipelines using AirflowDevelop robust solutions in Python and SQLDesign, develop, and operate scalable ETL/ELT pipelines to process and transform datasets. Work with cross-functional teams, including data scientists, business developers, software engineers and bio medical researchers to deliver high-quality data solutions. Manage and monitor containerized data infrastructures with Docker and Kubernetes and other cloud platforms. Implement and enforce best practices for data governance, security, and compliance. Build, optimize and maintain data architectures, including data lakes, data warehouses, and analytical Insights Productionize the data processing pipelines, setting and enforcing standards and best practices across scientific teams to deliver high quality data in an efficient and scalable way. About youRequired
qualifications
/ experience:Master degree in computer sciences or specialization in Data Significant experience (5+ years) as a Data Engineer and have good knowledge of DataOps practices. Experience in Python and SQL and you have familiarity with RExperience in architectural design of complex data platformsProficient in the technologies like Airflow, AWS steps functions, PostgreSQL, Docker, Kubernetes, Grafana, Infrastructure as CodeAutonomous, meticulous, and enjoy teamworkSoftware development with a focus on code quality, simplicity, maintainabilityExperience in designing data architecture and building data productsExperience handling sensitive personal informationFluent in EnglishPreferred
qualifications
/bonus:Knowledge in healthcare or biology areasData quality tools: Great expectation, pydantic, pandera, SQLMesh etc, Debugging Refactoring skillsWhat we offerFlexible work organization Friendly and informal working environmentOpportunity to work with an international team with high technical and scientific backgroundsRecruitment Process SecurityPlease complete the form and submit your CV. Owkin is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, sex, gender, sexual orientation, age, color, religion, national origin, protected veteran status or on the basis of disability. Owkin is a great place to work.
As a coveted workplace we are, unfortunately, vulnerable to recruitment phishing scams.
We urge all job seekers and candidates to be wary of potential scams. Most of these have individuals posing as representatives of prominent companies, including Owkin, with the aim of obtaining personal, sensitive, or financial information from applicants.
These scams prey upon an individual’s desire to obtain a job and can sometimes “feel” like a genuine recruitment process. Some red flags are identified below. Should you encounter a recruitment process that claims to be for Owkin but is not consistent with the below, please do not provide any personal or financial information:Legitimate Owkin recruitment processes include communication with candidates through recognized professional networks, such as LinkedIn. Communication is always through an official Owkin email address (from the @owkin. com domain), over the phone or through our applicant tracking system (Greenhouse).
The Owkin talent team do use platforms such as LinkedIn and Job Teaser, however if you have any concern or doubt about this contact, please ask for them to send an email from @Owkin. com.
The Owkin talent team will not solicit personal data from candidates during the application phase including , but not limited to, date of birth, social security numbers, or bank account information;Legitimate Owkin interviews may be conducted over the phone, in person, or via an approved enterprise videoconferencing service (Google Meets).
They will not occur via Signal, Telegram or MessengerOwkin offers of employment are based on merit and only extended once a candidate has interviewed with members of the talent and hiring team. Offers will be extended both verbally and in written format.
If you think that you have been a victim of fraud, Check the identity of the talent team on LinkedInCheck our senior team on our website https://owkin. com/team/Check the existence of the position on our website: https://www. owkin. com/careers#current-opportunitiesNotify Owkin's recruitment unit at this address hiring@owkin. comcontact the following authorities:[FR] https://internet-signalement. gouv. fr/[UK] https://www. actionfraud. police. uk/reporting-fraud-and-cyber-crime[US] https://reportfraud. ftc. gov/.
Discover the company
Explore other offers from this company or learn more about Owkin.
The company
O
Owkin Germany, United Kingdom




