Principal Data Engineer
At Lundbeck’s RnD Biometrics division we are looking for a determined, proactive data engineer to join our Data Science department and drive our data systems and operations. If you are an experienced data engineer, interested to have an impact on the future of Machine Learning, Deep Learning, and Natural Language Processing for drug discovery, if you are intrigued when establishing systems and pipelines for big, complex, and heterogeneous datasets, and most importantly if you feel excited about becoming part of a multidisciplinary team aiming to advance treatments in brain health, we may have an great job offer for you!
Description
Biometrics Division has the responsibility for handling and analyzing data across the drug development value chains. The division consists of 6 departments: Data Science, Digital Solutions, Data Management, Biostatistics, Statistical Programming, and Digital Agile Lab. In total we are more than 70 people.
Data Science currently employs 9 highly skilled members. The primary responsibility of the Data Science department is to contribute in Lundbeck’s RnD projects. Our target is to improve portfolio decision making, enhance clinical trials, and reduce the risk for research and development by building trust in data-driven insights.
Our work is focused on feature extraction, modeling of disease trajectories, detecting disease sub-populations, NLP, text mining, patient level predictions, population level estimations, biomarker discovery, functional and structural medical imaging, EEGs, exploration of RWD, patient-generated data, medical signals, and time series.
Your job and key responsibilities
As a Principal Data Engineering you will be responsible for establishing and maintaining data pipelines for ML, DL, and NLP projects. Additionally, you are expected to be able to support our HPC systems and applications.
You will have to:
Establish and maintain the smooth operation of our computational systems, data management, and analytical applications.
Establish standardized and automated pipelines from raw data to validated results.
Execute projects that contribute in data standardization, integration, quality assurance, and quality control.
Create data visualizations that communicate a clear message to both researchers and decision makers.
Collaborate with multidisciplinary Lundbeck’s RnD teams.
Identify challenges at project design phase and communicate efficient solutions.
As an expert in your field, provide guidance to your colleagues and keep our organization updated on the latest developments.
Work in a regulated area involving patient data. You must appreciate the restrictions this imposes as well as the impact of analyses on major decisions in drug-development projects and budgets.
Few travel days can be expected.
We offer
Lundbeck’s passion links to our purpose about restoring brain health so every person can be their best. We strive to make a real difference to patients. By developing innovative treatments, we improve the lives of people living with brain diseases. Your dedication is crucial and that is why you can expect us to be committed to your progress, so you can stay committed to ours.
We offer a great workplace that is based on a flat structure, forming a collaborative working environment built on respect and equality. We employ dedicated colleagues and encourage continuous development. In Biometrics, we strive daily to bring our expertise within data handling and analysis into play and make a real difference to patients.
The position is placed in our Headquarter in Copenhagen. The desired start date is no later than March 1st, 2022.
Qualifications
Our ideal candidate has the following personal and professional qualifications:
- 3+ years of experience in data engineering, including working and optimizing SQL and No-SQL database management systems.
- 2+ years of experience working with (R or Python) and SQL.
- 1+ years of experience in Linux based environments, OS-level virtualization (e.g. docker containers), and bash scripting.
- Experience in data visualization.
- Experience with DataOps, MLOps, and version control.
- Familiar with AWS. Azure or GC is also a plus.
- Familiar with massively parallel processing databases.
- Highly motivated to continuously improve our data handling and analysis.
- Strong verbal and written communication skills.
- Fluent in oral and written English.
- Work independently with the ability to prioritize activities.
Additional Qualifications
- Experience with RWD, claims data, electronic medical records, or registry data.
- Experience with Elasticsearch.
- Experience with transformer and recurrent language models like BERT, GPT2, LSTM, Word2Vec.
- Methodological understanding and hands-on experience with Data Mining, Machine Learning, Deep Learning, Natural Language Processing, or Image Processing.
- Pharmaceutical industry experience or work experience in clinical or biomedical research.
Further information
For further information, please contact
Iannis Drakos, Data Science Director on
[email protected].
Your application and CV should not be sent via email.
We also recommend that you have a look at our website, LinkedIn and Instagram.
Your application
Please click on the apply button. Applications can be submitted no later than 10th of January 2022. We will be reviewing the applications and sending interview invitations on a rolling basis, so be sure to send yours early.