The European Bioinformatics Institute (EMBL-EBI) is seeking to recruit a highly motivated Lead Data Scientist/Bioinformatician to lead the integration of biomedical genomics and genetic variation data with the knowledge on proteins in the Universal Protein Resource (UniProt), a world-leading provider of protein information, infrastructure and tools.
The integration of biomedical data from large scale international initiatives into the UniProt Knowledgebase (UniProtKB) is essential to provide users with comprehensive information on the role of proteins in health and disease. Proteins shape our cells, tissues, and organs in response to the complex interplay of genomics with environmental and developmental cues. Proteins are key in understanding how individual genomic variation affects our phenotype, disease susceptibility, and drug responses. UniProt develops pipelines for the integration of sequence variation data from resources such as gnomAD and ClinVar. New technologies such as deep mutational scanning are emerging to determine the functional effects of thousands of variants in a certain protein in a single experiment. The development of computational tools will be critical in supporting variant analysis.
The successful candidate will have the wonderful opportunity of working with software engineers and scientists in a world-leading resource on proteins (UniProt) at EMBL-EBI. S(he) will join the UniProt development team whose work spans computational annotation of proteins, software development, large-scale compute, big data, data production and automation. We are part of the UniProt Consortium with teams in US and Europe and collaborate with consortia and scientific communities from all over the world to research how proteins work in cell biology.
The Protein Function development team maintains bioinformatics infrastructure of the world leading Universal Protein Resource (UniProt), Gene Ontology Annotation and Enzyme resource. This is a very dynamic and interdisciplinary team comprising talented and enthusiastic software engineers and bioinformaticians working closely together with the biocurators in these resources. The UniProt resource is one of the most internationally used protein databases serving a large and diverse research community in genomics, proteins and proteomics.
The Data Scientist will lead the analysis of biomedical data, work on integration of relevant genomic and variation data in UniProt, engage with relevant international initiatives, and take a lead role in the proposal and development of pipelines, tools and User Interfaces (UIs) specific to scientists’ field of expertise. The post holder’s responsibilities will include:
- Lead the identification, analysis and pipeline development and maintenance of genomic and clinical data resources relevant to proteins in UniProt
- Lead the analysis and integration of genomic annotation data into UniProtKB; including relevant clinical and general annotated variant data resources.
- Investigate and propose the integration and representation of active biomolecules into UniProtKB.
- Contribute with code and work with a devoted and enthusiastic team of software developers and bioinformatician to achieve the project goals
- Engage with researchers in relevant initiatives and present in meetings and conferences
- Propose methods for representing data types for diverse biological science users.
- Support the team with biological and bioinformatics expertise
- A degree in Life sciences and/or Bioinformatics, Data Sciences
- Good understanding of genomics, genetic variation, and protein function/structures
- Proficiency in a programming language (e.g. Python or Java)
- Experience working with multidisciplinary professional or academic team environment
- Ability to analyze, represent and manage large data sets, and design and implement data workflows
- Excellent communication, organizational and interpersonal skills
- Ability to write reports and present the team project in international settings
You might also have
- Experience of working with biomedical databases
- Experience using biological databases e.g. UniProt
- Knowledge of relational SQL programming
- Knowledge of NoSQL/graph databases
- Experience working with high-performance compute clusters
Why join us
EMBL-EBI is a worldwide leader in bioinformatics. We encourage personal and professional development and are committed to ensuring our employees feel valued, supported and empowered to reach their potential at work.
Surrounded by beautiful South Cambridgeshire countryside, our state-of-the-art Campus provides employees with a relaxed atmosphere where the pursuit of scientific discovery and meaningful work, underpin every role. In addition, our employees have access to a wide range of benefits including:
- 30 days annual leave per year plus 8 bank holidays
- Flexible work arrangements
- Unique pension scheme including 17% employer contribution
- Annual salary review
- Private medical insurance for you and your immediate family (including all prescriptions and generous dental & optical cover)
- Death benefit, long-term care, accident at work and unemployment insurances
- Relocation package including installation grant (if required)
- On-site library and subsidised on-site gym and cafeteria
- Casual dress code
- Free shuttle bus to and from Campus
- Extensive sports and social club activities (run on campus and remotely)
- Family benefits: Onsite nursery, child sick leave, generous parental leave, holiday clubs on campus and monthly family and child allowances
- Benefits for non-UK residents:Visa exemption, education grant for private schooling, financial support to travel back to your home country every second year and a monthly non-resident allowance
What else you need to know
To apply please submit a covering letter and CV through our online system.
At EMBL-EBI, we strongly believe that inclusive and diverse teams benefit from higher levels of innovation and creative thought. We encourage applications from women, LGBTQ and individuals from all nationalities. Visa exemptions will be offered to international candidates and details will be discussed during interview. We believe in science without borders and this continues after Brexit.
This role is based in Hinxton, UK and you will be required to relocate once it is safe to do so if you are currently based abroad. More info on how we are recruiting in this period is available on our page: Recruitment and Onboarding during Covid19.