Data Engineer (Knowledge Graph)

Job description

Do you have a passion for science? Would you like to apply your expertise to impact a company that follows the science and turns ideas into life-changing medicines? Then AstraZeneca might be the one for you!

At AstraZeneca, we put patients first and strive to meet their unmet needs worldwide. Working here means being ambitious, thinking big and working together to make the impossible a reality. If you are swift to action, confident to lead, willing to collaborate, and curious about what science can do, then you’re our kind of person.

Ready to explore and innovate in a dynamic environment? Join our team unlocking the power of what science can do! We're seeking a skilled and motivated Data Engineer to work on knowledge graphs, helping to transform the R&D process and speed up the design and delivery of new medicines to patients. You would be joining our Machine Learning and AI team within the Biopharmaceutical Development (BPD) R&D group.

As a Data Engineer specializing in knowledge graph development, your role will be pivotal in the design, creation, and maintenance of our organization's knowledge graph infrastructure. Your expertise will be instrumental in advancing knowledge management and discovery, with a primary focus on ensuring that this knowledge is Findable, Accessible, Interoperable, and Reusable (FAIR) for scientists across the company. Our work revolves around collaboration with scientists and engineers from diverse fields within biopharmaceutical development to analyze data effectively and extract valuable information, knowledge, and insights. Our collective mission is to optimize and revolutionize R&D workflows, expediting the design and delivery of innovative medicines to patients.

Our Gaithersburg site is one of AstraZeneca’s strategic R&D centers, and the main hub for the Biopharmaceutical Development function. Here, we follow the science to explore and innovate; working towards treating, preventing, modifying, and even curing some of the world's most complex diseases. We're committed to making a difference by fusing data and technology with the latest scientific innovations to streamline and transform the R&D process.

In this role, you will:

  • Architect, build, fine-tune, and continuously evolve knowledge graph solutions and knowledge-based systems that support BPD’s data science initiatives.
  • Integrate data from diverse sources (structured and unstructured) into the knowledge graphs, upholding data quality and consistency.
  • Develop and maintain data models, schemas, and ontologies for knowledge graphs, ensuring they accurately represent complex concepts and relationships.
  • Transform and pre-process data to fit into the knowledge graph structure, including data cleaning, enrichment, and normalization.
  • Continuously monitor and optimize knowledge graphs for performance and scalability.
  • Develop graph query services to extract insights from knowledge graphs for users including data scientists and lab scientists.
  • Collaborate with cross-functional teams and partners to understand business requirements, identify knowledge gaps, and define data-driven strategies to streamline R&D processes.
  • Stay up to date with cutting-edge technologies in the field, including streaming data processing, and help further a culture of innovation and excellence within the team.

Essential for the role:

  • M.Sc. in a relevant field (such as mathematics, computer science, engineering) with 5+ years of industry experience, or B.S. with 10+ years of industry experience.
  • Experience with graph database systems (Neo4j, Amazon Neptune, Virtuoso) and related query languages (Cypher, SPARQL).
  • Strong programming skills (Python, Java, etc.).
  • Excellent problem-solving skills and ability to work collaboratively in a cross-functional environment.
  • Familiarity with semantic web technologies, ontology modeling, and graph algorithms.

Desirable for the role:

  • Advanced degree in a relevant field (such as mathematics, computer science, engineering) with a track record of industry experience.
  • Experience with data transformation and ETL processes.
  • Prior experience with Natural Language Processing (NLP) and common machine learning models.
  • Experience with cloud computing platforms such as AWS (preferred), Google Cloud, or Azure would be advantageous.

Why AstraZeneca? We offer an inclusive environment where you can work seamlessly as one while expanding your horizons. Our leaders inspire us by engaging others, building compelling cases, and sharing ideas for change. We invest in understanding disease and creating the next generation of therapeutics. Our work impacts over one billion patients worldwide!

Benefits include lifelong learning opportunities through our diverse portfolio, teamwork, cutting-edge innovations, early talent programs, postdoctoral opportunities, women as leaders initiatives, and more.

Are you ready to push the boundaries of science? Apply now! AstraZeneca is an equal opportunity employer that values diversity and inclusion. We welcome all qualified applicants regardless of race, color, religion, sex, sexual orientation, gender identity or expression, national origin, age, disability, veteran status, or any other legally protected characteristics.

About AstraZeneca in Gaithersburg, MD:

Our Gaithersburg, Maryland facility creates life-changing medicines for people around the world. This campus employs more than 3,500 experts in our field and is only a short drive from Washington, DC. This modern and vibrant scientific campus is the home of R&D and Oncology in the US. Here, we play host to some of the most pioneering technology and lab spaces, all designed to inspire collaboration and multi-functional science. We believe employees benefit from being challenged and inspired at work. We are dedicated to creating a culture of inclusion and collaboration.

The Gaithersburg site offers a variety of amenities to help boost productivity and keep our employees happy and healthy. These include a fitness center, employee healthcare clinic, electric vehicle charging stations, dry cleaning, a full-service cafeteria, and a copy center. This is where you’ll find newly designed, activity-based workspaces to suit a variety of working styles while increasing collaboration between teams.
