The Center for Computational Biomedicine (CCB) is a new center within the Blavatnik Institute at Harvard Medical School. Our mission is to provide cutting-edge computational capabilities, data analysis, and data integration technologies to support medical and biological research within the Medical School.
We seek a highly motivated, collaborative individual with excellent communication skills to join our team of technologists and scientists as a Principal Data Engineer. You will help build out the necessary data warehousing infrastructure to support the downstream machine learning analysis and integration of large, complex data sets that will provide a nuanced longitudinal perspective on population- and individual-level health outcomes and disease trajectories. These data sets include healthcare insurance claims, electronic health records, genomics, environmental exposure, and other data modalities. The integration of these data will allow our research teams to make ground-breaking advances in the areas of precision medicine, healthcare AI, healthcare policy/economics, and basic science, all with the goal of improving patient outcomes.
In this role, you will work to develop innovative solutions for warehousing and integrating these large data resources. You will have frequent opportunities to dive deep to troubleshoot complex technical issues in large data management systems, working hand-in-hand with our world-class Information Technology and Research Computing teams. Your work will involve a wide variety of technologies including analytic relational data warehouses (column and row stores), graph databases, array databases, object stores, key/value stores, scale-up and scale-out storage platforms, containerized services, and others. Solutions will be deployed both on-premises and in private and public cloud environments.