Data Engineer

Location: New York, NY

*** Mention DataYoshi when applying ***

This position can be based remotely in the United States

Prognos is an NYC-based healthcare startup whose mission is to improve health by driving the best actions learned from the world's data. To achieve this goal, we have curated the world’s largest clinical lab dataset, covering over 200M patients in the US, and are currently deploying cutting-edge technology for predicting disease at the earliest possible time.

The mission of the Data Science team at Prognos is to develop, deploy and maintain analytic and machine learning pipelines within Prognos’ products, addressing business-relevant problems in close collaboration with our Engineering, Clinical and Product teams.

We are looking for an experienced engineer to join the team and help us move this mission-critical task forward. This position will focus on helping us learn about patient health from medical time series. Are you interested in applying modern data engineering, MLOps and DevOps practices to complex health data? Do you want to develop and deploy production feature engineering, data tracking and model deployment pipelines? Then come work with us!

Candidates must have at least three years of prior professional experience working with large datasets, especially data engineering pipeline development. A bachelor’s degree or higher in Computer Science, Computer Engineering, Electrical Engineering or a similar quantitative field is preferred but not strictly necessary, depending on industry experience. Experience with medical data (Claims, Rx, Clinical) is a big plus.

Required Skills and Experience

  • At least 3 years of professional software engineering experience, with data systems as a primary responsibility.
  • Deep modern database/data warehouse expertise, with emphasis on the Apache Spark ecosystem.
  • Professional experience dealing with large, complicated datasets.
  • Python programming expertise: best practices, packaging, modern libraries, etc.
  • Experience with Docker, Kubernetes and build/deployment.
  • Experience with common AWS products and tools (EC2, S3, etc.).
  • Accustomed to working with Git and shared codebases.

Preferred Skills and Experience

  • Experience developing and maintaining data pipelines powering production ML models.
  • Experience developing or using modern data pipelining and lineage tracking tools.
  • Experience with distributed computing systems.
  • Experience with healthcare data and/or insurance data is a plus.

About Prognos Health

Prognos is a leading clinically focused healthcare analytics company with a platform that can query patient-centric data to answer key healthcare questions in minutes, not months. The prognosFACTOR™ platform addresses payer, life sciences and provider needs, enabling clients to securely, efficiently and cost-effectively analyze billions of lab and health records on more than 325 million de-identified patients. prognosFACTOR is HIPAA compliant and harmonizes and integrates lab data with other healthcare data assets from a trusted and diverse data ecosystem. For more information, visit

Values & Culture

  • We are collaborative. We put team trust and energy ahead of individual stardom. We are humble and willing to admit when we are wrong.
  • We go above and beyond. We exceed the needs of our partners and are not limited by our job descriptions. We are accountable for our actions, work, decisions, and results.
  • We are purposeful in all that we do. We focus on what matters and prioritize. We think in perspective and see the full picture.
  • We are curious. We learn from solving big problems. We are never satisfied and always strive for a better way. We aim to continually develop ourselves.
  • We are courageous and honest. We are not afraid to speak out. We challenge the process. We deal with conflict head-on.
  • We are enthusiastic. We are optimistic for change and a better future. We believe in the greater good. We celebrate accomplishments and have fun.

Our Mission

To improve health by driving the best actions learned from the world’s data

Our Vision

To prevail over disease and empower people everywhere to live life to the fullest

Selected Perks

  • Flexible work arrangements (e.g. no set hours), fully remote work, and unlimited PTO
  • Health Insurance
  • Life Insurance
  • Long Term Disability
  • Dental
  • Vision
  • 401(k)
  • HSA
  • FSA
  • Dependent Care Flexible Spending
  • Commuter benefits
  • Free access to One Medical Group
  • Gym discounts
  • Flexible work hours and locations
  • Health Advocate
  • Employee Stock Option Plan



