We are seeking a Data Scientist to work on exciting client-facing projects.
You will be responsible for:
Interpret messy structured and unstructured clinical data to produce actionable insights
Work with parallelized and modular data ingest and machine learning techniques
Improve the performance of Carta’s automation and machine learning pipelines
Work with clinical experts to generate de-identified datasets for evaluating our ML algorithms
Orchestrate the training, deployment, and evaluation of ML pipelines across multiple customer deployments.
Collaborate with other team members and stakeholders.
Qualifications and Skills:
Masters/PhD in Biomedical Informatics, Statistics, Computer Science or related field
Strong understanding of the core principles of statistical/machine learning
Experience working with clinical data
Experience with Natural Language Processing tools (e.g. Spacy)
Python expertise (4+ years' experience)
Proficient understanding of code versioning tools (i.e. git)
Familiarity with Jupyter notebooks or other data science tools
Strong data visualization skills (Tableau, matplotlib, etc.)
Experience with the FHIR standard for medical data
Postgres or SQL expertise
A hands-on, engaged approach to solving problems
Excellent communication skills and experience in collaborative environments
The desire to be continually learning about emerging technologies/industry trends
Job is remote but candidates must be US-based.
Professional Competencies:
Good verbal communication skills
Works well in a small, fast paced, team
Candidates must be highly motivated and self-sufficient, possess strong analytical and critical thinking skills and be able to adapt to new technologies quickly
Persistent and creative at finding solutions to problems on your own; able to use documentation, Google searches, and trial and error to solve problems you have not seen before.