Senior Data Scientist, Natural Language

Company:
Location: Remote

*** Mention DataYoshi when applying ***

It’s Time For A Change…

Your Future Evolves Here

Evolent Health has a bold mission to change the health of the nation by changing the way health care is delivered. Our pursuit of this mission is the driving power that brings us to work each day. We believe in embracing new ideas, testing ourselves and failing forward. We respect and celebrate individual talents and team wins. We have fun while working hard and Evolenteers often make a difference in everything from scrubs to jeans.

Are we growing? Absolutely. We have seen about 30% average growth over the last three years. Are we recognized? Definitely. We were named one of “Becker’s 150 Great Places to Work in Healthcare” in 2016, 2017, 2018 and 2019 and are proud to be recognized as a leader in driving important Diversity and Inclusion (D&I) efforts: Evolent achieved a 95% score on its first-ever submission to the Human Rights Campaign's Corporate Equality Index; was named on the Best Companies for Women to Advance List 2020 by Parity.org; and we publish an annual Diversity and Inclusion Annual Report to share our progress on how we’re building an equitable workplace. We recognize employees that live our values, give back to our communities each year, and are champions for bringing our whole selves to work each day. If you’re looking for a place where your work can be personally and professionally rewarding, don’t just join a company with a mission. Join a mission with a company behind it.

This position is for the Chicago location.


What You’ll Be Doing:


The Senior Data Scientist will support building of AI products in Agile fashion that empower healthcare payers, providers and members to quickly process medical data to making informed decisions and overall reduce health care costs. As a research scientist/engineer part of Data Science and Artificial Intelligence team you will be working primarily on unstructured text data to build machine learning models for information retrieval applications. These applications include but are not limited to optical character recognition, understanding the contents of the medical documents using natural language processing, and integrating processes into the overall AI pipeline to mine healthcare and medical information with high recall and other relevant metrics. We ingest claims, medical charts, etc. from providers containing unstructured data which will be transformed into structured data to support automated entry into our storage layers for downstream applications. The results will be used dually for real-time operational processes with both automated and human-based decision making as well as contribute to reducing healthcare administrative costs. We work with all major cloud and big data vendors offerings including but not limited to (Azure, AWS, Google, IBM, etc.) to achieve AI goals in healthcare and support Evolent business.

  • Develop Natural Language Medical/Healthcare documents comprehension related products to support Evolent Health business objectives, products and improve processing efficiency, reducing overall healthcare costs
  • Gather external data sets; build synthetic data and label data sets as per the needs for NLP/NLR/NLU
  • Apply software engineering skills to build Natural Language products to improve automation and improve user experiences leveraging unstructured data storage, Entity Recognition, POS Tagging, ontologies, taxonomies, data mining, information retrieval techniques, machine learning approach, distributed and cloud computing platforms
  • Build the Natural Language and Text Mining products — from platforms to systems for model training, versioning, deploying, storage and testing models with creating real time feedback loops to fully automated services
  • Work closely and collaborate with Data Scientists, Machine Learning engineers, IT teams and Business stakeholders spread out across various locations in US and India to achieve business goals
  • Provide support to other Data Scientist and Machine Learning Engineers


The Experience You’ll Need (Required):

  • MS degree or above in Computer Science, Computational linguistics, Mathematics, Physics or related STEM fields
  • 5+ years of Industry experience related to Unstructured Text Data and NLP
  • Strong understanding of mathematical concepts including but not limited to linear algebra, Advanced calculus, partial differential equations and statistics including Bayesian approaches
  • Strong programming experience including understanding of concepts in data structures, algorithms, compression techniques, high performance computing, distributed computing, and various computer architecture
  • Good understanding and experience with traditional data science approaches like sampling techniques, feature engineering, classification and regressions, SVM, trees, model evaluations
  • Additional course work, projects, research participation and/or publications in Natural Language processing, reasoning and understanding, information retrieval, text mining, search, computational linguistics, ontologies, semantics
  • Experience with developing and deploying products in production with experience in two or more of the following languages (Python, C++, Java, Scala)
  • Strong Unix/Linux background and experience with at least one of the following cloud vendors like AWS, Azure, and Google
  • Hands on experience with one or more of high-performance computing and distributed computing like Spark, Dask, Hadoop, CUDA distributed GPU
  • Thorough understanding of deep learning architectures and hands on experience with one or more frameworks like tensorflow, pytorch, keras
  • Hands on experience with libraries and tools like Spacy, NLTK, Stanford core NLP Genism, johnsnowlabs


Finishing Touches (Preferred):

  • Exposure to linguistic background especially language models
  • Medical concepts with codes from standard ontologies (SNOMED CT, LOINC, RxNorm, ICD, etc.)
  • Lucene, Solr, Elastic Search experience
  • Experience with Kubernetes and dockers
  • Experience building REST API’s for AI work and knowledge of microservices architecture
  • Experience working with team members spread globally.


Technical requirements:

During the current pandemic Evolent employees are working remotely from home. As such we require that all employees have the following technical capability at their home: High speed internet over 10 MBPS and, specifically for all call center employees, the ability to plug in directly to the home internet router. These at-home technical requirements are subject to change with any scheduled re-opening of our office locations.


Evolent Health is an equal opportunity employer and considers all qualified applicants equally without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran status, or disability status.


#LI-Remote

*** Mention DataYoshi when applying ***

Offers you may like...

  • Beekeeper AG

    Senior Data Analyst
    Kraków, małopolskie
  • Harnham

    Senior Data Analyst (m/f/d)
    Berlin
  • Maderik Institute Of Management AB

    Senior Data Analyst /Stockholm
    211 19 Malmö
  • Ubisoft

    Senior Data Analyst, Marketing Analytics
    San Francisco, CA 94107
  • GSK

    Senior Data Analyst
    Philadelphia, PA