A Lead Data Scientist provides tactical and strategic guidance to others, coordinating among a diverse set of teams in IT teams and business stakeholders. They provide technical leadership on large scale cross functional projects, develop strategy, provide insight into allocation of resources (resourcing and planning), and mentor others. This position will guide the strategic and technical development of centralized infrastructure capabilities and processes to support decision science across the organization, with a particular focus on data quality. It will require close partnerships many teams across the organization to shape solutions as they are developed and to assist in enabling and migrating production workflows.
Ph.D. in Computer Science, Computer Engineering or related field with 5+ years experience; or Masters 8+ yrs of experience;
Experience shaping technical strategies to address systemic problems, in partnership with other technical teams
Expertise building models using R, Python or other statistical and/or mathematical programming packages; proficiency in R and Python programming languages
Expert level of proficiency in computational skills, including a deep understanding of advanced analytics techniques and emerging forms of analytics
Experience with model management and model orchestration in cloud computing environments such as AWS or GCP
Experience with containerized deployment methods (e.g. Docker)
Proficiency in consuming REST based API and/or gRPC framework output
Experience working with models that require large datasets and are computationally intensive
Experience with source control methodologies (e.g. Git)
Familiarity with common data quality methodologies used by various personas including data engineers and data scientists
Expertise in adopting and advocating software development best practices
Ability to take initiative and drive work independently
Experience leading technical initiatives involving stakeholders from a variety of scientific and business backgrounds
Expertise with quantitative modeling techniques such as statistics, machine learning, optimization, and simulation
Excellent communication skills with the ability to communicate complex qualitative analysis in a clear, precise and actionable manner and deliver presentations to large audiences, executive leadership, and externally at conference and collaborations.
Experience with Apache Airflow, Kubernetes, SageMaker, SQL, MapReduce
Proficiency in Java and/or other modern programming languages
Location: Creve Coeur, MO or Remote from another location. If local, they must work onsite (once the site reopens). Only open to candidates who have prior experience working remotely. Make sure it is clearly listed on the resume where they worked remotely. Please include in the Summary on the resume what tools they used to work remotely.