Job Description:
Role activities:
- Be a fundamental part of data science team to collaborate with data scientists, product owners and project managers to build machine learning automation/pipeline solutions
- Have strong database background and SQL server/Hadoop data domain expertise a plus
- Ability to take a model from development to production
- Experience with ML model Cloud Deployment and performance optimization - Azure ML and Databricks are a strong plus
- Help architect automation framework scaffolding to leverage standardized processes
- Ability to scale solutions to handle large amounts of data efficiently and with controls to ensure failsafe product delivery
- Ability to understand the big picture not just a single solution but how hardware and software solutions align and to stay ahead of capacity needs
- Implement CI/CD processes to the ML model development and deployment life cycle
o Required qualifications:
- 3-5 years of progressing Python development experience using various packages and dependency resolutions
- 3-5 years of experience working with Hadoop, HIVE, Spark, and other Big Data tools
- Hands on experience with Custom UI on Elastic Search and Data Visualization with Kibana
- Data Quality Control including Data Cleansing/Wrangling and Data Pipeline builds and management
- Automation of batch jobs using AutoSys and other scheduling including Monitoring and Controls of batch Jobs
- Unix/Linux Administration experience
- Agile Experience required with use of JIRA for workload managements
- Experience in multiple Dev Pipelines with CI/CD experience using BitBucket, Ansible, Jenkins, and other tools
- Understanding of model building architecture
o Preferred qualifications:
- NLP Model Development and Testing include predefined Models such as BERT and RoBERTA
- ElasticSearch API and Web Application development
Job Band:
H5
Shift:
1st shift (United States of America)
Hours Per Week:
40
Weekly Schedule:
Referral Bonus Amount:
0