Job Description
Role Responsibilities
- Identifying and working with software and hardware partners to assess data driven solutions, actionable insights and roadmaps for inclusion in future solutions
- Being the technical expert and resource for data management and analytics
- Performs analysis and design for medium-sized to large or complex development and maintenance projects
- Selecting features, building and optimizing classifiers using machine learning techniques
- Data mining using state-of-the-art methods
- Extending company’s data with third party sources of information when needed
- Enhancing data collection procedures to include information that is relevant for building analytic systems
- Processing, cleansing, and verifying the integrity of data used for analysis
- Doing ad-hoc analysis and presenting results in a clear manner
- Creating automated anomaly detection systems and constant tracking of its performance Requirements
- Extensive experience working with IaaS and PaaS systems ( Microsoft Azure, Amazon Web Services etc)
- Experience building large scale data warehousing solutions, analysis, design, development, and performance tuning
- Experience architecting highly scalable, distributed systems using different open source and commercial tools for multi-terabyte data warehouses
- Solid understanding and experience with high-scale or distributed RDBMS and knowledge of NoSQL platforms
- Experience normalizing complex datasets from a variety of sources
- Hands on experience in configuring Hadoop cluster of major Hadoop distributions
- Experience in working with ecosystems like Hive, Pig, Sqoop, Map Reduce
- A clear understanding of cloud service and deployment models
- Excellent understanding of machine learning techniques and algorithms, such as k-NN, Naive Bayes, SVM, Decision Forests, etc.
- Experience with common data science toolkits, such as R, Shiny, MatLab, etc . Excellence in at least one of these is highly desirable
- Great communication skills
- Experience with data visualization tools, such as D3.js, GGplot, etc.
- Proficiency in using query languages such as SQL, Hive, Pig etc.
- Experience with NoSQL databases, such as MongoDB, Cassandra, HBase
- Good applied statistics skills, such as distributions, statistical testing, regression, etc
Additional Comments
5- 6 Yrs Data Engineers Role Responsibilities
- Identifying and working with software and hardware partners to assess data driven solutions, actionable insights and roadmaps for inclusion in future solutions
- Being the technical expert and resource for data management and analytics
- Performs analysis and design for medium-sized to large or complex development and maintenance projects
- Selecting features, building and optimizing classifiers using machine learning techniques
- Data mining using state-of-the-art methods
- Extending company’s data with third party sources of information when needed
- Enhancing data collection procedures to include information that is relevant for building analytic systems
- Processing, cleansing, and verifying the integrity of data used for analysis
- Doing ad-hoc analysis and presenting results in a clear manner
- Creating automated anomaly detection systems and constant tracking of its performance Requirements
- Extensive experience working with IaaS and PaaS systems ( Microsoft Azure, Amazon Web Services etc)
- Experience building large scale data warehousing solutions, analysis, design, development, and performance tuning
- Experience architecting highly scalable, distributed systems using different open source and commercial tools for multi-terabyte data warehouses
- Solid understanding and experience with high-scale or distributed RDBMS and knowledge of NoSQL platforms
- Experience normalizing complex datasets from a variety of sources
- Hands on experience in configuring Hadoop cluster of major Hadoop distributions
- Experience in working with ecosystems like Hive, Pig, Sqoop, Map Reduce
- A clear understanding of cloud service and deployment models
- Excellent understanding of machine learning techniques and algorithms, such as k-NN, Naive Bayes, SVM, Decision Forests, etc.
- Experience with common data science toolkits, such as R, Shiny, MatLab, etc . Excellence in at least one of these is highly desirable
- Great communication skills
- Experience with data visualization tools, such as D3.js, GGplot, etc.
- Proficiency in using query languages such as SQL, Hive, Pig etc.
- Experience with NoSQL databases, such as MongoDB, Cassandra, HBase
- Good applied statistics skills, such as distributions, statistical testing, regression, etc
About Us
For more than 20 years, UST has worked side by side with the world’s best companies to make a real impact through transformation. Powered by technology, inspired by people and led by our purpose, we partner with our clients from design to operation. Through our nimble approach, we identify their core challenges, and craft disruptive solutions that bring their vision to life. With deep domain expertise and a future-proof philosophy, we embed innovation and agility into our clients’ organizations—delivering measurable value and lasting change across industries, and around the world. Together, with over 29,000 employees in 30 countries, we build for boundless impact—touching billions of lives in the process.
Visit us at UST.com .