We are seeking a skilled and experienced Data Scientist to join our team. As a Data Scientist, you will play a crucial role in analyzing large datasets, developing machine learning models, and providing data-driven insights to drive business decisions. You should have a strong background in statistics, programming, and machine learning techniques, with specific experience using Databricks.
Responsibilities
- Data Analysis and Exploration: Analyze large and complex datasets using statistical techniques and exploratory data analysis to uncover patterns, trends, and insights.
- Identify key variables and perform feature engineering to prepare data for modeling.
- Machine Learning Model Development: Develop and deploy machine learning models to solve business problems.
- Apply various algorithms such as regression, classification, clustering, and natural language processing to extract actionable insights from data.
- Databricks Expertise: Utilize Databricks to manage and analyze large-scale datasets efficiently.
- Leverage its features, such as data ingestion, data transformation, and model deployment, to build scalable and reliable data science solutions.
- Model Evaluation and Validation: Evaluate model performance using appropriate metrics and validation techniques.
- Fine-tune models for optimal performance, considering factors like accuracy, precision, recall, and computational efficiency.
- Collaboration and Communication: Collaborate with cross-functional teams, including data engineers, software developers, and business stakeholders, to understand requirements and deliver impactful data-driven solutions.
- Communicate complex analytical concepts and findings to non-technical stakeholders in a clear and concise manner.
- Continuous Learning and Research: Stay updated with the latest advancements in data science, machine learning, and Databricks technologies.
- Participate in research activities, attend conferences, and explore new methodologies to enhance the team's capabilities.
Requirements
- Experience: 4-7 years of experience as a Data Scientist, with a proven track record of developing and deploying machine learning models in real-world applications.
- Strong Programming Skills: Proficiency in Python or R for data manipulation, analysis, and model development.
- Experience with SQL for data querying and manipulation.
- Machine Learning Expertise: Solid understanding of various machine learning algorithms, including regression, classification, clustering, and natural language processing. Hands-on experience with libraries such as sci-kit-learn, TensorFlow, or PyTorch.
- Databricks Proficiency: In-depth knowledge and hands-on experience with Databricks for data processing, model development, and deployment.
- Familiarity with Databricks features like Delta Lake, MLflow, and Spark.
- Statistical Analysis: Strong statistical skills and knowledge of statistical techniques for exploratory data analysis, hypothesis testing, and feature selection.
- Data Visualization: Experience with data visualization tools such as Tableau, Power BI, or matplotlib/seaborn for creating insightful visualizations and reports.
- Strong Problem-Solving Skills: Ability to understand complex business problems and apply analytical thinking to develop practical and innovative solutions.
- Communication Skills: Excellent verbal and written communication skills to effectively collaborate with cross-functional teams and present findings to stakeholders.
- Educational Qualification: A bachelor's or master's degree in Data Science, Computer Science, Statistics, or a related field. A Ph. D. is a plus.
- Continuous Learning Mindset: Enthusiasm for learning new technologies, methodologies, and industry trends. A passion for data science and its application in solving real-world challenges.