Job description

Job Description Print Preview 09/07/24, 3:07PM

Job Title: Engineer - Data Engineering

Key Responsibilities:
Data Pipeline Development: Design, develop, and maintain scalable data pipelines using PySpark to process large volumes of data from various sources.
Data Integration: Integrate data from multiple data sources and formats, ensuring high data quality and reliability.
Optimization: Optimize and tune data processing jobs for performance and cost-efficiency.
Collaboration: Work closely with data scientists, analysts, and other stakeholders to understand data requirements and deliver high-quality data solutions.
ETL Processes: Develop and maintain ETL processes to extract, transform, and load data into data warehouses and data lakes.
Data Quality: Implement data validation and monitoring processes to ensure data accuracy and consistency.
Documentation: Document data engineering processes, workflows, and best practices.
Troubleshooting: Identify, troubleshoot, and resolve data-related issues promptly.

Required Qualifications:
Experience: 3+ years of experience in data engineering or a related field.
Education: Bachelor's degree in Computer Science, Information Technology, Engineering, or a related field.
Technical Skills: Proficiency in PySpark and Python. Strong knowledge of big data technologies such as Hadoop, Hive, and Spark. Experience with cloud platforms (e.g., AWS, Azure, GCP) and their data services. Familiarity with data warehousing solutions (e.g., Amazon Redshift, Google BigQuery, Snowflake). Knowledge of relational and NoSQL databases (e.g., MySQL, MongoDB, Cassandra).
Data Processing: Experience with ETL/ELT processes and data pipeline orchestration tools (e.g., Apache Airflow, Apache NiFi).
Problem-Solving: Strong analytical and problem-solving skills.
Communication: Excellent verbal and written communication skills, with the ability to explain complex technical concepts to non-technical stakeholders.

https://hcm44.sapsf.com/xi/ui/rcmcommon/pages/jobReqPrintPreue&_s.crb=inNKb6awspU%2bnUdrviSp5o7OpOcjuCfnfTg%2btMzbKZU%3d
