We are seeking a talented and a motivated Senior Data Engineer to join our dynamic team. As a Senior Data Engineer, you will be responsible for designing, implementing, and maintaining scalable data pipelines and infrastructure to support our data-driven initiatives. You will collaborate closely with cross-functional teams to understand data requirements, optimize data models, and ensure the reliability and performance of our data systems.
Key Responsibilities:
- Design, develop, and maintain robust data pipelines to ingest, process, and transform large volumes of structured and unstructured data from various sources (e.g., Database, API, SFTP).
- Build and optimize data models and schemas to support efficient data storage, retrieval, and analysis.
- Implement data integration solutions to consolidate data from disparate systems and sources.
- Work closely with Data Scientists and Analysts to understand their requirements and provide them with clean, reliable, and well-structured data.
- Collaborate with DevOps and Infrastructure teams to deploy and manage data infrastructure in cloud environments (e.g., AWS, Azure).
- Familiar with the concept of data security to ensure the data pipelines are secured and PII (Personally Identifiable Information) data are encrypted.
- Monitor and optimize the performance, scalability, and reliability of data systems to ensure high availability and low latency.
- Design and develop MLOps production pipelines; provide technical support to data scientists/ML engineers by getting their ML/DL models deployed at scale and meeting SLAs on both cloud and on-premises GPU and CPU instances.
- Develop and maintain documentation, standards, and best practices for data engineering processes and technologies.
- Stay current with emerging technologies and trends in data engineering and contribute to the continuous improvement of our data architecture and practices.
- Explore, evaluate, and champion the introduction of next-generation technologies in the data-ingestion workflow. Participate in project planning and provide technical guidance on cloud architecture for data projects.
Requirements:
- BS in Computer Science or other related discipline is required. Advanced degrees in Computer Science (PhD, MS) are highly desirable.
- Proven experience (3 years) working as a Data Engineer or in a similar role.
- Strong proficiency in programming languages such as Python. Conversant with data structures and algorithm design.
- Hands-on experience with distributed computing frameworks and big data technologies such as Spark, Kafka, etc.
- Proficiency in SQL and experience with relational databases (e.g., Azure SQL DB, AWS Redshift).
- Experience with data modelling, ETL/ELT processes, and data warehousing concepts.
- Familiarity with cloud platforms and services (e.g., AWS S3, AWS Step Functions, AWS Airflow, Azure Data Logic Apps, Azure Blob Storage).
- Experience with setting up SSO using AD and implementing RBAC (Role Based Access Control) for data security.
- Excellent problem-solving skills and attention to detail.
- Effective communication and collaboration skills, with the ability to work effectively in a cross-functional team environment.
- Experience with version control systems (e.g., Git) and CI/CD pipelines is a plus.