Apex Systems is currently looking for a Data Engineer to help support on of our clients. This will be a full-time, direct placement with the client. They are looking for a resource to support data pipeline creation. The client is currently focused in an Azure Data house, including the use of Data Factory, Data Lake, and Databricks.
RESPONSIBILITIES:
Primary focus of this role is to architect, develop and maintain data pipelines using analytical, data driven approach for constantly evolving business and that aligns with DFC's long-term technical data strategy
- Build scalable & repeatable batch & real-time data pipelines using standard frameworks & best practices
- Collaborate with technical & non-technical teams across organization to craft & build data models/APIs using Scala/Python/Spark on Azure/AWS
- Catalogue end-to-end data flows
- Support governance of data usage to ensure consistency, including development of data dictionary and guidance/rules on usage of specific metrics
- Supports data infrastructure including PBI data models, development of Bronze, Silver & Gold-level tables/views in Azure/Synapse/Databricks
- Provide mentorship on technical decision that will have an impact on the analytics community
- Participate in project planning, defining breakthroughs & delivarables
- Mentors less experienced developers or data engineers
EDUCATION & EXPERIENCE:
We are looking for someone who has:
- Bachelor's degree (Master's preferred) in technical field or equivalent work experience
- 3-5 years of experience in data engineering with an emphasis on data analytics and reporting
- 3-5 years of proven track record with at least one of the following cloud platforms: Microsoft Azure, Amazon Web Services (AWS), Google Cloud Platform (GCP), others
- 3-5 years of experience in the design and build of data extraction, transformation, and loading processes by writing custom & complex data pipelines
- consistent track record with one or more of the follow scripting languages: Scala, Python, SQL, Spark and/or other
- Proven understanding of machine learning methods