Job Description

  • Collaborate with stakeholders, DevSecOps and data scientists to execute on the analytics roadmap.
  • Understand the business domain and document requirements for data pipelines that enable descriptive, predictive and prescriptive analytics.
  • Analyze and explore data from several very large data systems and data sources, and perform complex manipulations including federated joins, imputation and deduplication.
  • Leverage SQL, Python and PySpark to analyze, clean, transform, and persist data from databases as well as structured, unstructured and semi-structured data from raw files.
  • Write and tune SQL and SparkSQL queries against databases and Data Lakes.
  • Productionize data pipelines and add monitoring, support and operational metrics capabilities.
  • Create visualizations and dashboards using Tableau.
  • Write unit, integration and regression tests to test data pipeline jobs.
  • Create entity-relationship diagrams (ERDs) and validate designs with prototype data models.
  • Collaborate with the data science team to iteratively develop optimized data models for machine learning.
  • Innovate by experimenting with the latest technologies, and gain stakeholder buy-in by leveraging objective metrics.
  • Present data-driven solutions to stakeholders, including executive management.

Minimum Requirements

  • U.S. Citizenship with the ability to obtain a U.S. Government Security Clearance.
  • Intermediate to advanced level hands-on knowledge of SQL.
  • 3-5 years’ experience with at least one of the major relational databases: Oracle, Postgres, MySQL.
  • Hands-on experience with Python, including either the pandas DataFrame API for data manipulation or PySpark and Spark DataFrames.
  • Experience with structured, unstructured and semi-structured data in multiple file formats including text, CSV and JSON files.
  • Experience implementing various data engineering patterns.
  • Experience with one of the major BI tools (Tableau, QlikView, Power BI, etc.).
  • Experience writing unit, integration and regression tests.
  • Understanding of the machine learning life cycle.
  • Experience with one major notebook environment (Jupyter, Colab, Databricks, etc.) for Python.
  • Effective communication, documentation and problem-solving skills.
  • Ability to work in a fast-paced environment with a can-do attitude.

Preferred Qualifications

  • Experience with Databricks Unified Analytics Platform for data engineering is strongly preferred.
  • Experience with dbt (data build tool).
  • Experience with AWS DMS (Database Migration Service).
  • Experience with Spark and Spark SQL, including the structured DataFrame API, is strongly preferred.
  • Experience with AWS cloud and AWS big data technologies like SageMaker, Athena, EMR, Glue, S3, etc.
  • Experience with machine learning with Python and/or Spark libraries.
  • Experience with big data file formats like Parquet & Delta.
  • Experience creating a Delta Lake or data lake for large, complex data.

Company Benefits

PSI offers full-time, benefits-eligible employees a competitive total compensation package that includes paid leave and options for employer-sponsored group medical, dental, vision, short-term and long-term disability, life insurance, AD&D coverage, legal services, identity theft, and accident insurance. Flexible spending account and health savings account options offer pre-tax savings for qualified medical, dental, and vision expenses. The company-sponsored 401(k) retirement plan has an employer contribution match that is immediately vested. We invest in the professional growth of our employees through professional courses, certifications, and tuition reimbursement programs.

EEO Commitment

It is company policy to promote equal employment opportunities. All personnel decisions, including, but not limited to, recruiting, hiring, training, promotion, compensation, benefits, and termination, are made without regard to race, color, religion, age, sex, sexual orientation, pregnancy, gender identity, genetic information, national origin, citizenship status, veteran status, protected veteran status, disability, or any other characteristic protected by applicable federal, state, or local law.

Reasonable accommodations for applicants and employees with disabilities will be provided. If a reasonable accommodation is needed to participate in the job application or interview process, to perform essential job functions, and/or to receive other benefits and privileges of employment, please contact Human Resources by emailing HRDepartment@plan-sys.com, or by dialing 703-575-8400.
