Job Description :

HI There

Looking for a Data Quality Engineer for our client in Minneapolis, MN.

Locals are preffered since its on-site position.

Responsibilities:

Data Deidentification and Synthesis: 

  • Implement and maintain data deidentification pipelines using Tonic.ai (or similar tools) to mask and synthesize sensitive data.
  • Configure and customize Tonic.ai workflows to meet specific deidentification requirements and compliance standards.
  • Develop and execute data masking strategies to ensure data utility while preserving privacy.

Data Quality Framework Development:

  • Contribute to the development and implementation of data quality checks and validation processes.
  • Write Python scripts to automate data quality assessments and reporting.
  • Assist in identifying and resolving data quality issues.

AWS Cloud Infrastructure:

  • Deploy and manage applications and data pipelines on AWS services (e.g., EC2, S3, Lambda, RDS).
  • Utilize AWS services to scale and optimize data processing and storage.
  • Monitor and troubleshoot AWS infrastructure related to data deidentification and quality.

Python Programming:

  • Develop and maintain Python scripts for data manipulation, automation, and integration with various systems.
  • Write clean, efficient, and well-documented code.
  • Participate in code reviews and contribute to best practices.

 Collaboration and Communication:

  • Work closely with data engineers, data scientists, and other stakeholders to understand data requirements and deliver solutions.
  • Document technical specifications and procedures.
  • Communicate effectively with team members and provide regular updates on project progress.

Required Skills and Qualifications:

  • Bachelor’s degree in computer science, Engineering, or a related field.
  • Proficiency in Python programming.
  • Familiarity with AWS cloud services (e.g., EC2, S3, Lambda, RDS).
  • Understanding of data privacy concepts and deidentification techniques.
  • Strong problem-solving and analytical skills.
  • Excellent communication and collaboration skills.
  • Ability to learn quickly and adapt to new technologies.

Preferred Skills:

  • Experience with Tonic.ai or other data synthesis/masking tools.
  • Knowledge of SQL and database systems.
  • Experience with data quality frameworks and tools.
  • Familiarity with data governance and compliance standards (e.g., GDPR, HIPAA).
  • experience with version control such as git.