logo

View all jobs

GCP Data Engineer

Chicago, IL · Information Technology
Job Title: GCP Data Engineer
Duration: 6 months Contract to hire
Location: Chicago is the preferred location, but open to candidates from anywhere in the U.S.


Role Overview
We are seeking a highly skilled GCP Data Engineer to design, develop, and optimize scalable data solutions on Google Cloud Platform (GCP). The ideal candidate will have strong expertise in building robust batch and streaming pipelines, implementing modern data architectures, and enabling reliable, high-quality data platforms for analytics, reporting, and machine learning use cases.

Key Responsibilities
Data Engineering & Pipeline Development
  • Design, build, and optimize scalable batch and real-time (streaming) data pipelines using GCPnative services.
  • Develop and maintain data ingestion frameworks leveraging tools such as Pub/Sub, Dataflow, and Cloud Storage.
  • Implement data transformation pipelines using BigQuery, dbt, and Python-based workflows.
  • Ensure efficient handling of large-scale structured and unstructured datasets. Data Modeling & Architecture
  • Design and implement high-performance data models for cloud-based data lakes, data warehouses, and analytics platforms.
  • Optimize data schemas and partitioning strategies in BigQuery for performance and cost efficiency.
  • Support modern architectures such as medallion (bronze/silver/gold) layers and lakehouse patterns.

Development & Coding
  • Write advanced SQL queries for transformation, validation, and analytics.
  • Develop scalable data processing logic using Python and/or Apache Beam.
  • Build reusable, modular, and maintainable code for data workflows.

Data Quality, Observability & Reliability
  • Implement and maintain data quality checks, validation rules, and anomaly detection frameworks.
  • Enable data observability through monitoring, logging, and alerting mechanisms.
  • Ensure highly reliable data pipelines with fault tolerance and error handling strategies.

ETL/ELT Modernization
  • Support migration and modernization efforts from legacy ETL tools (e.g., Talend) to GCP-native ELT frameworks (dbt).
  • Optimize existing pipelines for performance, scalability, and maintainability in cloud environments.
  • Drive adoption of ELT best practices using BigQuery as the compute engine.

Collaboration & Stakeholder Engagement
  • Collaborate with data architects, business analysts, and machine learning teams to deliver trusted datasets.
  • Translate business requirements into scalable data solutions.
  • Provide technical guidance and support for downstream analytics and reporting use cases.

Best Practices & Governance
  • Drive adoption of best practices in cloud data engineering, CI/CD, and DevOps.
  • Implement secure data access controls using IAM roles, policies, and governance frameworks.
  • Follow standards for code quality, version control (Git), and automated deployments.

Required Qualifications
  • Bachelor’s or Master’s degree in Computer Science, Engineering, or related field.
  • 4+ years of experience in data engineering or data platform development.
  • Hands-on experience with Google Cloud Platform (GCP) services:  
  • BigQuery
  • Dataflow
  • Pub/Sub
  • Cloud Storage
  • Strong proficiency in SQL and Python.
  • Experience with dbt (Data Build Tool) or similar ELT frameworks.
  • Experience building batch and streaming data pipelines.

Preferred Skills
  • Experience with Apache Beam or Spark.
  • Familiarity with Talend or other ETL tools and migration to cloud-native solutions.
  • Knowledge of data lakehouse architectures and modern data stack.
  • Experience with CI/CD tools (e.g., GitHub Actions, Cloud Build, Jenkins).
  • Understanding of data security, governance, and compliance standards.
  • Exposure to machine learning data pipelines and feature engineering.
                                
Key Competencies
  • Strong problem-solving and analytical skills
  • Ability to work in cross-functional teams
  • Excellent communication and documentation skills
  • Focus on performance optimization and scalability
  • Attention to data quality and reliability

Share This Job

Powered by