Job Title:  Consulting - SA&MA - Stream Data Engineer

Job requisition ID ::  63740
Date:  Apr 24, 2024
Location:  Delhi CEC
Designation:  Consultant
Entity:  Deloitte Touche Tohmatsu India LLP
  1. Google Cloud Platform (GCP) Skills:
    • GCP Certification: Look for candidates with relevant GCP certifications, such as the Professional Data Engineer Certification.
    • Experience with GCP Services: Assess their hands-on experience with various GCP services, including BigQuery, Dataflow, Pub/Sub, and others.
    • Resource Management: Evaluate their ability to manage GCP resources efficiently and securely.
  1. BigQuery (BQ) Expertise:
    • SQL Proficiency: Assess their SQL skills, particularly in the context of writing efficient queries for BigQuery.
    • Schema Design: Evaluate their ability to design and optimize schemas for BigQuery datasets.
    • Performance Optimization: Look for experience in optimizing BigQuery performance and troubleshooting query performance issues.
  1. Directed Acyclic Graphs (DAGs) and Workflow Orchestration:
    • Apache Airflow Knowledge: Assess their familiarity with Apache Airflow and their ability to create, schedule, and monitor DAGs.
    • Task Dependencies: Evaluate their understanding of defining task dependencies and scheduling in DAGs.
    • Error Handling: Check their approach to handling errors and managing retries in Airflow workflows.
  1. AtScale:
    • Knowledge of AtScale: Look for candidates who understand the concepts of AtScale and its role in providing a unified semantic layer for analytics.
    • Implementation Experience: Assess their hands-on experience in implementing solutions using AtScale.
    • Integration Skills: Evaluate their ability to integrate AtScale with other components of the data engineering stack.
  1. Data Modeling and ETL (Extract, Transform, Load):
    • Data Modeling Skills: Assess their skills in designing effective data models, especially for analytical purposes.
    • ETL Experience: Evaluate their experience in designing and implementing ETL processes, considering both batch and streaming data.
  1. Programming and Scripting Languages:
    • SQL and Python: Check their proficiency in SQL for data manipulation and Python for scripting and automation..
  1. Version Control and Collaboration:
    • Git: Look for candidates with experience in using Git for version control.
    • Collaboration Tools: Assess their familiarity with collaboration tools such as JIRA, Confluence, or Slack.