Data Engineer

  • Location
    New York, New York
  • Category:
    IT - IT
  • Job Type:
    Direct Hire
  • Job reference:
    US_EN_8_814326_2881247

A company leading the streaming industry is looking to hire a Data Engineer to join their team and create/manage an expanding library of ETL/ELT. This role will report to the Senior Manager of Analytics.

 

Job Responsibilities: 

  • Onboard data and support suite of reporting, advanced analytics, and directives with various teams.
  • Scope and create optimal schema using best modeling practices for analytics and operation use-cases.
  • Manage and expand code ETL orchestrations - focusing on data observability augmentation and stack control.
  • Add new features with necessary unit/integration testing, UAT, and QA.
  • Perform ad hoc data analysis towards coverage inquiries and data quality.
  • Work closely with data science stakeholders in identifying and building optimal inputs for machine learning methods and predictive modeling. 
  • building workflows / programs with ORM functionality (SQLAlchemy) 
  •  

    Job Qualifications:  

  • 2 years of more of relevant/similar work experience
  • 1 year or more experience as a contributor/manager of batch, streaming pipelines.
  • Proficiency in programming languages (Python and bash are highly preferred, node.js a bonus)
  • Solid foundations in data modeling (object, dimensional, app access patterns) and transformations (regex and pandas)
  • Experience in SQL writing and analytic queries 
  • Experience in building workflows with ORM functionality (SQLAlchemy) 
  • Excellent at collecting feedback and requirements from stakeholders
  • Able to translate designs and strategic directives and implement into documentations (UML, docstring, ERDs)
  • Familiarity with Tableau (architecture, data interfaces, optimizations, etc)
  • Knowledge with development lifecycle fundamentals
  • Experience and knowledge in automated testing (selenium & pytest), source control (git), and code refactoring
  •  

    Preferred Qualifications: 

  • Cloud Data tooling (AWS, GCP) and DBT experience
  • Luigi, Airflow, or other orchestration tool experience
  • Containerization experience (Kubernetes, Docker, AWS ECS)
  • Web frameworks experience (Django, React, Flask)
  • AWS Lambda/step functions/Chalice and serverless framework experience 
  • Big data ecosystem experience (HDFS, Presto, SparkSQL, etc)
  •  



    Equal Opportunity Employer/Veterans/Disabled

    To read our Candidate Privacy Information Statement, which explains how we will use your information, please navigate to https://www.parkerlynch.com/candidate-privacy

    The Company will consider qualified applicants with arrest and conviction records