Data Engineer Intern
Ecolab
St Paul, Minnesota, United States Enterprise Data Office
May 2022 - July 2022

demo

  • Profiled tables in Snowflake using SQL to examine key statistics, identifying outliers and trends, enhancing data integrity
  • Identified 201M invalid rows and created an interactive Snowsight dashboard to support business decisions
  • Automated query generation with JinjaSQL in Python, utilizing the Alation catalog, accelerating data quality evaluation
  • Analyzed 19300 hours of Service Requests logs of dishmachines, cleansed using Python from Snowflake, processed Azure Cognitive Service to identify 6 common issues and their locations, informing potential refresh strategies