Data Engineer Intern
Ecolab
St Paul, Minnesota, United States
Enterprise Data Office
May 2022 - July 2022
- Profiled tables in Snowflake using SQL to examine key statistics, identifying outliers and trends and enhancing data integrity
- Identified 201M invalid rows and created an interactive Snowsight dashboard to support business decisions
- Automated query generation with JinjaSQL in Python, utilizing the Alation catalog and accelerating data quality evaluation
- Analyzed 19,300 hours of dishmachine Service Request logs from Snowflake, cleansed using Python and processed with Azure Cognitive Service, to identify 6 common issues and their locations, informing potential refresh strategies