Resume
Work Experience
2022 Nov - Present
Data Engineering Manager
Shopfiy - Lakehouse
Managing a global team of engineers, building Shopify's Data Lakehouse tooling, and near real time Kafka data ingestion for offline analytics.
→ Managing 10 engineers
→ Fully remote team
→ Global team spanning from Pacific West Coast (PST) to Central Europe (CET)
2022 May - Nov
Staff Data Engineer
Shopify - Lakehouse
Leading the charge on Apache Iceberg adoption within Shopify, while launching our CDC archival system using Kafka Connect to write Apache Iceberg V2 datasets to have steaming updates to our tables. Develop our new Lakehouse standards.
→ Running 1000+ Kafka Connectors
→ Ingesting PB's of data a day
→ Standardizing Lakehouse data set publishing
→ Open Source Trino Contribution
Skills: Change Data Capture · Apache Kafka · Apache Iceberg · Apache Spark · Apache Hive · Trino · Scala · Python
2020 April - 2022 May
Senior Data Engineer
Shopify - Data Acquisition
Focusing on using Kafka Connect for ingest of Kakfa data, and made accessible via Apache Iceberg. Then creating type 1 dimensional model of data sources from CDC (Change Data Capture) Kafka data using Apache Spark and Iceberg. Also working on maintaining our legacy data ingestion pipelines.
Skills: Change Data Capture · Apache Kafka · Apache Iceberg · Apache Spark · Trino · Scala
2019 July - 2020 April
Data Engineer
Shopfiy (Oberlo) - Product Sourcing
Cloud migration of Shopify's subsidiary Oberlo's data services from AWS to GCP and native Shopify data stack (proprietary Shopify ETL tooling, and Spark modeling framework). Worked on a semi remote team split between Berlin Germany and Vilnius Lithuania, while coordinating efforts with our data teams back in Canada.
→ Orchestrated and executed the migration of database, api, and archived event extraction
→ Onboarded data scientists to the Shopify data platform
→ Assisted migration of modelling to PySpark.
​
Skills: Python · JRuby · AWS · Spark · Redshift
2017 July - 2019 July
Data Engineer
Shopfiy - Data Aquisition
Working on data extraction for ETL pipelines that are scalable, reliable, and fast. This includes building interfaces for API's, JDBC sources, and Kafka extractions.
→ Implemented incremental statement based replication
→ Implemented multi threaded ingestion for our sharded data sources
→ Migrated our extraction services to from legacy on premises data center to GCP
→ Audited raw historical data for GDPR compliance
→ Implemented data scrubbing and retention of PII data
​
Skills: Python · JRuby · Scala · GCP · Apache Spark · Docker
2015 April - 2016 June
Intern Software Developer
EventMobi - Backend
Develop the core data services technology that enables customer facing teams, partners and 3rd party developers to fulfill their requirements on building on the EventMobi platform.
​
Skills: Python · Flask · RabbitMQ · AWS · Docker
Education
2012 - 2017
University of Toronto | BSc. Honours Computer Science
Specialty of computer science with an interest in areas of operating systems and artificial intelligence.
Teaching Assistant Positions
2017 Winter
University of Toronto | CSC490 Capstone Project
Running IBM Watson and Bluemix workshops, design thinking sessions, helping students get to a testable MVP.
2016 Winter
University of Toronto | CSC454 Business of Software
Advising software business plans & presentations. Running through the business model canvas.
2015 - 2016
University of Toronto | CSC258 Computer Organization
Running digital logic circuits labs for students using Altera FPGA’s programming in Verlilog (a hardware description language)