A Simple Tech Interview Exam for Data Engineering

​Sample Data Engineering Take Home Test with Data and Solution

Jesse Ruiz (she/they)
7 min readJun 13, 2024
Photo by Avery Evans on Unsplash

If you are on the hunt for a data engineering role and want to practice more exam questions, please read on! I have included a take home exam, data file, and my solution. Please read the assessment questions below, and download the data from here (https://github.com/jessejinnaruiz/coding-interviews-exercises/blob/main/PUB-dataengineering-technical-takehome/data/data-log.csv). I was given 72 hours to write a solution.

Don’t peek at my solution beforehand! When you are done, come back to read my solution and please comment your questions or own solutions below.

Exam: Build your solution in the SRC folder. Provide us with source code that we can run on our end to reproduce your results and/or tests/builds. The choice of tools and packaging for building this project are up to you but please don’t use a remotely hosted source-control tool.

Assessment questions:

1. Aggregation:
- Summarize the raw data set and derive watch-time (duration of a stream in minutes)