There are two steps to this challenge.
Bonus points:
Create a synthetic data set that has similar characteristics to the original data set. You are not required to use any specific method. You can choose a single variable to reproduce (e.g. clicks).
The original and the synthetic data should have similar-looking distributions and descriptive statistics.
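One possible approach, shown only as a minimal sketch, is to sample from the empirical distribution of a single variable and then compare descriptive statistics side by side. The function names `describe` and `synthesize` are hypothetical, and the "original" clicks below are placeholder values generated for illustration, not the challenge data. Any other method would be equally acceptable.

```python
import random
import statistics


def describe(values):
    """Return a few descriptive statistics for a list of numbers."""
    ordered = sorted(values)
    return {
        "n": len(ordered),
        "mean": statistics.mean(ordered),
        "median": statistics.median(ordered),
        "stdev": statistics.stdev(ordered),  # needs at least two values
        "min": ordered[0],
        "max": ordered[-1],
    }


def synthesize(original, n, seed=0):
    """Draw n synthetic values by inverse-transform sampling from the
    empirical distribution of `original`, interpolating linearly
    between neighbouring sorted values."""
    if not original:
        raise ValueError("original must contain at least one value")
    rng = random.Random(seed)
    ordered = sorted(original)
    last = len(ordered) - 1
    synthetic = []
    for _ in range(n):
        pos = rng.random() * last      # fractional position in the sorted data
        lo = int(pos)
        hi = min(lo + 1, last)
        frac = pos - lo
        value = ordered[lo] + frac * (ordered[hi] - ordered[lo])
        synthetic.append(round(value))  # clicks are whole numbers
    return synthetic


if __name__ == "__main__":
    # Placeholder stand-in for the real clicks column (assumption: skewed counts).
    placeholder_rng = random.Random(42)
    original_clicks = [
        round(placeholder_rng.lognormvariate(4.0, 1.0)) for _ in range(2000)
    ]
    synthetic_clicks = synthesize(original_clicks, n=2000, seed=1)

    for name, data in (("original", original_clicks), ("synthetic", synthetic_clicks)):
        print(name, describe(data))
```

In practice, `original_clicks` would be replaced by the values queried from the database, and a histogram of both lists would complement the statistics printed here.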
The data you'll be working with is synthetic advertising performance data. It represents daily summarized data for ad accounts across six different channels.
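If you work with the SQLite file locally, a query along the following lines could summarize one variable per channel. This is only a sketch: the brief does not state the schema, so the table name `ad_performance`, its columns, the channel labels, and the rows inserted here are all hypothetical placeholders. An in-memory database is used so the example runs on its own.

```python
import sqlite3


def build_demo_db():
    """Create an in-memory database with a HYPOTHETICAL schema and
    placeholder rows. With the real file, use sqlite3.connect(<path>) instead."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        """
        CREATE TABLE ad_performance (
            date TEXT,
            account_id INTEGER,
            channel TEXT,
            clicks INTEGER
        )
        """
    )
    placeholder_rows = [
        ("2024-01-01", 1, "channel_a", 120),
        ("2024-01-02", 1, "channel_a", 80),
        ("2024-01-01", 2, "channel_b", 40),
        ("2024-01-02", 2, "channel_b", 60),
    ]
    conn.executemany(
        "INSERT INTO ad_performance VALUES (?, ?, ?, ?)", placeholder_rows
    )
    return conn


def clicks_by_channel(conn):
    """Return (channel, row count, total clicks, average clicks) per channel."""
    query = """
        SELECT channel,
               COUNT(*)    AS n_rows,
               SUM(clicks) AS total_clicks,
               AVG(clicks) AS avg_clicks
        FROM ad_performance
        GROUP BY channel
        ORDER BY channel
    """
    return conn.execute(query).fetchall()


if __name__ == "__main__":
    conn = build_demo_db()
    for row in clicks_by_channel(conn):
        print(row)
    conn.close()
```

The real table and column names should be read from the database itself (for example by inspecting `sqlite_master`) before adapting the query.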
<aside> 🔑 You can access & query the data here:
Or you can download the SQLite database locally if you prefer:
Take a screenshot of each SQL query and its results.
Paste the screenshots into a Word or Google document.
For example:
Example screenshot of UI.