Google Cloud Dataprep is a serverless service used to format and transform your raw data before importing it into BigQuery or storing it as a new file. The file used for the demo is a Wikipedia log. You can download one of the files from https://dumps.wikimedia.org/other/pageviews/2017/2017-12/.
Open the Cloud Console screen in the browser and choose the project you want to use for Cloud Dataprep. Open Cloud Shell by clicking on the icon, as shown in this screenshot:
Run the following command to download a file from the URL given previously:
wget https://dumps.wikimedia.org/other/pageviews/2017/2017-12/pageviews-20171201-120000.gz
Run the following command to extract the gzip file and store it in the same directory:
gunzip pageviews-20171201-120000.gz
Run the following command to rename the file with a user-friendly name for this demo:
mv...