Data Sets and Formats for the Assignment
Several data sets are available for this assignment. They are held in blob storage on Microsoft Azure. Further data is available on data.gov.uk should you want more detail. The data sets are:

All the files are in CSV format and may be compressed with gzip. Spark natively understands this compression, so a gzipped file can be read directly without being decompressed first.
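As a minimal sketch of what reading such a file might look like in PySpark: the helper below writes a tiny gzip-compressed CSV with made-up rows, and a second function loads a CSV path into a DataFrame. The function names, the sample columns and the application name are all hypothetical, and the actual Azure blob storage paths for the assignment are not given here, so a local file stands in for them.

```python
import csv
import gzip


def write_sample_gzip_csv(path):
    """Write a tiny gzip-compressed CSV (made-up rows, for illustration only)."""
    rows = [["id", "value"], ["1", "10.5"], ["2", "7.25"]]
    # newline="" lets the csv module control line endings itself.
    with gzip.open(path, "wt", newline="", encoding="utf-8") as f:
        csv.writer(f).writerows(rows)
    return len(rows) - 1  # number of data rows, excluding the header


def read_csv_with_spark(path):
    """Load a CSV file (plain or .gz) into a Spark DataFrame.

    Spark selects the gzip codec from the .gz file extension, so no
    extra option is needed for compressed input.
    """
    # Imported here so the helper above can be used without Spark installed.
    from pyspark.sql import SparkSession

    spark = SparkSession.builder.appName("read-gzip-csv").getOrCreate()
    # header=True treats the first line as column names;
    # inferSchema=True asks Spark to guess column types from the data.
    return spark.read.csv(path, header=True, inferSchema=True)


# Example usage (requires a working PySpark installation):
#   write_sample_gzip_csv("sample.csv.gz")
#   read_csv_with_spark("sample.csv.gz").show()
```

One point worth keeping in mind is that gzip is not a splittable format, so Spark reads each gzipped file as a single partition; repartitioning after loading may help with large files.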