Dear Edinburgh and Malaysia campus students,
In addition to the previous instructions, more information is provided below for the actual ML assessment. Note that the assessment is open notes and open book; you may make use of the lecture slides, exercises, their solution and associated R codes.
There are 10 multiple choice questions in this assessment worth 10 marks in total. Answer all questions. There is no negative marking.
There are 3 parts in this assessment and each part refers to a different dataset or datasets.
It is a good idea to download ALL the required datasets into a folder where your R codes can access them BEFORE you begin the test.
Part 1
Scenario:
A marketing company maintains a large database of customers consisting of working adults in a city. The company is interested to find out if the level of income of these adults can be used to determine the extent of online purchases made by them. For this purpose, they provide you with a dataset consisting of 100 customers with information on two variables: x = monthly income, and y = average amount spent on online purchases (per week). The data is available here: ObsDataset_for_Part1.csv.
Your task is to analyze this dataset and answer the following questions: Questions 1, 2 and 3 which use ObsDataset_for_Part1.csv, and
Questions 4 and 5 which use ObsDataset_for_Part1.csv and an
Your Task:
additional dataset TestValidDataset_for_Part1.csv.
Part 2
Scenario:
Obtaining high standardized scores on a competitive test is believed to influence the admission into College X. Is this belief really justified? You are provided with a dataset consisting of standardized scores of 100 students who took the competitive test and their admission status into College X. The dataset is available here: ObsDataset_for_Part2.csv where the columns with names x and y denote, respectively, the score obtained and the admission status. The admission status of 1 indicates that the student was successfully admitted, and 0 indicates otherwise.
Your task is to analyze this dataset and answer the following questions:
Your Task:
Questions 6, 7 and 8 which use ObsDataset_for_Part2.csv
Part 3
Scenario:
You are provided with a dataset which contains measurements on three variables made on a group of patients suspected of diabetes. The dataset is provided here: ObsDataset_for_Part3.csv
Your task is to analyze this dataset and answer the following questions: Questions 9 and 10 which use to ObsDataset_for_Part3.csv.
Best wishes, F70TS Instructors
Your Task:
Reviews
There are no reviews yet.