[Solved] CS60050 Assignment2

$25

File Name: CS60050_Assignment2.zip
File Size: 178.98 KB

SKU: [Solved] CS60050 Assignment2 Category: Tag:
5/5 - (1 vote)

The following has to be done using Bayesian learning (Nave Bayes classifier):

  • Randomly divide the data into 80% for training and 20% for testing. Apply the following:
    1. Handle the missing values in both train and test set. [5]
    2. Encode categorical variables using appropriate encoding method (in-built function allowed). [5]
    3. After completing step (a) and (b), compute 5-fold cross validation on the training set

(normalisation of data is allowed, if required). Print the final test accuracy. [10]

  • Apply PCA (select number of components by preserving 95% of total variance) on the processed data from step (1).
    1. Plot the graph for PCA (in-built function allowed for PCA and visualisation). [20]
    2. Use the features extracted from PCA to train your model. Compute 5-fold cross validation on the training set (normalisation of data is allowed, if required). Print the final test accuracy. [10]
  • Using the processed data from step (1), apply the following:
    1. A feature value is considered as an outlier if its value is greater than mean + 3 x standard deviation. A sample having maximum such outlier features must be dropped. [5]
    2. Using the sequential backward selection method, remove features. [15]
    3. Print the final set of features formed. [5]
    4. Compute 5-fold cross validation on the training set (normalisation of data is allowed if required). Print the final test accuracy. [5] 4) Report and results. [20]

Dataset Description:

Use Train_C.csv as data for this assignment. The Response column will be used as labels.

Shopping Cart

No products in the cart.

No products in the cart.

[Solved] CS60050 Assignment2[Solved] CS60050 Assignment2
$25