- Dataset:
The attached dataset is about PM2.5.
Training Data: Two alternate years can be taken as train data.
Testing Data: Two years of data from the remaining three can be taken as test data.
- Problem Statement:
You are supposed to perform two tasks for this dataset: Classification and Regression
- Classification Task: Target Column: Month
Evaluation Metric: Accuracy
- Regression Task: Target Column: PM2.5
Evaluation Metrics: MSE
Also, report the mean and standard deviation of the error.
Implement the above problem statement(both for Classification and Regression) from scratch using the following:
- Decision Trees (DT) ( You have to analyze yourself as told in class for different depths, width and other parameters of the tree and draw your inferences.)
- Bagged Decision Trees
- Random Forest
Implement these as taught in the class.
- Gaussian Processes:
In the data provided to you, you will find signal strength in dB vs distance. Assume the Distance to be an independent variable and Signal Strength as a target. Compute the mean and variance prediction for signal strength at the following 5 points {Sr. No.: <2,4,6,8,10>}. Use GPR to train using the remaining data points {Sr. No.: <1,3,5,7,9,11,12>} from the table provided.
| Sr. No. | Distance | Signal Strength(DBM) |
| 1 | 0 | -45 |
| 2 | 1 | -51 |
| 3 | 2 | -58 |
| 4 | 3 | -63 |
| 5 | 4 | -36 |
| 6 | 5 | -52 |
| 7 | 6 | -59 |
| 8 | 7 | -62 |
| 9 | 8 | -36 |
| 10 | 9 | -43 |
| 11 | 10 | -55 |
| 12 | 11 | -64 |

![[Solved] SML Assignment 5-Decision Trees Bagging Random Forest](https://assignmentchef.com/wp-content/uploads/2022/08/downloadzip.jpg)

![[Solved] CS 444 Project 4](https://assignmentchef.com/wp-content/uploads/2022/08/downloadzip-1200x1200.jpg)
Reviews
There are no reviews yet.