[Solved] BIT project3-clustering, dimensionality reduction, and non-monotonous neurons solution(s)



task 3.1: fun with k-means clustering: Download the 2D data in the file data-clustering-1.csv and plot it; the result should look something like this

Next, implement

  • Lloyd’s algorithm for k-means clustering (e.g. simply using scipy)
  • Hartigan’s algorithm for k-means clustering
  • MacQueen’s algorithm for k-means clustering

For k = 3, run each algorithm on the above data and plot your results. In fact, run each of them several times and look at the results. What do you observe? Are the results always the same or do they vary from run to run?

Measure the run times of each of your implementations (run them each at least 10 times and determine their average run times). What do you observe?
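For orientation, here is a minimal sketch of all three variants in numpy/scipy. It assumes that data-clustering-1.csv stores the points as two comma-separated rows (hence the transpose) and times each algorithm over 10 runs; the function names lloyd, hartigan, and macqueen are placeholders, not names prescribed by the task.

import time
import numpy as np
from scipy.cluster.vq import kmeans2

# assumption: the CSV holds the points as two comma-separated rows, hence the transpose
X = np.loadtxt('data-clustering-1.csv', delimiter=',').T
k = 3
rng = np.random.default_rng()

def lloyd(X, k):
    # Lloyd's algorithm, here simply delegated to scipy
    return kmeans2(X, k, minit='points')

def macqueen(X, k):
    # MacQueen's online k-means: one sequential pass with incremental centroid updates
    centroids = X[rng.choice(len(X), k, replace=False)].astype(float)
    counts = np.ones(k)
    for x in X:
        j = np.argmin(np.sum((centroids - x) ** 2, axis=1))
        counts[j] += 1
        centroids[j] += (x - centroids[j]) / counts[j]
    labels = np.argmin(((X[:, None] - centroids) ** 2).sum(-1), axis=1)
    return centroids, labels

def hartigan(X, k):
    # Hartigan's algorithm: greedily move single points between clusters
    labels = rng.permutation(np.arange(len(X)) % k)   # non-empty initial clusters
    counts = np.array([np.sum(labels == j) for j in range(k)], dtype=float)
    centroids = np.array([X[labels == j].mean(axis=0) for j in range(k)])
    changed = True
    while changed:
        changed = False
        for i, x in enumerate(X):
            a = labels[i]
            if counts[a] <= 1:
                continue
            # cost of adding x to each cluster b vs. the gain of removing it from cluster a
            costs = counts / (counts + 1) * np.sum((centroids - x) ** 2, axis=1)
            costs[a] = counts[a] / (counts[a] - 1) * np.sum((centroids[a] - x) ** 2)
            b = np.argmin(costs)
            if b != a:
                centroids[a] = (counts[a] * centroids[a] - x) / (counts[a] - 1)
                centroids[b] = (counts[b] * centroids[b] + x) / (counts[b] + 1)
                counts[a] -= 1
                counts[b] += 1
                labels[i] = b
                changed = True
    return centroids, labels

for algo in (lloyd, macqueen, hartigan):
    start = time.perf_counter()
    for _ in range(10):
        centroids, labels = algo(X, k)
    print(algo.__name__, (time.perf_counter() - start) / 10, 'seconds on average')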

task 3.2: spectral clustering: Download data-clustering-2.csv and plot the 2D data in this file; the result should look something like this

Set k = 2 and apply the k-means algorithms you implemented in the previous task to this data. What do you observe?

Next, implement spectral clustering. Proceed as follows: Let {x1, x2, ..., xn} ⊂ R2 be the given data. First, compute an n × n similarity matrix S where

Sij = exp(−β ‖xi − xj‖²)

and then compute the Laplacian matrix L = D − S where the diagonal matrix D is given by

Dii = Σj Sij and Dij = 0 otherwise.

Note that row i in L can be understood as a feature vector f(xi) for data point xi. Next, compute the eigenvalues λi and eigenvectors ui of L and sort them in ascending order. That is, let λn denote the largest eigenvalue and un denote the corresponding eigenvector.

The eigenvector u2 that corresponds to the second smallest eigenvalue λ2 is called the Fiedler vector and is of significance in clustering. You will find that some of its entries are greater than 0 and some are less than 0. To cluster the given data into two clusters C1 and C2, do the following: If entry i of u2 is greater than 0, assign xi to cluster C1; if it is less than 0, assign xi to cluster C2.

Set β to some value (say β = 1), cluster the data as described, and plot your results. What do you observe?
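A compact sketch of this procedure, again assuming the CSV stores the points as two comma-separated rows and using β = 1, might look like this:

import numpy as np
import matplotlib.pyplot as plt

# assumption: the points are stored as two comma-separated rows, hence the transpose
X = np.loadtxt('data-clustering-2.csv', delimiter=',').T
beta = 1.0

# similarity matrix S_ij = exp(-beta * ||x_i - x_j||^2)
sq_dists = np.sum((X[:, None, :] - X[None, :, :]) ** 2, axis=-1)
S = np.exp(-beta * sq_dists)

# Laplacian L = D - S with D_ii = sum_j S_ij
D = np.diag(S.sum(axis=1))
L = D - S

# eigh returns the eigenvalues of the symmetric matrix L in ascending order
eigvals, eigvecs = np.linalg.eigh(L)
fiedler = eigvecs[:, 1]          # eigenvector of the second smallest eigenvalue

C1 = fiedler > 0                 # cluster assignment by the sign of the Fiedler vector
plt.plot(X[C1, 0], X[C1, 1], 'o')
plt.plot(X[~C1, 0], X[~C1, 1], 'o')
plt.show()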

task 3.3: dimensionality reduction: The file data-dimred-X.csv contains a 500 × 150 data matrix X, that is, 150 data vectors xi ∈ R500.

In fact, these data vectors are from three classes and you can find the corresponding class labels yi ∈ {1,2,3} in the file data-dimred-y.csv.

The goal of this task is to explore mappings R500 → R2 that allow us to visualize (plot) high-dimensional data.

First of all, perform dimensionality reduction using PCA. That is, normalize the data in X to zero mean and compute the eigen-decomposition of the corresponding covariance matrix. Then, use the two eigenvectors u1 and u2 corresponding to the two largest eigenvalues to project the (normalized!) data into R2. What do you observe?
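A minimal PCA sketch along these lines, assuming the CSV files are comma-separated and the columns of X are the data points as stated above, could be:

import numpy as np
import matplotlib.pyplot as plt

X = np.loadtxt('data-dimred-X.csv', delimiter=',')   # 500 x 150, columns are data points
y = np.loadtxt('data-dimred-y.csv', delimiter=',')   # 150 class labels in {1, 2, 3}

# normalize the data to zero mean
Xc = X - X.mean(axis=1, keepdims=True)

# covariance matrix and its eigen-decomposition (eigh: ascending eigenvalues)
C = Xc @ Xc.T / (Xc.shape[1] - 1)
eigvals, eigvecs = np.linalg.eigh(C)
U = eigvecs[:, -2:][:, ::-1]        # eigenvectors of the two largest eigenvalues

Z = U.T @ Xc                        # 2 x 150 projection of the normalized data
for c in (1, 2, 3):
    plt.plot(Z[0, y == c], Z[1, y == c], 'o', label='class %d' % c)
plt.legend()
plt.show()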

Second of all, perform dimensionality reduction using multiclass LDA (as discussed in lecture 14). To this end, make use of the fact that the data in X are from three classes. Compute the within-class scatter matrix SW and the between-class scatter matrix SB and then the eigen-decomposition of the matrix SW⁻¹SB. Again, use the two eigenvectors u1 and u2 corresponding to the two largest eigenvalues to project the data into R2. What do you observe? Does the result differ from the one you obtained via PCA?
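A corresponding multiclass-LDA sketch, under the same assumptions about the file layout, might look as follows; note that SW is rank deficient here (only 150 samples in R500), so a pseudo-inverse stands in for SW⁻¹:

import numpy as np
import numpy.linalg as la
import matplotlib.pyplot as plt

X = np.loadtxt('data-dimred-X.csv', delimiter=',')   # 500 x 150, columns are data points
y = np.loadtxt('data-dimred-y.csv', delimiter=',')

d, n = X.shape
mu = X.mean(axis=1, keepdims=True)                   # overall mean
SW = np.zeros((d, d))                                # within-class scatter
SB = np.zeros((d, d))                                # between-class scatter
for c in (1, 2, 3):
    Xc = X[:, y == c]
    mc = Xc.mean(axis=1, keepdims=True)
    SW += (Xc - mc) @ (Xc - mc).T
    SB += Xc.shape[1] * (mc - mu) @ (mc - mu).T

# eigen-decomposition of SW^-1 SB; pinv because SW is singular for n < d
eigvals, eigvecs = la.eig(la.pinv(SW) @ SB)
order = np.argsort(eigvals.real)[::-1]
U = eigvecs[:, order[:2]].real                       # two leading eigenvectors

Z = U.T @ X
for c in (1, 2, 3):
    plt.plot(Z[0, y == c], Z[1, y == c], 'o', label='class %d' % c)
plt.legend()
plt.show()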

What if you project the data from R500 to R3? For both approaches, this can be accomplished using the first three eigenvectors and creating 3D plots.

task 3.4: non-monotonous neurons: The two files xor-X.csv and xor-y.csv contain data points xi ∈ R2 and label values which, when plotted appropriately, should lead to a picture like this

Note that XOR problems like this pose nonlinear classification problems, because there is no single hyperplane that would separate the blue from the orange dots. XOR problems are therefore famously used to demonstrate the limitations of a single perceptron

y(x) = f(wᵀx − θ)

where f is a monotonous activation function such as the Heaviside step function.

However, this limitation is a historical artifact, because monotonous activation functions are a persistent meme that arose in the 1940s. Yet, there is nothing that would prevent us from considering more flexible neural networks composed of neurons with non-monotonous activations.

Mandatory sub-task: Given the above data, train a non-monotonous neuron

y(x) = f(wᵀx − θ)

where f is a non-monotonous activation function. In order to do so, perform gradient descent over the loss function

E(w, θ) = Σi (yi − f(wᵀxi − θ))².

That is, randomly initialize w0 ∈ R2 and θ0 ∈ R and then iteratively compute the updates

wt+1 = wt − ηw ∂E/∂w
θt+1 = θt − ηθ ∂E/∂θ

where both partial derivatives are evaluated at the current estimates wt and θt.

Note: Good choices for the step sizes are ηθ = 0.001 and ηw = 0.005, but you are encouraged to experiment with these parameters and see how they influence the behavior and outcome of the training procedure.
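For illustration, a minimal training loop could look as follows; it assumes a Gaussian bump f(z) = exp(−z²) as a stand-in for the non-monotonous activation, labels yi mapped to {0, 1}, the two-row CSV layout used above, and 5000 iterations, all of which are assumptions rather than requirements of the task:

import numpy as np

# assumption: two comma-separated rows of coordinates, hence the transpose
X = np.loadtxt('xor-X.csv', delimiter=',').T
y = np.loadtxt('xor-y.csv', delimiter=',')
y = (y > 0).astype(float)          # assumption: map the labels to {0, 1}

def f(z):
    # assumed non-monotonous activation: a Gaussian bump
    return np.exp(-z ** 2)

def fprime(z):
    return -2.0 * z * np.exp(-z ** 2)

rng = np.random.default_rng()
w = rng.standard_normal(2)         # w_0
theta = rng.standard_normal()      # theta_0

eta_w, eta_theta = 0.005, 0.001
for t in range(5000):
    z = X @ w - theta
    err = f(z) - y                 # residuals of E(w, theta) = sum_i (y_i - f(w^T x_i - theta))^2
    grad_w = 2.0 * (err * fprime(z)) @ X
    grad_theta = -2.0 * np.sum(err * fprime(z))
    w -= eta_w * grad_w
    theta -= eta_theta * grad_theta

print('w =', w, 'theta =', theta)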

If all goes well, and say you also implement a function that can visualize a classifier, you should observe something like this

(figure: result obtained from w0, θ0 vs. result obtained from w50, θ50)

Voluntary sub-task: If you want to impress your professor, then also train a kernel SVM on the above data using a polynomial kernel such as

k(xi, xj) = (xiᵀxj + 1)^d.
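If you would rather not solve the dual problem by hand, scikit-learn's SVC offers a quick way to try a polynomial kernel on this data; the degree, gamma, coef0, and C values below are illustrative choices, not values prescribed by the task:

import numpy as np
import matplotlib.pyplot as plt
from sklearn.svm import SVC

X = np.loadtxt('xor-X.csv', delimiter=',').T    # assumption: two comma-separated rows
y = np.loadtxt('xor-y.csv', delimiter=',')

# polynomial kernel k(xi, xj) = (gamma * xi^T xj + coef0)^degree
clf = SVC(kernel='poly', degree=2, gamma=1.0, coef0=1.0, C=10.0)
clf.fit(X, y)

# visualize the decision regions on a grid
gx, gy = np.meshgrid(np.linspace(X[:, 0].min() - 1, X[:, 0].max() + 1, 300),
                     np.linspace(X[:, 1].min() - 1, X[:, 1].max() + 1, 300))
gz = clf.predict(np.c_[gx.ravel(), gy.ravel()]).reshape(gx.shape)
plt.contourf(gx, gy, gz, alpha=0.3)
plt.scatter(X[:, 0], X[:, 1], c=y)
plt.show()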

task 3.5: exploring numerical instabilities: This task revisits task 2.1 and is not so much a task in itself as an eye-opener! The goal is to raise awareness for the fact that doing math on digital computers may lead to unreliable results! Everybody, i.e. every member of each team, must do it!

Download the file whData.dat, remove the outliers and collect the remaining height and weight data in two numpy arrays hgt and wgt and fit a 10-th order polynomial. Use the following code:

import numpy as np
import numpy.linalg as la
import numpy.polynomial.polynomial as poly
import matplotlib.pyplot as plt

hgt = …
wgt = …

xmin = hgt.min() - 15
xmax = hgt.max() + 15
ymin = wgt.min() - 15
ymax = wgt.max() + 15

def plot_data_and_fit(h, w, x, y):
    plt.plot(h, w, 'ko', x, y, 'r-')
    plt.xlim(xmin, xmax)
    plt.ylim(ymin, ymax)
    plt.show()

def trsf(x):
    return x / 100.

n = 10
x = np.linspace(xmin, xmax, 100)

# method 1:
# regression using polyfit
c = poly.polyfit(hgt, wgt, n)
y = poly.polyval(x, c)
plot_data_and_fit(hgt, wgt, x, y)

# method 2:
# regression using the Vandermonde matrix and pinv
X = poly.polyvander(hgt, n)
c = np.dot(la.pinv(X), wgt)
y = np.dot(poly.polyvander(x, n), c)
plot_data_and_fit(hgt, wgt, x, y)

# method 3:
# regression using the Vandermonde matrix and lstsq
X = poly.polyvander(hgt, n)
c = la.lstsq(X, wgt)[0]
y = np.dot(poly.polyvander(x, n), c)
plot_data_and_fit(hgt, wgt, x, y)

# method 4:
# regression on transformed data using the Vandermonde
# matrix and either pinv or lstsq
X = poly.polyvander(trsf(hgt), n)
c = np.dot(la.pinv(X), wgt)
y = np.dot(poly.polyvander(trsf(x), n), c)
plot_data_and_fit(hgt, wgt, x, y)

What is going on here? Report what you observe! Think about what this implies! What if you were working in aerospace engineering, where sloppy code and careless, untested implementations could have catastrophic results...
