[Solved] IR-Assignment #1

$25

File Name: IR_Assignment__1.zip
File Size: 150.72 KB

SKU: [Solved] IR-Assignment #1 Category: Tag:
5/5 - (1 vote)

Information Retrieval in High Dimensional Data

Please hand in your solutions via Moodle as an IPYTHON (Jupyter) notebook.

Solutions can be handed in by groups of four to five people. Please state the names of your group members at a prominent place in your submission. (For example, at the beginning of your provided notebook or in a separate text file.)

Curse of Dimensionality

Task 1: [2 Points] Let denote the p-dimensional hypercube of edge length d, centered at the origin.

  • Assume X to be uniformly distributed in C1. Determine d in dependence of p and q [0,1], such that

Pr(X Cd) = q

holds.

  • Let the components of the p-dimensional random variable Xp be independent and have the standard normal distribution. It is known that Pr(|X1| 2.576) = 0.99. For an arbitrary p, determine the probability Pr(kXpk > 2.576) for any of the components of Xp to lie outside of the interval [2.576,2.576]. Evaluate the value for p = 2, p = 3 and p = 500.

Task 2: [10 Points] Provide the PYTHON code to the following tasks (the code needs to be commented properly):

  • Sample 100 uniformly distributed random vectors from the box [1,1]d for d = 2.
  • For each of the 100 vectors determine the minimum angle to all other vectors. Then compute the average of these minimum angles. Note that for two vectors x, y the cosine of the angle between the two vectors is defined as

.

  • Repeat the above for dimensions d = 1,,1000 and use the results to plot the average minimum angle against the dimension.
  • Give an interpretation of the result. What conclusions can you draw for 2 randomly sampled vectors in a d-dimensional space?
  • Does the result change if the sample size increases?

Statistical Decision Making

Task 3: [10 Points] Answer the following questions. All answers must be justified.

  • The numbers in Figure 1 show the probability of the respective event to happen (e.g. the probability for the event X = 1 and Y = 1 is 0.02). Is this table a probability table? If so, why?
  • Based on Figure 1 give the conditional expectation EY |X=2[Y ] and the probability of the event X = 1 under the condition that Y = 3.
  • Is the function p(x,y) given by

(

1 for p(x,y) =

0 otherwise

a joint density function for two random variables?

  • For two random variables X and Y the joint density function is given by

(2e(x+y) for 0 x y, 0 y

p(x,y) =

0 otherwise.

What are the marginal density functions for X and Y respectively?

  • Let the joint density function of two random variables X and Y be given by

for 0 < x < 3, 0 < y < 1

p x,y

0 otherwise.

Determine the probability for X 2 under the condition that.

Task 4: [3 Points] Show that the covariance matrix C of any random variable X Rp is symmetric positive semidefinite, i.e. C = C> and x>Cx 0 for any covariance matrix C Rpp and any x Rp.

Figure 1: Task 3

Reviews

There are no reviews yet.

Only logged in customers who have purchased this product may leave a review.

Shopping Cart
[Solved] IR-Assignment #1[Solved] IR-Assignment #1
$25