Problem 1
Homework 1 considers a Covid-19 dataset which is published on Github. Please download any document type that you prefer of the dataset from the links which are shown in Figure 1. The dataset is updated daily and
Figure 1: The complete dataset links
includes data on confirmed cases, deaths, hospitalizations, testing, and vaccinations as well as other variables of potential interest. The data set has the following basic columns:
- iso code: Short name of the country
- continent: The continent where the country exists
- location: The country name
- date: The date when the data about various variables are taken.
You are responsible to implement a program which reads the given dataset from the file and computes the data for the following questions. Any programming language that you prefer will be accepted. Putting comments on your functions that you implement is must. Each question must be appended to a file which is called output{.csv, .txt}. The file contains the first 18 questions listed below. The 18th question will be written in this document.
- How many countries the dataset has?
- When is the earliest date data are taken for a country? Which country is it?
1
Homework #1 2
- How many cases are confirmed for each country so far? Print pairwise results of country and total cases. 4. How many deaths are confirmed for each country so far? Print pairwise results of country and total deaths. 5. What are the average, minimum, maximum and variation values of the reproduction rates for each country? Table 1: The format of the output for the questions 5, 6, 7, 8, 9, 10, 12, 13.
Country | minimum | maximum | average | variation |
value | value | value | value | value |
- What are the average, minimum, maximum and variation values of the icu patients (intensive care unit patients) for each country?
- What are the average, minimum, maximum and variation values of the hosp patients (hospital patients) for each country?
- What are the average, minimum, maximum and variation values of the weekly icu (intensive care unit) admissions for each country?
- What are the average, minimum, maximum and variation values of the weekly hospital admissions for each country?
- What are the average, minimum, maximum and variation values of new tests per day for each country?
- How many tests are conducted in total for each country so far?
- What are the average, minimum, maximum and variation values of the positive rates of the tests for each country?
- What are the average, minimum, maximum and variation values of the tests per case for each country?
- How many people are vaccinated by at least one dose in each country?
- How many people are vaccinated fully in each country?
- How many vaccinations are administered in each country so far?
- List information about population, median age, # of people aged 65 older, # of people aged 70 older, economic performance, death rates due to heart disease, diabetes prevalence, # of female smokers, # of male smokers, handwashing facilities, hospital beds per thousand people, life expectancy and human development index.
Table 2: The format of the output for the question 17
Country | population | median age | # of people aged 65 older |
value | value | value | value |
- Summarize all the results that you obtain by the first 17 questions (except question 2).
Table 3: The format of the output for the question 18
Country | q#3 | q#4 | q#5 min | q#5 max | q#5 avg | q#5 var |
value | value | value | value | value | value | value |
- Comment the results based on your observations. Write your opinions about the reasons of increasing infection rates by giving examples from the results. Feel free to explain any situation that you observe.
More observations more opportunities will bring you for the second homework.
(Solution) (Write your observations here.)
Reviews
There are no reviews yet.