[Solved] QBIO401 Assignment1-FASTA file

$25

File Name: QBIO401_Assignment1_FASTA_file.zip
File Size: 282.6 KB

SKU: [Solved] QBIO401 Assignment1-FASTA file Category: Tag:
5/5 - (1 vote)
  1. Write a Python function that takes as input a FASTA file and returns a sequence string [2pt].
  2. Write a Python function that takes as input a sequence string and returns a list with 4 entries that are the number of A, C, G, and T in the sequence [2pt].
  3. Write a Python function that takes two inputs: a sequence string and a string of two letters (e.g., CG or CT). This function returns the number of times the two letters occur consecutively in the sequence [2pt].
  4. Explore the NCBI website, go to the following two pages, and download the FASTA files for the human gene PTPN11 and its Drosophila orthologue csw.

https://www.ncbi.nlm.nih.gov/nuccore/NM_002834 https://www.ncbi.nlm.nih.gov/nuccore/NM_057783.3

For each of the two FASTA files, print the output of function #2 and function #3 with input CG. Compare the results and describe your finding [2pt].

  1. [Bonus] Write another Python function that takes as input a sequence string and returns a list with 16 entries that are the outputs of function #3 for all 16 possible two letter strings [Bonus 1 pt].

Turn in the code for the three (or four) Python functions and the answer for question #4 into one file in Jupyter Notebook format (.ipynb). Use the Turnitin link on Blackboard/Assignments/Assignment 1 to submit this file.

Reviews

There are no reviews yet.

Only logged in customers who have purchased this product may leave a review.

Shopping Cart
[Solved] QBIO401 Assignment1-FASTA file[Solved] QBIO401 Assignment1-FASTA file
$25