[Solved] CS490-590 Project 2-Sequence Alignment

$25

File Name: CS490_590_Project_2_Sequence_Alignment.zip
File Size: 357.96 KB

SKU: [Solved] CS490-590 Project 2-Sequence Alignment Category: Tag:
5/5 - (1 vote)

Rate this product

Description: You are to implement the dynamic programming based sequence alignment algorithm, with the following three variations: global alignment, local alignment, and global alignment with affine gap penalties. For all three variations, your algorithm must take O(n2) time to construct the matrix (or the three-level matrix in the case of alignment with affine gap penalties) and linear time to extract the optimal alignment from the matrix constructed. The alignment problems considered for this project do not involve substitution matrices. And you only need to output one optimal alignment in case of ties.

I/O and Parameter settings: Your sequences should be in FASTA format, namely with a discarded header line followed by lines of sequence. Assume that your first input sequence is in such a formatted file named sequence1.txt and your second input sequence is similarly in sequence2.txt. Upon starting the program, you should prompt the user to enter what kind of alignment he/she wants to perform: global, local, or affine. (Affine is WLOG assumed to be global-affine.) Then, in the cases of global or local choices you need to prompt the user to enter the match score, mismatch score, and gap score. In the case of affine alignment, you need to prompt the user to enter the match score, mismatch score, gap start score, and gap extension score. For the gap scores, whatever value X the user enters (check that X is an integer), you will use -|X| in the computation. You may also check that the mismatch score is less than the match score.

Regarding the output and its format, you should output: the ratio of matched letters to the alignment length, the value of OPT, and one solution path (not all of them). It is most useful to the user if you output the solution alignment with nice formatting that places one string above the other, respectively aligning the gaps, matches, and substitutions, continuing with both strings to the next pair of lines when reaching a set number of characters per line (e.g. 60). This formatting is not critical, however, and at a bare minimum is also expressible by giving the H,V,D sequence involved in the path. You do not need to output the matrix, although it is actually quite useful for your own error checking.

Reviews

There are no reviews yet.

Only logged in customers who have purchased this product may leave a review.

Shopping Cart
[Solved] CS490-590 Project 2-Sequence Alignment[Solved] CS490-590 Project 2-Sequence Alignment
$25