[Solved] SI 630 Homework 0 Regular Expressions

$25

File Name: SI_630_Homework_0__Regular_Expressions.zip
File Size: 357.96 KB

SKU: [Solved] SI 630 Homework 0 – Regular Expressions Category: Tag:
5/5 - (1 vote)

Youve been asked to perform a security audit for a large university. They want to know what kinds of email addresses might be recoverable from each web page. Conveniently, theyve already put together all of the web pages for you into a single file, where the HTML page for each page is on one line. Further, every page is guaranteed to have one email on it at most, since no one lists two email addresses for themselves on a page. However, not everyone lists their email on a page, so some pages have no email addresses! The big challenge is that there is no consistency in how the addresses are formatted!

Problem 1. Write a program that uses regular expressions to extract and canonicalize email addresses from web pages. Hint: regex groups may come in handy here. You will be provided with a large file of web pages on Canvas where each page is on a separate line. Your program will produce a new file the canonicalized email address found on each page or the word None if no email address was found. By canonicalized, we mean that if the author wrote myname at domain dot edu, you would output [email protected] in your file. Your output should have the same number of output lines as the input file.

Reviews

There are no reviews yet.

Only logged in customers who have purchased this product may leave a review.

Shopping Cart
[Solved] SI 630 Homework 0  Regular Expressions[Solved] SI 630 Homework 0 Regular Expressions
$25