Name: [Solved] SI 630 Homework 0 Regular Expressions
Brand: Assignment Chef
SKU: [Solved] SI 630 Homework 0 – Regular Expressions
Price: 25 USD
Availability: InStock
Rating: 5 (1 reviews)

5/5 - (1 vote)

Youve been asked to perform a security audit for a large university. They want to know what kinds of email addresses might be recoverable from each web page. Conveniently, theyve already put together all of the web pages for you into a single file, where the HTML page for each page is on one line. Further, every page is guaranteed to have one email on it at most, since no one lists two email addresses for themselves on a page. However, not everyone lists their email on a page, so some pages have no email addresses! The big challenge is that there is no consistency in how the addresses are formatted!

Problem 1. Write a program that uses regular expressions to extract and canonicalize email addresses from web pages. Hint: regex groups may come in handy here. You will be provided with a large file of web pages on Canvas where each page is on a separate line. Your program will produce a new file the canonicalized email address found on each page or the word None if no email address was found. By canonicalized, we mean that if the author wrote myname at domain dot edu, you would output [email protected] in your file. Your output should have the same number of output lines as the input file.

Reviews

There are no reviews yet.

Only logged in customers who have purchased this product may leave a review.

Whatsapp Us

[Solved] SI 630 Homework 0 Regular Expressions

Reviews

Related products

[Solved] SI 630 Homework 2 Word Embeddings

[Solved] SI 630 Homework 4 Latent Dirichlet Allocation

[Solved] SI 630 Homework 1 Classification

[Solved] SI 630 Homework 3 Dependency Parsing

[Solved] SI 630 Homework 5 Generation and Detection via Transformers