[SOLVED] python crawler

$25

File Name: python_crawler.zip
File Size: 131.88 KB

5/5 - (1 vote)

Similar code provided(I attached sample code for mining the site IMDB, which can extract information of each movie). Please finish this Project FOLLOW THAT CODE( USE THE SAME ALGORITHEM)

Run sample code first, you will understand these are similar~~~ <3

Python code instruction:

What do we expect?

From the website provided, record each product, and specific information of it:

CPU total number 13

Video Card total number 12

Hard Drive total number 79

Power Supply total number 12

Memory total number 86

SSD total number 55

* CPUs/Processors

1, go to website

http://www.newegg.ca/Product/ProductList.aspx?Submit=StoreIM&Depa=1&Category=34

2,Extract the nameof the CPUs, and Priceas the first two columns of CSV.file

3,Click in CPUs, under the picture(Next to Overview), select Specifications ,then extract following information:

For Model part

  • Brand
  • Series

For Tech Spec

  • # of Cores
  • Name
  • Operating Frequency
  • L2 Cache
  • L3 Cache

Put all above in CSV column 3-9

4, Next to Specification, click the Reviewpart, extract listed 10 reviews of each product. Write in CSV column 10

CSV file-Example (There is only one here, Please do it for all 13 CPUs):

CPUs Price Brand Series # of Cores Name Operating Frequenct L2 Cache L3 Cache 10 reviews

Intel Core i7-4790K Devils Canyon Quad-Core 4.0GHz LGA 1150 Desktop Processor Intel HD Graphics 4600 BX80646I74790K

409.99 Intel Core i7 Quad-Core Core i7-4790K 4.0GHz 4 x 256KB 8MB Pros: Fast,

Cons: None.

Other Thoughts: 4690k is just as good at gaming

Pros:

Cons:

Other Thoughts:

Pros:

Cons:

Other Thoughts:

**Video Cards/Video Devices

1, go to website

http://www.newegg.ca/Product/ProductList.aspx?Submit=StoreIM&Depa=1&Category=38

2,Extract the nameof the Video Card, and the Priceas the first two columns of CSV.file

3,Click in the Video Card, under the picture(Next to Overview), select Specifications,then extract following information:

For Model part

  • Brand
  • Model

For Chipset

  • Chipset Manufacturer
  • GPU
  • Core Clock
  • Boost Clock

For Memory

  • Memory Type
  • Memory Size

For General

  • Slot Width

Put all above in CSV column 3-11

4, Next to Specification, click the Reviewpart, extract listed 10 reviews of each product. Write in CSV column 12

CSV file-Example (There is only one here, Please do it for all 12 Video Cards):

Video Card Price Brand Model Chipset Manufacturer GPU Core Clock Boost Clock Memory Type Memory Size Slot Width 10 reviews
EVGA 02G-P4-2966-KR GeForce GTX 960 Gaming 2GB 128-Bit GDDR5 PCI Express 3.0 HDCP Ready SLI Support SuperSC ACX 2.0+ Video Card 264.99 EVGA 02G-P4-2966-KR NVIDIA GeForce GTX 960 1279 MHz 1342 MHz GDDR5 2GB Dual Slot Pros: Quiet
Cool
Powerful
No longer than a standard GPU.
Low energy for the performance.
Great value.

Cons: None.

***Hard Drives

1, go to website

http://www.newegg.ca/Product/ProductList.aspx?Submit=StoreIM&Depa=1&Category=15

2,Extract the nameof the Hard Drive, and Priceas the first two columns of CSV.file

3,Click in the Hard Drive, under the picture(Next to Overview), select Specifications,then extract following information:

For Model part

  • Brand
  • Series
  • Model

For Performance

  • Capacity
  • RPM

Put all above in CSV column 3-7

4, Next to Specification, click the Reviewpart, extract listed 10 reviews of each product. Write in CSV column 8

CSV file-Example (Please do it for all 79 Video Cards):

Hard Drives Price Brand Series Model Capacity RPM 10

Reviews

****Power Supplies

1, go to website

http://www.newegg.ca/Product/ProductList.aspx?Submit=StoreIM&Depa=1&Category=32

2,Extract the nameof the Power supply, and Priceas the first two columns of CSV.file

3,Click in the Power supply, under the picture(Next to Overview), select Specifications,then extract following information:

For Model part

  • Brand
  • Model

For Spec

  • Maximum power
  • Fans
  • PFC

Put all above in CSV column 3-7

4, Next to Specification, click the Reviewpart, extract listed 10 reviews of each product. Write in CSV column 8

CSV file-Example (Please do it for all 12 Power Supplies):

Power Supplies Price Brand Model Maximum power Fans PFC 10 reviews

*****Memory

1, go to website

http://www.newegg.ca/Product/ProductList.aspx?Submit=StoreIM&Depa=1&Category=17

2,Extract the nameof the Memory, and Priceas the first two columns of CSV.file

3,Click in the Memory, under the picture(Next to Overview), select Specifications,then extract following information:

For Model part

  • Brand
  • Series
  • Model

For Tech Spec

  • Capacity
  • Speed

Put all above in CSV column 3-7

4, Next to Specification, click the Review part, extract listed 10 reviews of each product. Write in CSV column 8

CSV file-Example (There is only one here, Please do it for all 86 Memory s):

Memory Price Brand Series Model Capacity Speed 10 reviews

******SSDs

1, go to website

http://www.newegg.ca/Product/ProductList.aspx?Submit=StoreIM&Depa=1&Category=119

2,Extract the nameof the SSD, and Priceas the first two columns of CSV.file

3,Click in the SSD, under the picture(Next to Overview), select Specifications,then extract following information:

For Model part

  • Brand
  • Series
  • Model

For Details

  • Capacity

For Performance

  • Max Sequential Read
  • Max Sequential write
  • MTTF

Put all above in CSV column 3-9

4, Next to Specification, click the Review part, extract listed 10 reviews of each product. Write in CSV column 10

CSV file-Example (There is only one here, Please do it for all 55 SSDs):

SSDs Price Brand Series Model Capacity Max Sequential Read Max Sequential Write MTTF 10 reviews

Again~

Sample code provided(I attached sample code for the site IMDB, which can extract information of each movie). Please finish this Project FOLLOW SAMPLE CODE

Run sample code first, you will understand these are similar~~~ <3

Finally , I need the final version of PYTHON CODE WITH 6 CSV.files

for :

CPU total number 13 (10 Col * 14 Rows)

Video Card total number 12 (12 Col * 13 Rows)

Hard Drive total number 79 (8 Col * 80 Rows)

Power Supply total number 12 (8 Col * 13 Rows)

Memory total number 86 (8 Col * 87 Rows)

SSD total number 55 (10 Col *56 Rows)

Ps: This project is really important for me, if you cant do it or finish it on time, please inform me as soon as possible!!!

Reviews

There are no reviews yet.

Only logged in customers who have purchased this product may leave a review.

Shopping Cart
[SOLVED] python crawler
$25