- Dataset
- Click the link View All Data Sets on URL http://archive.ics.uci.edu/ml. to get the data sets assigned:
No | Group Members | Dataset [Rows x Columns] | Presentation Slot |
1 | Cem Gle *Bura AkdenizKadir Hzarc | Anuran Calls (MFCCs) [7,19522] | 17.12.2020Thursday 12.00 12.17 |
2 | Mehmet Nusret Odaba *Abbas KutayOrhan Fatih Bayazt | Early stage diabetes risk prediction dataset. [52017] | 17.12.2020Thursday 12.20 12.37 |
3 | Emin Kaan Kadolu * Mert Meng | Turkiye Student Evaluation [5,82033] | 17.12.2020Thursday 12.40 12.57 |
4 | Ayberk mer Altuntabak *Abdulhalik ensinAmela Karmaj | Phishing Websites [2,45630] | 17.12.2020Thursday 13.00 13.17 |
5 | Diala Jassem M.B.J. *Mnevver Sueda KocatrkNurhande Akyz | Census Income [48,84214] | 17.12.2020Thursday 13.20 13.37 |
6 | Osman Mantc *Buse BatmanFatmanur zdemir | MAGIC Gamma Telescope [19,02011] | 17.12.2020Thursday 13.40 13.57 |
7 | lker Fener *Doukan DenizHalil brahim imek | Letter Recognition [20,00016] | 22.12.2020Tuesday 12.00 12.17 |
8 | Zahide Gr Tatan *Merve AyerZeynep Naz Akyoku | EEG Eye State [14,98015] | 22.12.2020Tuesday 12.20 12.37 |
9 | Halid Seyfullah Sert *Dilek DndarMert lik | South German Credit (UPDATE) [1,00021] | 22.12.2020Tuesday 12.40 12.57 |
10 | Ayenur Ylmaz *Belgin TatanKevser lde | Iranian Churn Dataset [3,15013] | 22.12.2020Tuesday 13.00 13.17 |
11 | Sedanur Kara *Berke ahinSinem Onal | Australian Sign Language signs [6,65015] | 24.12.2020Thursday 12.00 12.17 |
12 | Deniz Arda Grhizin *Can Berk DurmuTarkan Batar | Firm-Teacher_Clave-Direction_Classification [10,80020] | 24.12.2020Thursday 12.20 12.37 |
13 | Furkan Akman *Burak FidanMustafa Serta ztrk | Image Segmentation [2,31019] | 24.12.2020Thursday 12.40 12.57 |
14 | Ferihan abuk *Ali Berat etinMuhammed sa Akbaba | Page Blocks Classification [5,47310] | 24.12.2020Thursday 13.00 13.17 |
15 | Ahmet Enes Gndz *Hakan YalnMuhammed Fethullah Erolu | Shill Bidding Dataset [6,32113] | 24.12.2020Thursday 13.20 13.37 |
16 | Yasin Orhan *Abdullah YazarAhmet Hakan Eki | Pen-Based Recognition of Handwritten Digits [10,99216] | 24.12.2020Thursday 13.40 13.57 |
- The first students indicated by * sign are the group representatives.
- Learn / Get information about your data.
- Python Platform & Environment
- Get a platform/environment for python work on, if you do not have any. Install it on your computer.
- You may use any libraries you want; however, you should have complete understanding to use and explain it in demo sessions.
- Implement your work with your own code as possible as you can.
- Model Construction: Classification
- Do the data preprocessing steps, if required.
- Training & Test
- Decide how you will partition your data into training and test sets.
- Use holdout method for each of your classifiers separately. iii) In addition to holdout method, use cross-validation for at least one of your classifiers. iv) In addition to above, implement bagging ensemble method for your classifiers.
- v) In addition to above, implement boosting ensemble method for your classifiers
- Make the required type node settings.
- Use your dataset to construct 6 classification models as follows:
- Decision tree using gain ratio. ii) Decision tree using gini index. iii) Nave Bayes. iv) Artificial neural networks with 1 hidden layer.
- v) Artificial neural networks with 2 hidden layers. vi) Support vector machines.
- Implementation & Model Evaluation
- Implement 6 algorithms above on your dataset using python.
- Compare the performance and the results of 6 classifiers on your test dataset.
- Compare the performances of your classifiers with performances of the relevant papers given on the site.
- Presentation
- You are going to present your work done online in 12 minutes at the time slot reserved for your group. Group members should equally participate the presentation. See the table above.
- Prepare a presentation file discussing the details of your work done and results of the classifiers.
- Your presentation should contain the following parts at least:
- Problem definition ii) Dataset
- Information about the dataset.
- Number of instances, columns, etc.
- Problem definition ii) Dataset
iii) Data preprocessing, cleaning
- Missing values, and how you conduct on these.
- Transformations and normalizations.
- Training and test dataset. iv) Python implementation for each of the 6 classifiers
- IDE/environment used.
- Implementation details.
- Libraries used.
- v) Model evaluation & performance results
- Confusion matrices.
- Values of accuracy, recall, precision etc.
- Comparison of all 6 classifiers.
- vi) Conclusion
- Demo with Presentation
- You are going to demonstrate your work done online in 5 minutes after your presentation. See the table above.
- You are going to have 17 minutes in total for your groups session (12 minutes for presentation, and 5 minutes for demonstration).
- Please keep in mind that all the presentation and demo sessions will be recorded.
- All the students are expected to attend all sessions.
- Related Questions & Answers
- Prepare 5 questions and answers related to your topic. These questions may be asked to other students.
- Question types can be multiple choice (single or multiple selection), fill in the blanks, matching, essay, etc.
- Prepare a presentation file with 11 slides consisting these 5 questions and answers. First slide will be used for your topic and group members info. Use 1 slide per each question, and 1 slide per each answer.
- Evaluation
- Your grade related to project #1 will cover 10% of your total grade at least; may increase subject to coronavirus issues.
- Evaluation will be done out of 100 points:
- [4 pts] Data set understanding. ii) [4 pts] Data preprocessing.
iii) [4 pts] Training & test set partitioning. iv) [4 pts] Models construction: Classification. v) [36 pts] Implementation.
- vi) [10 pts] Model evaluation, test and results, comparison. vii) [20 pts] Presentation quality. viii) [8 pts] Demo quality. ix) [10 pts] Questions & answers quality.
- Submission
- You are going to submit the followings:
- Python codes implemented.
- Presentation file.
- You are going to submit the followings:
- Questions & answers presentation file.
- Write the following sentence in a text file: We hereby swear that the work done on this project is totally our own; and on our honor, we have neither given nor received any unauthorized and/or inappropriate assistance for this project. We understand that by the school code, violation of these principles will lead to a zero grade and is subject to harsh discipline issues. Rename it as we_swear.txt and include this file in the zip submission file.
- Only one of the group members (i.e. group representative, in short GrRep) is going to submit the project using GrReps info all the time. However, all group members should have a complete and comprehensive understanding of all the work done for all tasks and steps of the project.
- Zip all your documents into a single file using filename GrRepStudentNumber_P1.zip (e.g. 150118123_P1.zip) and submit it to the site http://ues.marmara.edu.tr before deadline.
- In case of any form of copying and cheating on solutions, all parts will get ZERO points. You should submit your own work. In case of any forms of cheating or copying, both giver and receiver are equally culpable and suffer equal penalties. All types of plagiarism will result in zero points from the homework.
- If case of using your handwriting, your handwriting should be readable, clear and neat. If possible, do not use any handwriting.
- Do not send project submissions through e-mail. E-mail attachments will not be accepted as valid submissions.
- You are responsible for making sure you are turning in the right file, and that it is not corrupted in anyway. We will not allow resubmissions if you turn in the wrong file, even if you can prove that you have not modified the file after the deadline.
- Grade evaluation may be done on selected parts of the project, so try to complete all parts of your project successfully.
- No late submissions will be accepted.
Reviews
There are no reviews yet.