MN3024 Data Mining

Academic Year 2019/20

Module Handbook

Module Coordinator:
Dr Karima Dyussekeneva
Office: Bay Campus, School of Management Building, Third Floor, Room 320
Office Hours: Wednesday 4.30-5.30 pm; Friday 2.30-3.30 pm
Email: k.dyussekeneva@swansea.ac.uk

Teaching Staff:
Dr Karima Dyussekeneva
Office: Bay Campus, School of Management Building, Third Floor, Room 320
Office Hours: Wednesday 4.30-5.30 pm; Friday 2.30-3.30 pm
Email: k.dyussekeneva@swansea.ac.uk

Teaching Staff:
Dr Fred Boy
Office: Bay Campus, School of Management Building, Third Floor, Room 342

Office Hours: Monday 12:30-13:30; Friday 10:30-11:30
Email: f.a.boy@swansea.ac.uk

School of Management
MN3024 Data Mining

Module Overview

Introduction

The field of Data Mining is still relatively new and in a state of evolution. Data Mining stands at the confluence of the fields of statistics and machine learning (also known as artificial intelligence). A variety of techniques for exploring data and building models have been around for a long time in the world of statistics, for example linear regression, discriminant analysis and principal component analysis. Computer science has brought machine learning techniques, such as trees and neural networks, that are less structured than classical statistical models and more computationally intensive. In addition, the growing field of database management is also part of the Data Mining structure.

Today Data Mining is used in a variety of fields and applications. Enterprises benefit from collecting and analysing their data, hospitals can spot trends and anomalies in their patient records, and search engines can improve ranking and ad placement. The list continues with cybersecurity and computer network intrusion detection, financial and business intelligence, and many more.

This booklet contains:
an introduction to the module
lecture and seminar locations
details of the core textbooks via the reading list
information on assessment and feedback, including the coursework brief
an overview of the entire module

Lecture/Seminar Locations

Lectures will take place in SoM 239 on Friday morning, 11.00-13.00.

Seminars will take place in SoM 128 on Friday afternoon, 13.00-14.00.

Please note: Lecture/seminar times and locations may change in the first two weeks of term. Please check Blackboard announcements and the timetable data displayed on the Intranet for regular updates.

Communication

Lecture notes will be posted on Blackboard along with announcements and any administrative notices.

Learning Outcomes

On completion of this module students should be able to:

Recognise and recall basic data mining concepts and different data mining techniques appropriate for analysing various business problems.

Decide on and select an appropriate data mining instrument to analyse a relevant business problem, and describe the main functions of the selected technique.

Solve hypothetical business problems by applying appropriate data mining techniques, and interpret the outputs in the data mining and business contexts.

Use specialist software such as SPSS and Weka for utilising data mining algorithms in solving tasks such as classification, association and prediction.

Analyse outputs obtained by different data mining techniques, and inspect relationships between changes in the algorithm parameters and those in the outputs. Compare the performances of various data mining techniques, and balance complexity with accuracy in view of the relevant business problem.

Present and defend opinions in selecting appropriate data mining techniques for the stated business problem by evaluating the validity of the application outputs according to criteria such as accuracy, computational complexity and strengths and weaknesses of the instruments.

Reading Material

The full reading list for this module is available via Blackboard in the Reading List folder.

The core textbook for the module is either of the following:

Data Mining, Third Edition. Ian Witten. Elsevier.

Introduction to Data Mining. Pang-Ning Tan, Michael Steinbach, Vipin Kumar. Pearson.

A core textbook is only a starting point and provides introductory and background information only. Supplemental reading will be identified at each lecture. To achieve high marks in this module, students will need to do the background and supplemental reading as well as conduct their own independent research into the topics identified, for instance through the reading of academic journals.

Assessment

The assessment for the module is structured as follows:
2 x 50% individual coursework report (project report on a data mining exercise)

Feedback on the coursework will be provided within three calendar weeks of submission. All feedback for the coursework assignment will be provided through GradeMark. Marks will be made available via Grade Centre in Blackboard and your university student portal.

Individual Coursework Assignment

Each coursework assignment for this module is an individual assignment worth 50% of the overall module mark.

Coursework Brief

Coursework 1

Regression models for data mining: individual project. In this project, you will focus on applying logistic regression to build a classification model based on the Riding Mowers (SPSS) data. The data will be provided on Blackboard.
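As an illustration of what a logistic regression classifier does, the following is a minimal sketch in plain Python. It is not the SPSS procedure used in the coursework: the function names (fit_logistic, predict), the learning rate, the number of epochs and the six data rows are all assumptions made up for this example, and the rows are not taken from the Riding Mowers data.

```python
import math

def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def fit_logistic(X, y, lr=0.1, epochs=2000):
    """Fit weights and intercept by batch gradient descent on the log-loss."""
    n, n_features = len(X), len(X[0])
    w = [0.0] * n_features
    b = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * n_features
        grad_b = 0.0
        for xi, yi in zip(X, y):
            p = sigmoid(b + sum(wj * xj for wj, xj in zip(w, xi)))
            err = p - yi  # derivative of the log-loss w.r.t. the linear score
            for j in range(n_features):
                grad_w[j] += err * xi[j]
            grad_b += err
        w = [wj - lr * g / n for wj, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b

def predict(w, b, x, threshold=0.5):
    """Return class 1 if the predicted probability reaches the threshold."""
    p = sigmoid(b + sum(wj * xj for wj, xj in zip(w, x)))
    return 1 if p >= threshold else 0

# Hypothetical, already-standardised rows with two predictors (made up).
X = [[-1.0, -0.8], [-0.7, -1.1], [-0.5, -0.2],
     [0.4, 0.6], [0.9, 0.7], [1.2, 1.0]]
y = [0, 0, 0, 1, 1, 1]

w, b = fit_logistic(X, y)
print([predict(w, b, x) for x in X])
```

The 0.5 threshold is a common default rather than a requirement; changing it trades one type of classification error against the other.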

Coursework 2

Classification and prediction models for data mining: individual project. In this project, you will focus on applying a decision tree to build a classification model based on the Diabetes (WEKA) data. The data will be provided on Blackboard.
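To give a feel for how a decision tree chooses a split, the sketch below scores candidate thresholds on one numeric attribute by information gain. It is a simplified illustration, not Weka's implementation; the function names and the six attribute values and labels are hypothetical and are not taken from the Diabetes data.

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def best_threshold(values, labels):
    """Return (threshold, information gain) for the best binary split."""
    base = entropy(labels)
    pairs = list(zip(values, labels))
    distinct = sorted(set(values))
    best_t, best_gain = None, 0.0
    # Candidate thresholds are midpoints between adjacent distinct values.
    for lo, hi in zip(distinct, distinct[1:]):
        t = (lo + hi) / 2
        left = [lab for v, lab in pairs if v <= t]
        right = [lab for v, lab in pairs if v > t]
        remainder = (len(left) * entropy(left)
                     + len(right) * entropy(right)) / len(labels)
        gain = base - remainder
        if gain > best_gain:
            best_t, best_gain = t, gain
    return best_t, best_gain

# Hypothetical measurements and test outcomes (made up for illustration).
values = [85, 90, 100, 140, 150, 160]
labels = ["neg", "neg", "neg", "pos", "pos", "pos"]

threshold, gain = best_threshold(values, labels)
print(threshold, gain)
```

A full tree learner repeats this search over every attribute and then recurses into each branch; pruning and stopping rules are left out here.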

Key marking criteria will include:

Initiative: originality, innovativeness of answer
Assignment Structure: clarity of aims, objectives, structure and presentation
Quality of Writing: readability and ability to convey key messages concisely
Quality/Scope of Literature Review: understanding of established knowledge
Suitability of Literature: use of suitable sources, focused to answer key research aims
Literature Analysis: quality/level of analytical skill demonstrated
Insightfulness of Analysis: interest and usefulness of findings, conclusions drawn
Understanding: assignment demonstrates students have understood key topics
Overall Quality of Assignment

Submission

Assignment one must be submitted by 3pm on Monday 11th of November via Turnitin.

Assignment two must be submitted by 3pm on Monday 9th of December via Turnitin.

Please note:

The maximum file size that can be uploaded is 20 MB. If your file is larger than this, it is usually because you have included a lot of images; you should either remove some if possible, or else convert them to a more efficient format (e.g. .png or .gif) to bring the file size down.

You should ensure your student number is in the title of the filename for the work you submit/upload.

For undergraduate students, the School of Management operates a late penalty system as follows: when work is submitted late with no prior authorisation, a penalty of 10 marks will be deducted from the actual mark for each calendar day, or part of a day, by which the deadline is exceeded. After seven days from the initial deadline, a mark of zero will be awarded. Late work should be submitted electronically in the usual way.

If the penalty for late submission exceeds the mark awarded for the component then the component will receive a final mark of 0.
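The late penalty arithmetic above can be sketched as a small function. This is an illustrative reading of the rule, not an official calculator: the function name final_mark, the 0-100 mark scale and the choice to express lateness as a (possibly fractional) number of days past the deadline are assumptions.

```python
import math

def final_mark(actual_mark, days_late):
    """Apply the undergraduate late penalty to a component mark.

    days_late is the time past the deadline in days; any part of a day
    counts as a full day (assumption: lateness is measured from the deadline).
    """
    if days_late <= 0:
        return actual_mark          # submitted on time, no penalty
    if days_late > 7:
        return 0                    # more than seven days late: mark of zero
    penalty = 10 * math.ceil(days_late)
    return max(0, actual_mark - penalty)  # the mark cannot fall below 0

print(final_mark(65, 0.5))  # half a day late counts as one day
print(final_mark(35, 4))    # penalty exceeds the mark awarded
```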

Digital Submission of Coursework Instructions

Log on to Blackboard.
Access the appropriate Module site.
Click the Assignment menu button which appears on the left of the screen.
In this folder you will see a file entitled Student Declaration form. You need to complete this form and incorporate it as the first page of your coursework (not as two separate files).

Click Coursework. Please read the statement of originality before you click submit. By submitting work you are agreeing to this statement and confirming it to be true.
Complete the dialogue box with your forename and surname.
To submit your coursework, locate the correct file on your computer by clicking the browse button and enter a title for the coursework (we suggest the module code and your student ID, e.g. MNB108 123456). Click SUBMIT.
You will then be asked to check if the document is the one you wish to submit and, if so, click YES, SUBMIT.
You will then receive a message saying "paper successfully complete".
Blackboard will then send you a confirmation email of submission. Please keep this receipt safe as evidence of your submission.

If you experience any difficulties submitting your work via Turnitin please contact the Student Hub straight away at SoMAssessmentswansea.ac.uk

Notes on Style and Word Count

Assignments are a critical part of the learning experience and development for scholars at Swansea University. Practice will pay dividends when it comes to honing your skills in report and essay writing. Students are therefore encouraged to submit the highest quality work they can to reach their maximum potential. Students with concerns about how to present their work can consult the Module Coordinator for guidance in addition to the notes listed below:

The maximum word limit for the main assignment (excluding references, tables, contents page, footnotes, charts, graphs and figures) is 2000 words. The word count must be stated on the assignment cover sheet.

Markers will stop marking once the word count or time limit has been reached, likely leading to a reduced overall mark as key arguments or conclusions will not be included in the marked work.

Students who submit work that is below the word limit will not be penalised. This is because students will not have taken full advantage of the word limit available to them, which in itself may constitute a penalty.

Video, Audio or other Assessment Types
For some assessments students may be required to submit a video, audio or other digital media item. The University's overarching privacy policy advises students that the University will collect photographs and video recordings for the purpose of recording lectures, student assessment and examinations. The processing and storage of this information is lawful as it is necessary for the performance of a contract with the student and will apply to any personal data that we process for the purposes of administering and delivering their course of study.
https:www.swansea.ac.ukmediaStudentDataProtectionStatement1819.pdf

Proof Reading
Please be aware of the university's Proof Reading policy, which sets out what the university considers to be good academic practice in relation to proof reading. The School of Management allows proof reading, but please be aware of the requirements around this, including keeping an evidence trail relating to any proof reading and whether it is formal or informal. Further information can be found here.

Module Schedule

Week 1 (w/c 30/09)
Topic: Module Introduction
Lecture contents: Introduction to the course and overview of content.
Seminar contents: Courseworks: general highlights.
Key readings:
Ian H. Witten et al., Practical Machine Learning Tools and Techniques, Chapter 1
Pang-Ning Tan et al., Introduction to Data Mining, Chapter 1

Week 2 (w/c 7/10)
Topic: Data Input: concepts, instances and attributes
Lecture contents: This session will introduce styles of learning in data mining. It will look at the types of data attributes and data examples, as well as data preprocessing.
Seminar contents: Data preprocessing exercise: descriptive analysis, data visualisation, missing values, outliers.
Key readings:
Ian H. Witten et al., Practical Machine Learning Tools and Techniques, Chapter 2
Pang-Ning Tan et al., Introduction to Data Mining, Chapter 2

Week 3 (w/c 14/10)
Topic: Data Output: knowledge representation, linear models
Lecture contents: This session will introduce linear models for data mining, such as linear and logistic regression.
Seminar contents: Logistic regression exercise.
Key readings:
Ian H. Witten et al., Practical Machine Learning Tools and Techniques, Chapter 4

Week 4 (w/c 21/10)
Topic: Validation and evaluating output
Lecture contents: This session will introduce validation and evaluation in data mining. Methods such as holdout estimation, cross-validation and bootstrapping will be looked at. Evaluation of numeric prediction, such as error measures and Student's t-test, will be considered.
Seminar contents: Numeric prediction evaluation exercise.
Key readings:
Ian H. Witten et al., Practical Machine Learning Tools and Techniques, Chapter 5

Week 5 (w/c 28/10)
Topic: Simple algorithms: association rules
Lecture contents: This session will look at rudimentary rules, covering algorithms and association rules.
Seminar contents: Association rules exercise.
Key readings:
Ian H. Witten et al., Practical Machine Learning Tools and Techniques, Chapters 3, 4, 11 (11.7)
Pang-Ning Tan et al., Introduction to Data Mining, Chapter 6

Week 6 (w/c 4/11)
Topic: Decision trees
Lecture contents: This session will look at decision tree algorithms for data mining.
Seminar contents: Decision trees exercise.
Key readings:
Ian H. Witten et al., Practical Machine Learning Tools and Techniques, Chapters 3, 4
Pang-Ning Tan et al., Introduction to Data Mining, Chapter 4

Week 7 (w/c 11/11)
Topic: Clustering
Lecture contents: This session will look at cluster analysis: basic concepts and algorithms.
Seminar contents: Cluster analysis exercise.
Key readings:
Ian H. Witten et al., Practical Machine Learning Tools and Techniques, Chapters 4, 6
Pang-Ning Tan et al., Introduction to Data Mining, Chapter 8

Week 8 (w/c 18/11)
Topic: Advanced methods
Lecture contents: This session will introduce advanced data mining techniques, such as support vector machines and neural networks.
Seminar contents: SVM and neural networks exercise.
Key readings:
Ian H. Witten et al., Practical Machine Learning Tools and Techniques, Chapter 6

Week 9 (w/c 25/11)
Topic: Ensemble learning
Lecture contents: This session will look at ensemble learning and combining multiple methods. Algorithms used: bagging, boosting, stacking.
Seminar contents: Ensemble learning exercise.
Key readings:
Ian H. Witten et al., Practical Machine Learning Tools and Techniques, Chapter 8
Pang-Ning Tan et al., Introduction to Data Mining, Chapter 5

Week 10 (w/c 2/12)
Topic: Data transformations
Lecture contents: This session will look at attribute selection for data mining, sampling and data calibration.
Seminar contents: Data transformations exercise.
Key readings:
Ian H. Witten et al., Practical Machine Learning Tools and Techniques, Chapter 7
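
Cross-validation, listed for week 4 of the schedule, can be illustrated with a short sketch that splits row indices into k folds, each fold serving once as the test set. The function name k_fold_indices is hypothetical, and the folds here are contiguous with no shuffling or stratification, which a real evaluation would normally add.

```python
def k_fold_indices(n, k):
    """Yield (train_indices, test_indices) for k roughly equal folds."""
    # The first n % k folds receive one extra row so every row is used once.
    fold_sizes = [n // k + (1 if i < n % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        test = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n))
        yield train, test
        start += size

# Ten rows split into three folds.
for train, test in k_fold_indices(10, 3):
    print("train:", train, "test:", test)
```

Averaging an error measure over the k test folds gives the cross-validated estimate; holdout estimation corresponds to using a single train/test split instead.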

Generic Marking Programme

Each band gives the mark range, class and descriptor, followed by the criteria for Information and knowledge, Application, Analysis, Synthesis and context, and Evaluation.

80-100: First (Outstanding)
Information and knowledge: Contains all information required, with no errors. Evidence of study beyond the module content.
Application: Answers question fully and completely. Excellent adaptation and application of concepts. No irrelevant material.
Analysis: Ideas expressed logically and coherently. Excellent use of appropriate mathematical/diagrammatic exposition.
Synthesis and context: Excellent integration of ideas and information. Demonstrates outstanding understanding of topic within a wider context.
Evaluation: Shows evidence of significant independent thinking and critical awareness, and originality.

70-79: First (Excellent)
Information and knowledge: Contains all information required, with no major errors and no or very few minor errors.
Application: Answers question fully and completely. Good adaptation and application of concepts. Little or no irrelevant material.
Analysis: Ideas expressed logically and coherently. Effective use of appropriate mathematical/diagrammatic exposition.
Synthesis and context: Effective integration of ideas and information. Demonstrates substantial understanding of topic within a wider context.
Evaluation: Shows evidence of sound independent thinking and critical awareness.

60-69: Upper second (Very good)
Information and knowledge: Contains all or almost all information required, with no major errors and only a few minor errors.
Application: Answers question fully. Some adaptation and application of concepts. Little or no irrelevant material.
Analysis: Ideas generally expressed logically and coherently. Competent use of appropriate mathematical/diagrammatic exposition.
Synthesis and context: Competent integration of ideas and information. Demonstrates some understanding of topic within a wider context.
Evaluation: Shows some evidence of independent thinking and critical awareness.

50-59: Lower second (Good)
Information and knowledge: Contains most information required, with no or very few major errors and some minor errors.
Application: Partially answers question. Limited adaptation and application of concepts. Some irrelevant material.
Analysis: Ideas not always expressed logically and coherently. Adequate use of appropriate mathematical/diagrammatic exposition.
Synthesis and context: Limited integration of ideas and information. Demonstrates modest but incomplete understanding of topic and its context.
Evaluation: Shows little evidence of independent thinking and critical awareness.

40-49: Third (Satisfactory)
Information and knowledge: Contains basic core information required, with some major and minor errors.
Application: Only answers some aspects of question. No adaptation or application of concepts. Some irrelevant material.
Analysis: Ideas rarely expressed logically and coherently. Limited use of appropriate mathematical expressions.
Synthesis and context: Minimal integration of ideas and information. Demonstrates limited understanding of topic and its context.
Evaluation: Shows very little or no evidence of independent thinking and critical awareness.

30-39: Fail, potentially tolerable (Poor)
Information and knowledge: Contains only a limited amount of information required, with numerous major and minor errors.
Application: Does not answer question. No adaptation or application of concepts. Much irrelevant material.
Analysis: Ideas rarely expressed logically and coherently. Little or no use of appropriate mathematical/diagrammatic exposition.
Synthesis and context: No integration of ideas and information. Demonstrates little understanding of topic and its context.
Evaluation: Shows no evidence of independent thinking or critical awareness.

0-29: Fail, not tolerable (Very poor)
Information and knowledge: Contains none or almost none of information required and with many major and minor errors.
Application: Wholly fails to answer question. No adaptation or application of concepts. Largely irrelevant material.
Analysis: Ideas expressed incoherently. No linking of ideas within text. Little or no use of appropriate mathematical/diagrammatic exposition.
Synthesis and context: No integration of ideas and information. Demonstrates no understanding of topic and its context.
Evaluation: Shows no evidence of independent thinking or critical awareness.
