2022/23 Undergraduate Module Catalogue
XJCO2121 Data Mining
10 credits
Class Size: 100
Module manager: Professor Eric Atwell
Email: e.s.atwell@leeds.ac.uk
Taught: Semester 2 (Jan to Jun)
Year running 2022/23
Pre-requisites
XJCO1121 | Databases |
This module is not approved as a discovery module
Module summary
This module explores the data mining process and its application in different domains such as text and web mining. You will learn the principles of data mining, compare a range of techniques, algorithms and tools, and learn how to evaluate their performance.
Objectives
On completion of this module, students should be able to:
- understand the data mining process and its application in different domains such as text and web mining;
- understand the principles of data mining;
- compare a range of different techniques, algorithms and tools and evaluate their performance;
- demonstrate familiarity with some of the main application areas;
- demonstrate familiarity with data mining and text analytics tools.
Learning outcomes
On completion of this module, students should be able to:
- understand the data mining process and its application in different domains such as text and web mining;
- understand the principles of data mining;
- compare a range of different techniques, algorithms and tools and evaluate their performance;
- demonstrate familiarity with some of the main application areas;
- demonstrate familiarity with data mining and text analytics tools.
Syllabus
Introduction to data mining terminology and components of the data mining process, text analytics, and SketchEngine; tools and techniques for data collection and data cleansing, use of machine learning classifiers for data classification, open-source and commercial data mining and text analytics resources and toolkits, CRISP-DM and WEKA; word meanings, text tagging, and scaling to big data; use of clustering and association tools for data mining, chatbots for university education; Machine Translation, Information Extraction, and Python tools for text analytics; web-based text analytics; case studies of current research and commercial applications in data mining and text analytics, BERT.
Teaching methods
Delivery type | Number | Length (hours) | Student hours |
Laboratory | 12 | 1.00 | 12.00 |
Class tests, exams and assessment | 2 | 2.00 | 4.00 |
Lecture | 8 | 1.00 | 8.00 |
Private study hours | 76.00 |
Total contact hours | 24.00 |
Total hours (100 hours per 10 credits) | 100.00 |
Opportunities for Formative Feedback
Coursework and labs.
Methods of assessment
Coursework
Assessment type | Notes | % of formal assessment |
In-course Assessment | Report | 60.00 |
In-course Assessment | Test 2 | 20.00 |
In-course Assessment | Test 1 | 20.00 |
Total percentage (Assessment Coursework) | 100.00 |
Resits will be assessed by coursework.
Reading list
The reading list is available from the Library website.
Last updated: 01/06/2022 16:59:02