2021/22 Taught Postgraduate Module Catalogue
OCOM5204M Data Mining and Text Analytics
15 creditsClass Size: 100
Module manager: Professor Eric Atwell
Taught: 1 Jan to 28 Feb, 1 Jan to 28 Feb (adv year) View Timetable
Year running 2021/22
|OCOM5100M||Programming for Data Science|
This module is not approved as an Elective
Module summaryThe module will provide an introduction to linguistic theory and terminology. Students will develop understanding of and the ability to use algorithms and resources for implementing and evaluating text mining and analytics systems. Students will be supported to develop solutions using open-source and commercial toolkits, and will be encouraged to consider the applications of data mining and text analytics through case studies in information retrieval and extraction.
ObjectivesThe module will provide an introduction to linguistic theory and terminology. Students will develop understanding of and the ability to use algorithms and resources for implementing and evaluating text mining and analytics systems. Students will be supported to develop solutions using open-source and commercial toolkits, and will be encouraged to consider the applications of data mining and text analytics through case studies in information retrieval and extraction.
On completion of this module students should be able to:
(1) understand theory and terminology of empirical modelling of natural language;
(2) understand and use algorithms, resources and techniques for implementing and evaluating text mining and analytics systems;
(3) demonstrate familiarity with some of the main text mining and analytics application areas;
(4) appreciate why unrestricted natural language processing is still a major research task.
Indicative content for this module includes:
Theory and terminology in Data Mining and Computational Linguistics
Data and text mining tools and resources for practical applications
Data sources and data warehouses
Tools and techniques for data preparation
Supervised machine learning
Unsupervised machine learning
Association, collocation and co-occurrence discovery
Evaluation methods and metrics
Open-source and commercial text mining and text analytics tools
Web-based text analytics
Case studies of research and commercial applications
|Delivery type||Number||Length hours||Student hours|
|Independent online learning hours||28.00|
|Private study hours||104.00|
|Total Contact hours||18.00|
|Total hours (100hr per 10 credits)||150.00|
Private studyPrivate study will include directed reading and exercises and self-directed research in support of learning activities, as well as in preparation for assessments.
Independent online learning involves non-facilitated directed learning. Students will work through bespoke interactive learning resources and activities in the VLE.
Opportunities for Formative FeedbackOnline learning materials will provide regular opportunity for students to check their understanding (for example through formative MCQs with automated feedback). Regular group activity embedded into learning will allow self and peer assessment providing opportunities for formative feedback from peers and tutors.
Students will complete a formative group assessment in the same format as the final individual summative assessment, providing an opportunity for formative feedback.
Methods of assessment
|Assessment type||Notes||% of formal assessment|
|Online Assessment||Online Test||20.00|
|Online Assessment||Online Test||30.00|
|Total percentage (Assessment Coursework)||100.00|
Resit will be by Project Report on a different project.
Reading listThe reading list is available from the Library website
Last updated: 13/01/2022 16:49:43
Browse Other Catalogues
- Undergraduate module catalogue
- Taught Postgraduate module catalogue
- Undergraduate programme catalogue
- Taught Postgraduate programme catalogue
Errors, omissions, failed links etc should be notified to the Catalogue Team.PROD