Module and Programme Catalogue

Search site

Find information on

2020/21 Taught Postgraduate Module Catalogue

COMP5111M Big Data Systems

15 creditsClass Size: 230

Module manager: Dr Evangelos Pournaras
Email: e.pournaras@leeds.ac.uk

Taught: Semester 2 (Jan to Jun) View Timetable

Year running 2020/21

This module is not approved as an Elective

Module summary

The aim of the module is for students to develop a practical understanding of methods, techniques and architectures needed to build big data systems required, so that knowledge may be extracted from large heterogeneous data sets.

Objectives

On completion of this module, students should be able to:
- Understand the history, characteristics and future of big data
- Understand the contexts in which big data systems are applied
- Design systems architectures that could be used to implement big data solutions for given scenarios
- Implement key components of a big data system

Learning outcomes
On completion of the year/programme students should have provided evidence of being able to:
-to demonstrate in-depth, specialist knowledge and mastery of techniques relevant to the discipline and/or to demonstrate a sophisticated understanding of concepts, information and techniques at the forefront of the discipline;
-to exhibit mastery in the exercise of generic and subject-specific intellectual abilities;
-to demonstrate a comprehensive understanding of techniques applicable to their own research or advanced scholarship;
-proactively to formulate ideas and hypotheses and to develop, implement and execute plans by which to evaluate these;
-critically and creatively to evaluate current issues, research and advanced scholarship in the discipline.


Syllabus

Overview: history and definitions of big data, the five 'Vs' (Volume, Velocity, Variety, Veracity & Value), technology landscape, and future predictions (data, analysis capacity, business opportunities & employment). Application contexts: Structure & properties of data, analysis scenarios and case studies (e.g., nuclear physics (volume), social media (velocity), and medical bioinformatics or consumer retail (variety)). Systems architectures, encompassing data acquisition, storage, linkage, computation, security/confidentiality, and end-users. Key system components, e.g., Hapoop, MapReduce, parallel databases, SQL vs. NoSQL, algorithm scalability, and exploiting existing infrastructure (e.g., Cloud).

Teaching methods

Delivery typeNumberLength hoursStudent hours
Lecture221.0022.00
Practical111.0011.00
Private study hours117.00
Total Contact hours33.00
Total hours (100hr per 10 credits)150.00

Opportunities for Formative Feedback

Coursework and labs.

Methods of assessment


Coursework
Assessment typeNotes% of formal assessment
In-course MCQMCQ Quiz I (Big Data Systems and Architectures)20.00
In-course MCQMCQ Quiz II (Decentralized Analytics and AI)20.00
In-course AssessmentPractical Exercise (Minerva)20.00
Total percentage (Assessment Coursework)60.00

This module will be reassessed by an online time-constrained assessment.


Exams
Exam typeExam duration% of formal assessment
Online Time-Limited assessment48 hr 40.00
Total percentage (Assessment Exams)40.00

This module will be reassessed by an online time-constrained assessment.

Reading list

There is no reading list for this module

Last updated: 07/12/2020

Disclaimer

Browse Other Catalogues

Errors, omissions, failed links etc should be notified to the Catalogue Team.PROD

© Copyright Leeds 2019