Close Menu

COMP6130

Course Title: 
Big Data Analytics
Credits: 
3
Core Course: 
yes
Course Aims: 

This course will focus on the processing and analysis of large datasets (Big Data) while applying parallel machine learning techniques for handling these datasets. The use of tools like Apache Spark, Hadoop, MapReduce and NoSQL systems will be leveraged to speed up computation.
The majority of the course material will be drawn from textbooks and research papers.

Learning Outcomes: 

On successful completion of this course, student should be able to:

  •   Understand the concepts of Big Data

  •   Build predictive systems that rely on large datasets.

  •   Analyze data streams using appropriate technology (e.g. Apache Spark)

  •   Process large datasets to extract valuable information

  •   Plan and implement a strategy for big data management in an organization

  •   Store and process unstructured data

  •   Apply the appropriate parallel machine learning algorithms to reduce computation

  •   Design graphical model solutions to problems

Syllabus: 
  1. Overview

    • Introduction to Big Data
    • Why Big Data?
    • Characteristics of Big Data
  2. Big Data Infrastructure (e.g. Apache Hadoop + MapReduce)

  3. Stream Processing using appropriate technology (e.g. Apache Spark)

  4. Machine Learning systems for Big Data

    • Data Exploration
    • Data Preparation
    • Regression, Classification and Association Analysis
    • Data Visualization
    • Evaluation of Machine Learning Models
  1. NoSQL Systems for Big Data

  2. Graph Analytics for Big Data

  3. Clustering Analysis for Big Data

  4. Recommendation Systems using Big Data

  5. Big Data Management

Course Assessment: 

Course work                                                           60%

  1. In-course test.                           10%
  2. Projects (2).                               40%
  3. Homework assignment (2).       10%

Final Written Examination (3 hrs).                           40%

Students will be required to pass both the coursework and the final examination to pass the course.

Top of Page