DSC 333: Introduction to Big Data Processing

This course will explore different approaches and a framework for performing data analytics on a dynamic, heterogeneous cluster of computing nodes. The course will begin with studying principles behind MapReduce and implementation of custom distributed queries using Hadoop. It will then expand to cover higher-level languages and tools within Hadoop ecosystem (e.g., Pig, Hive) and cluster configuration techniques. Finally, the course will delve into a comparative evaluation of several NoSQL and NewSQL databases that make fundamentally different assumptions for data processing (e.g., OLAP vs OLTP, disk-bound vs in-memory or real-time streaming data). The primary focus of the course will be hands-on implementation and tuning performance for large-scale clusters and data sets.

CSC 355 is a prerequisite for this class.

Spring 2025-2026

Section: 901
Class number: 33169
Meeting time: Tu 5:45PM - 9:00PM
Location: N/A at Loop Campus
Instructor:

Section: 910
Class number: 33373
Meeting time: -
Location: Online: Async
Instructor:

Winter 2025-2026

Section: 801
Class number: 23021
Meeting time: Tu 5:45PM - 9:00PM
Location: CDM 00224 at Loop Campus
Instructor: Peter Bondarenko | View syllabus

Fall 2025-2026

Section: 701
Class number: 13208
Meeting time: Tu 5:45PM - 9:00PM
Location: CDM 00216 at Loop Campus
Instructor: Ahmed Abid | View syllabus

Spring 2024-2025

Section: 901
Class number: 33428
Meeting time: Tu 5:45PM - 9:00PM
Location: LEWIS 01005 at Loop Campus
Instructor: Ahmed Abid | View syllabus

Section: 910
Class number: 35165
Meeting time: -
Location: Online: Async
Instructor: Ahmed Abid | View syllabus

Winter 2024-2025

Section: 801
Class number: 23075
Meeting time: Tu 5:45PM - 9:00PM
Location: CDM 00224 at Loop Campus
Instructor: Ahmed Abid | View syllabus

CLOSED

Fall 2024-2025

Section: 701
Class number: 14949
Meeting time: Tu 5:45PM - 9:00PM
Location: LEWIS 01111 at Loop Campus
Instructor: Ahmed Abid | View syllabus

Spring 2023-2024

Section: 901
Class number: 35328
Meeting time: Th 5:45PM - 9:00PM
Location: CDM 00220 at Loop Campus
Instructor: Ahmed Abid | View syllabus

Winter 2023-2024

Section: 801
Class number: 23101
Meeting time: Tu 5:45PM - 9:00PM
Location: CDM 00224 at Loop Campus
Instructor: Tanu Malik | View syllabus

CLOSED

Fall 2023-2024

Section: 701
Class number: 13414
Meeting time: Tu 5:45PM - 9:00PM
Location: CDM 00228 at Loop Campus
Instructor: Tanu Malik | View syllabus

Winter 2022-2023

Section: 801
Class number: 29121
Meeting time: W 5:45PM - 9:00PM
Location: CDM 00218 at Loop Campus
Instructor: Alexander Rasin | View syllabus

Fall 2022-2023

Section: 701
Class number: 19144
Meeting time: Tu 5:45PM - 9:00PM
Location: CDM 00228 at Loop Campus
Instructor: Alexander Rasin | View syllabus

Spring 2021-2022

Section: 901
Class number: 37539
Meeting time: Tu 5:45PM - 9:00PM
Location: CDM 00226 at Loop Campus
Instructor: Alexander Rasin | View syllabus

Winter 2021-2022

Section: 801
Class number: 28843
Meeting time: W 5:45PM - 9:00PM
Location: CDM 00218 at Loop Campus
Instructor: Alexander Rasin | View syllabus

Previous Instructors

Course Info