2025

Data Science and Big Data

Name: Data Science and Big Data
Code: INF14381L
6 ECTS
Duration: 15 weeks/156 hours
Scientific Area: Informatics

Teaching languages: Portuguese
Languages of tutoring support: Portuguese
Regime de Frequência: Presencial

Sustainable Development Goals

Learning Goals

The course addresses the topic of distributed computation and data analytics, and aims to teach the fundamentals of Big Data and its computational challenges, and also provide students with the ability to develop computing solutions on large volumes of data using analytical methods of Data Science.

Contents

Big Data: fundamentals and related technologies

Data analytics methods

Distributed repositories

Hadoop data storage

Methodological challenges in large data collections

MapReduce approach

Stream processing

Machine learning for Big Data

Stream analytics

Real-time monitoring and visualization

Teaching Methods

The teaching methodology includes two types of classes:
• lectures
• laboratory classes

The lectures present the course contents, along with some case studies.
The laboratory classes are dedicated to learning the development methods and technologies to build an analytics solution over large data collection.

Assessment

Continuous assessment: 2 tests (30% each) + 2 practical group assignments (20% + 20%)
Final assessment: 1 exam (60%) + 2 group practical assignments (20% + 20%)

Teaching Staff