This is a mathematics course exploring foundational and theoretical concepts underlying the development and applications of intelligent systems and deep learning
algorithms. One major emphasis of the course is the connection between topics from classical and advanced signal processing on the one hand and deep neural networks on the other.
For instance, convolution operators underpin the design and development of convolutional neural networks; multiresolution analysis underlies several neural network designs such as the
Inception module; manifold learning and sparse approximations provide powerful theoretical tools for the analysis and interpretation of deep learning architectures.
Topics of the course include: Fourier transform and convolution, multiresolution analysis, sparse approximations, manifold learning, statistical learning theory,
dimensionality reduction and spectral clustering, convolutional neural networks.
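As a brief illustration of the connection between the first topic above and convolutional neural networks, a discrete 1D convolution can be written directly in a few lines of Python. This is a minimal sketch for intuition only (in practice one uses optimized implementations such as PyTorch's `torch.nn.Conv1d`):

```python
def conv1d(x, h):
    """Discrete 1D convolution in "valid" mode:
    y[n] = sum_k h[k] * x[n + len(h) - 1 - k].

    This sliding-window operation is the same one a convolutional
    layer applies to its input, with h playing the role of a
    learned filter.
    """
    m = len(h)
    return [sum(h[k] * x[n + m - 1 - k] for k in range(m))
            for n in range(len(x) - m + 1)]

# A moving-sum filter and a difference (edge-detecting) filter:
print(conv1d([1, 2, 3, 4], [1, 1]))    # [3, 5, 7]
print(conv1d([1, 2, 3, 4], [1, -1]))   # [1, 1, 1]
```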
This class is targeted at graduate students interested in mastering theoretical tools underlying machine learning and data science. Although algorithmic aspects of the topics will not be ignored, and exploration of algorithmic issues will be assigned as
individual or group projects, this course will not duplicate existing courses on machine learning or data science offered in
the Computer Science Department, which focus on algorithmic implementation and computation.
Prerequisite
Students attending this course are expected to have a solid background in linear algebra, undergraduate real analysis (MATH 4331-4332) and basic probability.
Student evaluation is based on two assignments: (i) presenting selected material from the book Deep Learning with PyTorch (see below) and (ii) a final project.
(i) Every week, one or two students will be in charge of presenting in class selected material from the book Deep Learning with PyTorch.
(ii) The final project will be based on a critical reading of one (or possibly more than one) research paper. I recommend forming teams of two students for this project.
The topic of the final project needs to be relevant to the class and needs to receive my prior approval. Students need to form a team and submit a proposal for their project
by FRI March 19. My detailed instructions for the proposal and final project are given here.
Delivery of the final project and a brief in-class presentation are tentatively scheduled between April 23 and 29. Presentation instructions, the presentation list, and the calendar are here.
This course brings together mathematical tools not usually presented in a single course, for the purpose of solving problems arising in different fields related to the analysis of data sets.
I will be selecting material from several sources:
1. The Mathematics of Signal Processing by Damelin and Miller, Cambridge University Press ISBN-13: 978-1107601048.
This is a mathematically rigorous book covering topics from advanced and modern signal processing that are useful to practitioners in data-driven fields such as imaging and time series analysis.
2. Foundations of Data Science, by Blum, Hopcroft, and Kannan, available free online at
https://www.cs.cornell.edu/jeh/book2016June9.pdf.
It includes material on the Curse of Dimensionality and various topics in machine learning.
3. The Elements of Statistical Learning by Hastie, Tibshirani and Friedman, Springer 2017. The authors have made this book freely available on the website
https://web.stanford.edu/~hastie/ElemStatLearn/printings/ESLII_print12_toc.pdf
This classical treatise covers a broad range of topics in statistical learning theory and neural networks.
4. Deep Learning with PyTorch by Stevens, Antiga and Viehmann. The authors have made this book freely available on the website
https://pytorch.org/assets/deep-learning/Deep-Learning-with-PyTorch.pdf
It is a practical manual for implementing deep learning algorithms in PyTorch, intended for those students more interested in the numerical/applied side.
5. Additional notes and reference papers will be provided by the instructor.