Please check the Schedule of Classes to see when these courses are offered.
Courses & Descriptions
DATA 1301 — Introduction to Data Science
This course provides an introduction to the field of data science with a high-level overview of basic concepts, data types, and techniques while introducing data-informed decision making.
DATA 3401 — Python for Data Science 1
This is the first of a two-course sequence offering the foundations of Python programming in the context of data science. It introduces the full syntax of the Python language while giving an overview of structured, functional, and object-oriented programming methodologies. It also provides a basic conceptual understanding of computing and introduces Unix command-line tools, software used in data science such as Git and Jupyter, and Python libraries such as NumPy, Matplotlib, and pandas.
DATA 3402 — Python for Data Science 2
This is the second of a two-course sequence offering the foundations of Python programming in the context of data science. It reinforces concepts presented in DATA 3401 with greater depth and a focus on application to various problems in data science, while further exploring the Python library ecosystem.
DATA 3421 — Data Mining, Management, and Curation
This lecture and lab course will provide training in working with databases, including data mining techniques and the principles and best practices of data management, storage, and curation. Prerequisites: DATA 3401 and DATA 3402.
DATA 3441 — Statistical Methods for Data Science 1
This lecture and lab course will provide an introduction to the fundamental building blocks of advanced data analysis, with emphasis on advanced linear algebra, optimization, statistical inference, and Monte Carlo methods. Working with real data sets will be emphasized, along with the basics of R programming.
DATA 3442 — Statistical Methods for Data Science 2
This lecture and lab course will provide an introduction to the principles and general methods for the analysis of categorical data. This type of data occurs extensively in both observational and experimental studies, as well as industrial applications. While some theoretical statistical detail is given, the primary focus will be on methods of data analysis. Topics include generalized regression models, logistic regression models, Poisson regression models, and multinomial regression models. Problems will be motivated from a scientific perspective.
DATA 3461 — Machine Learning
This course introduces and surveys Machine Learning techniques and their application to various problems in data science.
DATA 4380 — Data Problems
This course is intended for junior-level Data Science students and will enable them to identify, define, and explore a number of potential problems and projects for follow-up in the capstone course sequence.
DATA 4381 — Data Capstone Project 1
This is the first of a two-semester sequence that will involve deep engagement in a team or individual project in Data Science. Presentation of written and oral reports will be required.
DATA 4382 — Data Capstone Project 2
This is the second of a two-semester sequence that will involve deep engagement in a team or individual project in Data Science. Presentation of written and oral reports will be required.