Courses

Please watch the Schedule of Classes to see when they are offered.

Courses & Descriptions

DATA 1301 — Introduction to Data Science
This course provides an introduction to the field of data science with a high-level overview of basic concepts, data types, and techniques while introducing data-informed decision making.

DATA 3401 — Python for Data Science 1
This is the first of a two-course sequence offering the foundations of Python programming in the context of data science. It introduces the full syntax of the Python language as it overviews structured, functional, and object-oriented programming methodologies. It also provides a basic conceptual understanding of computing and introduces Unix command-line tools, software employed in data science, such as git and Jupyter, and Python libraries such as numpy, matplotlib, and Pandas.

DATA 3402 — Python for Data Science 2
This is the second of a two-course sequence offering the foundations of Python programming in the context of data science. It reinforces concepts presented in DATA 3401 with greater depth and a focus on application to various problems in data science, while further exploring the python library ecosystem.

DATA 3421 Data Mining, Management, and Curation
This lecture and lab course will provide training in working with databases, including data mining techniques and principles and best practices in data management, storage, and curation. Prerequisite: DATA 3401, DATA 3402

DATA 3441 — Statistical Methods for Data Science 1
This lecture and lab course will provide an introduction to the fundamental building blocks of advanced data analysis, with emphasis on advanced linear algebra, optimization, statistical inference, and Monte Carlo methods. Working with real data sets will be emphasized, along with basics of R programming.

DATA 3442 — Statistical Methods for Data Science 2
This lecture and lab course will provide an introduction to the principles and general methods for the analysis of categorical data. This type of data occurs extensively in both observational and experimental studies, as well as industrial applications. While some theoretical statistical detail is given, the primary focus will be on methods of data analysis. Topics include generalized regression models, logistic regression models, Poisson regression models, and multinomial regression models. Problems will be motivated from a scientific perspective.

DATA 3461 — Machine Learning
This course introduces and surveys Machine Learning techniques and their application to various problems in data science.

DATA 4380 — Data Problems
This course is intended for Junior-level Data Science students, and will enable them to identify, define, and explore a number of potential problems and projects, for follow-up in the capstone course sequence.

DATA 4381 — Data Capstone Project 1
This is the first of a two-semester sequence that will involve deep engagement in a team or individual project in Data Science. Presentation of written and oral reports will be required.

DATA 4382 — Data Capstone Project 2
This is the second of a two-semester sequence that will involve deep engagement in a team or individual project in Data Science. Presentation of written and oral reports will be required.

ASDS 5301 — Statistical Theory and Applications
An introduction to statistical theory and applications using R/SAS software. Topics may include descriptive statistics, numeracy, statistical concepts in estimation and testing, basic principles of design of experiments, analysis of variance, analysis of covariance, and some nonparametric statistical methods. Prerequisite: MATH 3330.

ASDS 5302 — Principle of Data Science
An introduction to the end-to-end process of going from unstructured, messy data to knowledge and actionable insights. Provides a broad overview of what data science means and systems and tools commonly used for data science and illustrates the principles of data science through several case studies, including business, marketing, medical imaging, and biology, among others. Prerequisite: MATH 3330.

ASDS 5303 — Statistical and Scientific Computing
Statistical and scientific computing focusing on programming in Python or R. Concepts in statistical computing include Monte Carlo techniques, Jackknife, Bootstrapping, Cross-Validation, and others. Concepts in scientific computing include computational and randomized linear algebra, principal component analysis, and applications. Prerequisite: MATH 3330.

ASDS 5304 — Applied Multivariate Statistical Analysis
Statistical analysis for data collected in several variables. Topics include sampling from the multivariate normal distribution, multivariate analysis of variance, discriminant analysis, principal components, and factor analysis. Basic knowledge of programming in SAS or R is required. Prerequisites: ASDS 5301, ASDS 5302, ASDS 5303.

ASDS 5305 — Deep Learning and Artificial Neural Networks
Introduction to artificial neural networks and deep learning. Topics include DL basics, history, and introduction to Pytorch; Perceptrons and their inspiration from neuroscience; Gradients, AutoGrad, designing deep linear systems; Training and testing neural networks, backpropagation, stochastic gradient descent; Applying Optimization and regularization to neural nets; Convolutional Neural Networks (ConvNets/CNNs); Recurrent Neural Networks (RNNs); and Modern Networks and their applications. Prerequisites: ASDS 5301ASDS 5302ASDS 5303.

ASDS 5306 — Applied Time Series Analysis in Data Analytics
An introduction to the theory and applications of time series modeling with an emphasis on modeling and forecasting using the software. Topics include stationarity and autocorrelation, autoregressive, moving average, ARMA and ARIMA; forecasting and estimation; spectral analysis. Computational implementation in R. Basic programming skills is preferred. Prerequisite: ASDS 5301ASDS 5302ASDS 5303.

ASDS 6301 — Advance Regression Analysis
Topics include multiple linear regression, ordinary and generalized least squares, partial and multiple correlation, regression diagnostics, collinearity, model building, and nonlinear regression. The course provides an extended introduction to the computer package widely used for statistical analysis. Basic knowledge of programming is required. Prerequisite: MATH 3330.

ASDS 6302 — Machine Learning with Applications
Topics include but are not limited to supervised learning methods: linear model, generalized linear model, logistic regression, linear discriminant analysis (LDA), quadratic discriminant analysis (QDA), nearest neighbor classifier, support vector machines, tree-based methods (decision tree, random forest, XGBoost), and neural networks; and unsupervised learning methods: clustering, principal component analysis, and independent component analysis. The course provides an extended introduction to tools widely used for statistical machine learning. Basic programming skills are preferred. Prerequisite: MATH 3330.

ASDS 6303 — Data Mining with Information Visualization
Introduction to statistical pattern recognition. The main topics include Bayes decision theory, discriminant functions, maximum likelihood estimation, PCA, LDA, semi-supervised kernel learning, and graph embedding. This course will discuss some applications of data mining in different application fields, such as business, marketing, medical imaging, biology. Prerequisite: MATH 3330.

ASDS 6304 — Optimization and Big Data Analytics
Topics include an introduction to big data analysis, real-world applications of data science, linear system solutions, linear programming, duality theory, convex sets, convex functions, optimality conditions, unconstrained Optimization, constraint optimization, conjugate direction methods, alternating direction method of multipliers, classification/regression models and algorithms, dimensionality reduction for visualization and projects on real data. Prerequisites: ASDS 5301ASDS 5302ASDS 5303.

ASDS 6305 — Statistical and Scientific Computing II
Advanced topics in statistical and scientific computing, emphasizing complex data analysis, such as high-dimensional and functional data. Topics include Expectation-Maximization (EM), Stochastic and Monte Carlo EM, Metropolis-Hastings algorithm, Gibbs sampling, functional principal component analysis, tensors, and tensor decompositions. Prerequisites: ASDS 5301, ASDS 5302, ASDS 5303.

ASDS 6306 — Internship/Capstone Research Project
The capstone project aims to give students a "hands-on" experience in analyzing interesting datasets with cutting-edge techniques of their interest. While students are encouraged to get feedback and mentoring from faculty, they should work as independently as possible with their teams. Progress will be monitored by completing the following quarterly milestones: (1) Acquisition of dataset, data pre-processing, and clear statement/justification of the proposed analysis. (2) Preliminary results and discussion of the plausibility of the results, (3) Refinement of codes, analysis, and results. (4) Submission of a written 5+ page report including introduction, background, methods, results, discussion, and conclusion. (5) Final oral presentation chaired by an ASDS faculty who read the written report. The project team members are expected to answer project-related questions from faculty and peers satisfactorily. Prerequisites: ASDS 5301ASDS 5302ASDS 5303ASDS 6301ASDS 6302ASDS 6303.