Introduction to Data Science is a three-credit course, offered in the Spring 2018 semester at the University of Utah, cross-listed between Mathematics (MATH 4100) and Computing (COMP 5360).
The amount and complexity of information produced in science, engineering, business, and everyday human activity are increasing at a staggering rate. The goal of this course is to expose you to methods and techniques for analyzing and understanding complex data. Data Science lies at the intersection of statistics, computer science, and, of course, the domain from which the data comes from. This course will provide an introduction to the former two: statistics and computer science and provide you with a toolset to conquer problems in your domain!
The course begins by bootstrapping your coding skills (we will be using Python), and will move through a series of data science methods via real-life, project-based, lectures and computer labs. The goal of this course is to develop your skills in:
- data wrangling: how to acquire, clean, reshape, or sample data so that it’s ready for further processing?
- data exploration and analysis: how to analyze the signal in a large, noisy dataset?
- prediction: can inferences and decisions be made based on the available data?
- communication: how can findings be effectively communicated to others?
A more comprehensive description of the course material, including a list of projects, can be found in the course syllabus.
The class meets Tuesday/Thursday 10:45am-12:05pm, in WEB 2250.
Lecture Material: Jupyter notebooks and other lecture materials are posted to this github repository.
Homework Assignments: Homework assignemnts are posted on github.
Online discussion: We use a Slack Team for announcements and discussions. Sign up with your utah.edu e-mail address.
Grades / Submissions: We use Canvas for homework submissions, grades, and to post solutions.
Kiran Gadhave, Computer Science, firstname.lastname@example.org
Shuvrajit Mukherjee, Computer Science, email@example.com
RK Yoon, Mathematics, firstname.lastname@example.org
Is this course for me?
This course is designed for both undergraduates and graduates from various fields. So, if you are a student in, for example, Biology, Chemistry, Social Sciences, Psychology, Business, etc., and want to learn some computational and quantitative skills to be successful in your field as it transforms in an age increasingly dominated by data and computation, this course might be for you!
This is an introductory course: we will introduce both programming concepts and the necessary statistics, yet we do expect you to have at least a little background in programming and math. The formal prerequisite for this course is Calculus I (UU Math 1170, 1210, 1250, 1310, 1311 or equivalent). In addition, we expect that you have some basic programming fluency. If you don’t know what a for-loop or a function is, for example, you first might want to take a course like CS 1060.
Can I get quantitative reasoning or elective credit for this class?
Yes, this class counts as Quantitative Intensive (QI) credit towards a bachelor of science degree.
This class will satisfy the computing requirements for applied math students.
Can I take this class for credit as a graduate student?
The COMP 5360 course number enables most graduate students to take this course for credit towards their degree, as many graduate programs allow 5000 level courses from outside the degree’s home department. CS students (undergrad or graduate) cannot take this course.