CS 498: End-to-End Data Science
January 17, 2024
Breaks / “work time” during weeks when MPs are due.
Broad coverage of the principles, tools, and products of data science. Throughout the course, students will build data products such as models, packages, dashboards, and APIs using real-world data for real-world applications. Emphasis will be given to open-source tools that exist in or connect with the Jupyter languages (Python, R, and Julia). Their applications, interactions, and tradeoffs will be discussed.
Figure 1: Gromit, an animated dog, laying train track in front of himself as he rides a train down the track he is creating, from Wallace and Gromit.
Figure 2: A humourous tweet from Josh Wills that “defines” a data scientist in a somewhat meaningless way.
This course is not…
Generally, we will do a “deep dive” into almost no topics!
Anyone interested in doing data science, or working with data scientists!
The specific target audience is computer science students who want to work as data scientists.
The course could also be useful for those who want to work as…
Generally the more experience and interest in data science, the better. Specifically, you should…
Machine learning knowledge at the level of CS 441 will be very helpful, but not necessarily required.
During the course you will…
As a final project, you will repeat one (or some combination) of these, with data of your choosing.
Figure 3: A name tag with the name Dave written on it, in all caps.
I’ve been an Illini for a while…
bbd
, an R package for accessing baseball data.Check your email later today! We’ll be sending out a survey to better collect data to understand who you are.
Also, don’t forget to respond to the “Introduce Yourself” thread on Ed.