Hold on just a sec...
3 credits
Fall 2025 Lecture Upper DivisionThis course provides a thorough introduction to the R programming language, and its use for statistical computing and data science. The course will first look at the fundamentals of R, including different data-structures, control-flow, and the basic vocabulary. An emphasis will be placed on learning idiomatic and efficient R, covering ideas such as recycling, vectorization and functional programming. The course will then look at principles and tools for tasks like organizing data ('tidy data'), manipulating data ('data carpentry'), querying data (through topics like regular expressions) as well as visualizing data (including interactive visualizations). The material and the homework will encourage development of modular reusable code and reproducible research through ideas such as object-oriented programming and dynamic documents in R Markdown. The last part of the course will study statistical procedures such as least-squares regression, LASSO, Monte Carlo sampling and Markov chain Monte Carlo. Besides exams and homework, the course will involve a final project that students can collaborate together on.
Learning Outcomes1Read and write idiomatic R code.
2Be familiar with the entire data science pipeline, from organizing, manipulating, querying and visualizing data.
3Introduce computational tools like optimization and simulation in the context of statistical algorithms.