Statistical Data Science 2: Spring 2025

Course website: https://ahoriguchi.github.io/teaching/STA35B/website.html

Syllabus

Course Overview

  • This course is the second of a three-course series on statistical data science; see course description.
  • Many thanks to Spencer Frei; much of the material in this syllabus is taken from his syllabus.

Course Logistics

  • This course webpage will host lecture notes, homework, supplementary materials, etc.
  • Canvas will be used for lab materials and for turning in labs and homework. Solutions will be posted on Canvas.
  • Piazza will be used for discussion.

Pre-requisites

This is a course on advanced programming and data manipulation in the R programming language. The best preparation for this course is STA 035A. If you have no experience with programming in any language, this will be a difficult course, and you will need to expend significant time into the first few weeks to familiarize yourself with R.

The more formal requirements are below. These requirements are strict.

Prerequisite(s): one of the following two options:

  • STA 013 (C- or better) or STA 013Y (C- or better) or ECS 032A (C- or better)
  • STA 035A (C- or better) or STA 032 (C- or better) or STA 100 (C- or better)

Pre- or co-requisite (i.e., can be concurrent):

  • MAT 016B or MAT 017B or MAT 021B.

R and RStudio

All computing work in this class will be done using R and RStudio.

  • R is a free, open-source programming language for statistical computing.
  • RStudio is a free, open-source R programming environment. It contains a built-in code editor, many features to make working with R easier, and works the same way across different operating systems.

You will use RStudio for homework, labs and exams, so a working version of RStudio is required. You can choose to download it on your personal computer, or use UC Davis JupyterHub. You will need regular, reliable access to a computer either running an up-to-date version of R and RStudio, or with a working browser (for the JupyterHub option). If this is a problem, please let us know right away. There are resources available to support you. Some are listed here.

The room that labs are held in have computers with RStudio installed, and you may choose to use them. If you are using your own laptop, please make sure that it is charged before class.

Evaluation

The grade breakdown is:

  • 5% labs
  • 5% homework
  • 60% midterms
  • 30% final

Cutoffs for letter grades are:

  • A+: 95% or higher
  • A: 85% or higher
  • A-: 80% or higher
  • B+: 75% or higher
  • B: 65% or higher
  • B-: 60% or higher
  • C+: 55% or higher
  • C: 45% or higher
  • C-: 40% or higher
  • D+: 35% or higher
  • D: 25% or higher
  • D-: 20% or higher
  • F: lower than 20%

These thresholds may be adjusted at the end of the semester in a way that improves your grade for the course.

Grade disputes and adjustments: Students have 72 hours after receiving a grade on any assignment to contest it. Grading is consistent and we will provide detailed rubrics. If you think you deserve a different grade, prepare a strong argument and submit it by email to the TA.

Labs

  • Labs are typically due the Friday after the lab session, at 9 PM, and will be turned in via Gradescope (accessible through Canvas). Always check the course homepage and Gradescope/Canvas to ensure the correct deadline.
  • Labs will be completed in Quarto format (file extension qmd). Labs will involve writing a combination of code and written prose, and the Quarto format allows for a combination of the two. Labs must be submitted only in PDF format, the result of calling “Knit PDF” from RStudio on your Quarto document. Work submitted in any other format will receive a grade of 0, without exception. All code used to produce your results must be shown in your PDF file (e.g., do not use `echo = FALSE` or `include = FALSE` as options anywhere). qmd files do not need to be submitted.
  • Labs will be graded for completeness; it will be up to you to check the solutions once they are released.
  • No late labs will be accepted for any reason.
  • Students may choose to collaborate with each other on the labs and homework, but must clearly indicate the names of all students with whom they collaborated.

Homework

  • Typically released on Mondays, due the following Monday at 9 PM. Always check Gradescope/Canvas to ensure the correct due date.
  • The same policies that apply to labs will apply to homework. Homework may contain other non-coding components, and these can be typed (use any software you are comfortable with), or written and scanned.
  • Submit homework as PDF.
  • Homework will be graded for completeness; it will be up to you to check the solutions once they are released.
  • No late submissions accepted.
  • Students may choose to collaborate with each other on the labs and homework, but must clearly indicate the names of all students with whom they collaborated.

Midterms and Final

  • There will be four midterms and one final. The midterms will be in class, during the scheduled class times.
  • The lower score of the four midterms will be dropped. There will be no make-up exams. If you must miss an exam due to illness, travel, or some other reason, this will be the exam that will be dropped.
  • For the final, if you have another final starting 30 minutes before or after the scheduled time, you may present documentation and request for an accommodation to start 15 minutes before or after the scheduled time.

Late Work

All labs and homework will be due at 9 PM Pacific Time, on the relevant due date. No late work will be accepted for any reason. It is highly recommended that you begin working on your lab/homework as soon as it is released.

Attendance and Participation

Class attendance is strongly encouraged. Please be on time. If you miss a lecture for any reason, you are responsible for all material covered and any announcements made in your absence. Active participation is encouraged, both in class and on Piazza. Cell phones, laptops, and other electronic devices must be silenced in class. Laptops are to be used in class for learning purposes related to the lecture only.

Collaboration, Copying, and Plagiarism

All students are expected to follow the UCD Code of Academic Conduct. Any student who cheats on an assignment or exam will be referred to the Office of Student Support and Judicial Affairs and will receive an automatic failing grade on the relevant assignment. A second instance of academic dishonesty will result in a failing grade in the course. More information on the nature of dishonest academic behavior or UCD policy can be found on the website of the Office of Student Support and Judicial Affairs.

Collaboration is encouraged and students are encouraged to discuss course material with classmates. All work that is turned in, however, must be your own. If students have collaborated on labs or homework, the names of all students working together must be clearly indicated.

Please do not distribute any course materials outside of this class. This is an infringement of copyright as per UC policy. Use of sites like Course Hero and Chegg are not permitted.

Getting help

Labs, office hours and Piazza

You can access Piazza by clicking on the “Piazza” link on the sidebar in Canvas, or directly through this link. Students are encouraged to answer each others’ questions, and the TA will moderate by checking in every day. The quickest way to get a question answered is likely on Piazza, since anyone in class can answer. These are the rules for posting on Piazza:

  • Be respectful. Any content deemed inappropriate will be taken down by the TA, and reported to the instructor.
  • Search before you post; your question may have already been answered.
  • Posting code that is part of your solutions for labs and homework, and asking “what is wrong with this code?” is not acceptable. “I don’t know either” is not an appropriate answer as it does not contribute constructively to the conversation. Along with your posted question, explain what else you tried that didn’t work. The answers to many common coding questions can be found on https://stackoverflow.com/.

If you have a question that requires more than a short paragraph to answer, labs and office hours are the best options.

Email

Email will be used only for questions relating to private matters (accommodations, grading, emergencies, etc.). Questions about class logistics and content should be posted on Piazza, asked in class, during labs, or during office hours. If you must ask a question about a non-private matter via email, you must first document how you tried to answer the question for yourself or through other means (e.g., “I double checked the syllabus,” “There are conflicting responses on Piazza” …). Emails that do not follow these guidelines may not be answered.

Please do not send me messages on Canvas. I do not monitor my Canvas inbox.

Other campus resources

Statistics Tutors at the Academic Assistance and Tutoring Centers provide support for RStudio. More information is available here.

Many students face different challenges during college, and it is healthy to seek support. This is a comprehensive list of resources covering general academics, health and wellness, finances, housing, career/internship, and other topics.

Health and wellness resources are available here. If you have an emergency, call 911 immediately, or go to the nearest emergency room. Mental health staff are available 24 hours/7 days week by phone at 530-752-2349. (Follow the prompts to reach a counselor.)

Accommodations for students with disabilities

UC Davis is committed to educational equity in the academic setting, and in serving a diverse student body. All students who are interested in learning more about the Student Disability Center (SDC) are encouraged to contact them directly at https://sdc.ucdavis.edu, sdc@ucdavis.edu, or 530-752-3184. If you are a student who requires academic accommodations, please submit your SDC Letter of Accommodation to us as soon as possible, ideally within the first two weeks of this course.