Statistics Experience

Important

This assignment is due on Tuesday, November 26 at 11:59pm on Gradescope.

The world of statistics and data science is vast and continually growing! The goal of the statistics experience assignments is to help you engage with the statistics and data science communities outside of the classroom.

You may submit the statistics experience assignment anytime between now and the deadline.

Each experience has two parts:

1️⃣ Have a statistics experience.

2️⃣ Make a slide reflecting on your experience.

You must complete both parts to receive credit. The statistics experience will count as a homework grade.

Part 1: Experience statistics outside of the classroom

Complete an activity in one of the categories below. Under each category are suggested activities. You do not have to do one these suggested activities. You are welcome to find other activities as long as they are related to statistics/data science and they fit in one of the six categories. If there is an activity you’d like to do but you’re not sure if it qualifies for the statistics experience, just ask!

Category 1: Attend a talk or conference

Attend an talk, panel, or conference related to statistics or data science. If you are attending a single talk or panel, it must be at least 30 minutes to count towards the statistics experience. The event can be in-person or online.

Category 2: Talk with a statistician/ data scientist

Talk with someone who uses statistics in their daily work. This could include a professor, professional in industry, graduate student, etc.

Category 3: Listen to a podcast / watch video

Listen to a podcast or watch a video about statistics and data science. The podcast or video must be at least 30 minutes to count towards the statistics experience. A few suggestions are below:

This list is not exhaustive. You may listen to other podcasts or watch other statistics/data science videos not included on this list. Ask Professor Tackett if you are unsure whether a particular podcast or video will count towards the statistics experience.

Category 4: Participate in a data science competition or challenge

Participate in a statistics or data science competition. You can participate individually or with a team.

Category 5: Read a book on statistics/data science

There are a lot of books about statistics, data science, and related topics. A few suggestions are below. If you decide to read a book that isn’t on this list, ask Professor Tackett to make sure it counts toward the experience. Many of these books are available through Duke library.

  • Weapons of Math Destruction by Cathy O’Neil
  • How Charts Lie: Getting Smarter about Visual Information by Alberto Cairo
  • The Theory that Would Not Die by Sharon Bertsch McGrayne
  • The Art of Statistics: How to learn from data by David Spiegelhalter
  • The Signal and the Noise: Why so many predictions fail - but some don’t by Nate Silver
  • List of books about data science ethics

This list is not exhaustive.

Category 6: TidyTuesday

You may also participate in a TidyTuesday challenge. New data sets are announced on Monday afternoons.You can find more information about TidyTuesday and see the data in the TidyTuesday GitHub repo.

A few guidelines:

✅ Create a GitHub repo for your TidyTuesday submission. Your repo should include

  • The R Markdown file with all the code needed to reproduce your visualization.
  • A README that includes an image of your final visualization and a short summary (~ 1 paragraph) about your visualization.

✅ The visualization should include features or customization that are beyond what we’ve done in class .

✅ Include the link to your GitHub repo in the slide summarizing your experience.

Category 7: CURV - connecting, uplifting, and recognizing voices

CURV is a project by Dr. Jo Hardin at Pomona College to highlight statisticians and data scientists from groups who have been historically marginalized in the discipline.

For this statistics experience, you can contribute to the CURV data base. If there is a scholar you would like to suggest for the data base, submit your suggestion as an issue or pull request on the CURV GitHub repo and create a sample CURV page.

A few guidelines:

✅ Create a draft of the CURV page for your suggested scholar. For reference, click here for the CURV page for W.E.B. Du Bois. The page must be created in a Quarto document.

Tip

You can find the Quarto documents for current scholars in the data base in the CURV GitHub repo. You can use one of these as a template to format your page.

✅ Make a pull request to the CURV GitHub repo to add the .qmd file for your suggested scholar, OR open an issue with a link to the .qmd file for your suggested scholar. You can ask a member of the teaching team if you have questions about how to do this.

✅ Include the URL to your pull request or issue in your one-slide reflection.

Part 2: Reflect on your experience

Make one slide summarizing and reflecting on your experience. Submit the slide as a PDF on Gradescope.

Include the following on your slide:

  • Description of the experience
    • Name and brief description of the event/podcast/competition/etc.
  • Something you learned
    • Write 2 - 4 sentences about something you learned or found particularly interesting or unexpected.
  • Connection to STA 221
    • Write 2 - 4 sentences about how the experience connects to what we’ve done in the course.
  • Citation or link to web page for event/competition/etc.
    • No citation needed if you do an interview.

Make sure the slide includes the information mentioned above and is easily readable (i.e. use a reasonable font size!). Creativity on the experience and slide design is encouraged!

Submission

Submit the reflection as a PDF under the Statistics Experience assignment on Gradescope by Tuesday, November 26 at 11:59pm. Standard homework late policy applies.