AY 128/256: Astronomy Data Science Lab (Spring 2024)

Synopsis

This course consists of three data-centric laboratory experiments that draw on a variety of tools used by professional astronomers. Students will learn to procure and clean data (drawn from a variety of world-class astronomical facilities), assess the fidelity/quality of data, build and apply models to describe data, learn statistical and computational techniques to analyze data (e.g., Bayesian inference, machine learning, parallel computing), and effectively communicate data and associated scientific results.  This class will make use of data from facilities such as Kepler, Gaia, the Sloan Digital Sky Survey, and the Hubble Space Telescope to explore the structure and composition of the Milky Way, stars, and galaxies throughout the local and distant Universe. There is a heavy emphasis software development in the Python language, statistical techniques, and high-quality communication (e.g., written reports, oral presentations, and data visualization).

This course satisfies the Data Science major requirement for “Computational & Inferential Depth”. Read more about the upper division classes for the Data Science BA. This course also satisfies the Laboratory requirement for the Astronomy Major.

Instructors

  • Dan Weisz (dan.weisz@berkeley.edu; 311 Campbell Hall)
  • Anna Pusack (GSI, anna_pusack@berkeley.edu)
  • Olivia Aspegren (GSI, oliviaaspegren@berkeley.edu)

Course Aims

  • Introduce and motivate a range of analysis techniques and data pipelining
  • Gain practical, in-depth experience doing inference on real, open-ended modern astronomical challenges
  • Build reproducible, well-tested, well-documented software & infrastructure
  • Learn to work with open data and code, and in an open science environment
  • Hone presentation (speaking & visualization) skills
  • Develop skills for future in academia, industry, …

Course Format

  • 4 credits
  • 2 weekly 1.5 hour meetings
  • “Show & tell” progress reports + instructor lecture
  • 3.5 labs
  • Will require a fair amount of dedicated coding time

Lab Schedule:

Lectures

  • Tuesdays and Thursdays 2-3:30pm in 131 Campbell

Office Hours

  • Dan: Tuesday 3:30-4:20pm, 355 Campbell
  • Anna/Olivia: Mon 2:30-3:30pm, 355 Campbell
  • and by appointment
  • TALC: Weds 2-4pm 131 Campbell

Grading

  • 10%: Class Participation – Active engagement in class discussion and lecture, participation during “show and tell”, attending office hours, Q&A engagement on Ed Discussion (invite link)

  • 65%: Lab Reports/Notebooks – due before specified class, -10% for each day late, you can collaborate with people in the class, but all work, writeups, notebooks, coding, plots, etc. MUST be your own. We’ll drop your 2 lowest checkpoint scores.

  • 25%: Lab Checkpoints – weekly progress goals for each lab; graded mostly for completeness.

Prereqs

  • There are no formal prereqs for this class but there are a number of informal requirements, outlined in this document, which you must satisfy.