Skip to content

Key Features of the Project:

Students choose and propose topics

  • You may work by yourself in a team of one
  • But working in small groups of two, three, four, five, ... is encouraged and preferred
  • There is no perfect group size but three or four worked well for many teams
  • We allow single-person teams but recommend teams of three or four

Everything on GitHub

  • You should also use git (via your project's GitHub repository) for your intermediate steps
  • i.e. do not just upload finished work but all steps

Submissions must 'run;

  • Your end result is code that must do something sensible with data
  • Possible tasks for data are gathering, cleaning, aggregating, modeling, estimating, visualizing, summarizing ... and any combination thereof

Write-up in Markdown

  • Which should rendered nicely at GitHub, inclusion of figures or charts for visualisation is encouraged
  • You can choose html or pdf as the final format (note that pdf requires latex at your end)
  • No powerpoint, no keynote

Final Deliverables

  • The actual project plan and idea manifested in actual code
  • The actual project description in a write-up in markdown, i.e. a short paper
  • The actual project presentation recorded via Zoom or alike (more details forthcoming)

May contain multiple course components:

  • Pick from shell, sql, markdown, R, ...
  • But your work should focus and center around R and different R packages
  • "anything that runs on 'morrow', our RStudio Server", goes
  • Please avoid technologies we cannot deploy there
  • i.e. GPU-heavy deep learners may be out; cpu-based ones may work if installable
  • so in a nutshell if you can install it from CRAN and deploy it on RStudio, you can use it