The aim of this course is to give you the basic knowledge on how to manipulate large datasets using R, and the visualization using the Grammar of Graphics (ggplot). This is a course from a natural scientist to natural science students. 

The course will have a theory / practical class, as well homework for you to practice. All datasets we will be working with, will be related to biology and medicine. The courses will be online, and because programming is in English, the main language will be English. However, you may ask questions in German. 

The course online sessions will be Fridays from 11:45 to 13:15, starting October 16th, 2020. Last course will be the 5th of February, 2021

In case of presentations: All will be in English: It will allow you a space to practice your English skills in a safe environment. The Class will be interactive, therefore you need to have minimum a microphone to be able to interact in class.

This course is not a statistics course neither a programming course. We will use principles of both in order to achieve the data manipulation and visualization, but all concepts needed will be explained in due course.

Pre-requirements:

  • This is an introductory course for data analysis using R. There is no previous knowledge required. You need only your curiosity, desire to learn, and the discipline needed to learn a new language.
  • Hardware: You will need a computer (laptop or desktop, with screen) with 64-bit architecture, enough memory for the installation and work with the datasets (8 Mb RAM should be enough), and a microphone (a webcam is optional). 
  • Software: You will need to install R (version 3.6 only!) and RStudio. Both are free accessible. For guidance on how to install, please wait for the information I will send you as soon as you have registered in the course. (Disclaimer: Although  I have not yet met anybody who has had problems with the installations, I or the University cannot be hold accountable for any damage caused in your computer during the software installation. The use of this software is your free choice and responsibility). Both programs are compatible with Linux, MacOS and Windows.
  • Since the course will take place online, you will need an internet connection with enough speed for online streaming.

The course will be divided in two sections: A theory/practice section which will take about the first half of the semester, and a personal data analysis project which we will develop together.

This is a voluntary course, therefore the course will be evaluated as Passed or not Passed (bestanden / nicht bestanden). At the end there will be a "Take Home" exam, which will allow you to be proud of how much you have learned about data manipulation and visualization during the semester.

I am looking forward to meet you and explore biological data using R.