Zur Seitennavigation oder mit Tastenkombination für den accesskey-Taste und Taste 1 
Zum Seiteninhalt oder mit Tastenkombination für den accesskey und Taste 2 
 
Aktuelles Semester: WiSe 2025/26

Blockveranstaltung: Data Science tools for the Life Sciences

Funktionen
  • Zur Zeit keine Belegung möglich
  • https://moodle.uni-greifswald.de/course/view.php?id=15617
Informationen

Grunddaten

Semester: SoSe 2023
SWS: 6
Sprache: Englisch
Max. Teilnehmer/-innen: 20
Belegungszeitraum:

Termine

Gruppe: - iCalendar Export für Outlook
  Tag Zeit Rhythmus Dauer Raum Raum-
plan
Lehrperson Bemerkung fällt aus am Max. Teilnehmer/-innen
iCalendar Export für Outlook -. 09:00 bis 17:00 Block 04.09.2023 bis
20.09.2023
Felix-Hausdorff-Straße 18 - Seminarraum 0.08 Scheuerlein     20
Einzeltermine
04.09.2023 | 05.09.2023 | 06.09.2023 | 07.09.2023 | 08.09.2023 | 11.09.2023 | 12.09.2023 | 13.09.2023 | 14.09.2023 | 15.09.2023 | 18.09.2023 | 19.09.2023 | 20.09.2023 |

Es gibt bereits 11 Anmeldungen / 11 davon zugelassen

Gruppe -:

Inhalt

Kommentar

Data Science Tools for the Life Sciences

 

 

General Information:

 

format:                       block course

date:                           Monday 04.09.2023, through Wednesday 20.09.2023

sign up until:             31.05.2023

time:                           Monday through Friday, 9:00 until 17:00

place:                         Seminarraum 0.08 (Multimedia-Raum), Rechenzentrum, Felix-Hausdorff- Strasse 18, 17489 Greifswald

teaching language: English

HIS ID:                       62567

Pass the course:      term paper that will be graded

 

moodle link:              https://moodle.uni-greifswald.de/course/view.php?id=15617

 

Responsible instructor: Dr. Alexander Scheuerlein, research assistant at the Institute of Data Science, University of Greifswald

 

 

 

Participants (max 20):

The course targets students in the Life sciences. Specifically, students of: 

-       Zoology 

  • (Bachelor students of Zoology receive credit points for a “Wahlspezial” module if they get written permission from Prof. Dr Steffen Harzsch before they sign up. See here for details of the process: https://moodle.uni-greifswald.de/course/view.php?id=11285

)

-       Botany

-       Human Biology

-       Environmental Natural Sciences

-       Environmental Sciences

-       Psychology

 

 

Description:

The course intends to provide students in life sciences expertise and adequate tools to identify data that are relevant for a research question, and analyze them in a meaningful way. For a reproducible environment, the course will utilize the open-source language package of R on Jupyterhub. 

 

Major emphasis of the course will be on data retrieval, data management, and data handling in R. 

 

  • Students will learn how to handle different data formats, and how to employ the appropriate tools to read these data into the R programming environment.

 

  • Students will learn how to perform exploratory data analysis, with an emphasis on data visualization based on the “tidyverse” collection of packages in R.

 

  • Students will be introduced to basic statistics, and how these can be applied in R. Starting with the basic concepts of statistics (distributions, means, variance, hypothesis testing, regression analysis), students will be introduced to the deployment of general linear models (glm). Particular emphasis will be laid on the underlying assumptions of the statistical models, and how to check whether these assumptions are met (residual analysis). Further concepts taught will be the use of model selection based on information criteria (AIC), multivariate approaches, generalized models (poisson, binomial), generalized additive models and models with random effects.  In the end, students will be able to run generalized additive models, interpret them, plot the results in a meaningful way, and understand when random effects are useful in model design.

 

  • Students will be introduced to the basic concept of machine learning and apply an example. Programming in this section will be done with Python. The main purpose of this section is to illustrate the broad applicability of machine learning for day-to-day routine tasks, such as cell counting or image analysis.

 

The course will end with a project: a research question should be conceived (or will be provided), and the relevant data should be retrieved (or will be provided). The students are expected to generate exploratory plots, and apply appropriate statistical tools to address the research question properly. To pass the course students have to produce term paper (as a jupyter notebook) which will be graded. 

Voraussetzungen

- Microsoft Office (Word, Excel)

- own PC / laptop,

- internetconnection

Lerninhalte

Description:

The course is intended to provide students of the life sciences with the expertise and the adequate tools to identify data that are relevant for a research question, and analyze them in a meaningful way. To provide students with a reproducible environment, the course will utilize the open-source language package of R on Jupyterhub.  

Major emphasis of the course will be put on data retrieval, data management, and the handling of data in R. Students will learn how to deal with various data formats, and how to employ the appropriate tools to read these data into the R programming environment.

In a further step students will learn how to perform exploratory data analysis, with an emphasis on data visualization. We will make major use of tools from the “tidyverse” collection of packages in R.

Next, students will be introduced to basic statistics, and how these can be applied in R. Starting with the basic concepts of statistics (distributions, means, variance, hypothesis testing, regression analysis), students will be introduced to the deployment of general linear models (glm). Particular emphasis will be laid on the assumptions that underlie these statistical models, and how to check whether these assumptions are met (residual analysis). Further concepts taught will be the use of model selection based on Infomation criteria (AIC), and the introduction of random effects in statistical analysis. In the end students should be able to run general linear models, be able to interpret them, and understand when random effects are useful in model design. 

Finally, the applicants of the course will be introduced to machine learning. Students will be introduced to the basic concept, and walk through their application in a worked example. Programming in this section will be done with Python. The main purpose of this section is to illustrate the wide applicability of machine learning for day-to-day routine tasks, such as cell counting or image analysis.

The course will end with a project: In the project a research question should be conceived (or will be provided), and the relevant data should be retrieved (or will be provided). The students are expected to generate meaningful exploratory plots, and apply appropriate statistical tools to address the research question being asked. A short written report in the form of a Jupyter notebook should be produced. 

Zielgruppe

The course is targeted at both bachelor and master students at the Math Nat Faculty, specifically, students of:

-       Zoology

-       Botany

-       Environmental Sciences (= Umweltwissenschaften)

-       Psychology

-       Human Biology

Moodle https://moodle.uni-greifswald.de/course/view.php?id=15617

Zugeordnete Person

Zugeordnete Person Zuständigkeit
Scheuerlein, Alexander, Dr. verantwortlich

Studiengänge

Abschluss Studiengang Studienphase PO-Version
Bachelor of Science Psychologie BSc. Bachelor 2020
Master of Science Psychologie MSc. Master 2017

Zuordnung zu Einrichtungen

© 2009-2025 Universität Greifswald
Knoten: hisqis-prod2