Algorithmen der Bioinformatik

Bachelor Theses

Here is a listing of possible topics for projects. Please come and see us if you want to learn more about the topics. Further suggestions are welcome.


  • Adding non-circular splits to a phylogenetic outline - AVAILABLE
    • Phylogenetic outlines are a new type of phylogenetic network, based on circular splits. The aim here is to extend the algorithm to incorporate additional non-circular splits.
  • Implementation and evaluation of phylogenetic methods in Python - AVAILABLE
    • The aim of this and similar projects is to reimplement many of the algorithms provided by SplitsTree4 in Python and to compare the implementations.
  • Running DIAMOND+MEGAN in Docker in a production setting
    • DIAMOND+MEGAN are used by some bioinformatics service companies to process customers' data. The task is to study how this can be done more easily and efficiently using Docker.
  • Phylogenetic networks on SARS-CoV-2 genomes - Systematically investigate the application of phylogenetic tree and network methods - Yi Zhou
    • Survey and compare different approaches to computing phylogenetic trees on SARS-CoV-2 genomes. Look into the application of phylogenetic network methods.
  • Determining plant growth-promoting genes in incomplete genomes - AVAILABLE
  • Analysis of long-read metagenomic samples - Comparison of methods published in Arumugam et al. 2019 and EL Moss et al. 2020 - AVAILABLE
    • The aim is to apply the methods described in each paper to the data of the other, in order to compare the two approaches.
  • Text mining for PGPT traits in publicly available literature and comparison to the novel PGPT-Ontology - AVAILABLE
    • The project aims to improve and update the current collection of orthologous protein groups of plant growth-promoting traits (PGPTs) by developing an automatic approach that mines publications (e.g. via WhatIzIt, iTextMine) for newly announced PGPTs. In more detail, it will report on their taxonomic and functional assignments and on their experimental evidence.
  • Comparison of hybrid assemblers for short and long sequencing reads - AVAILABLE
    • Long-read sequencing is useful for generating genome assemblies because long reads resolve repetitive sequences better than short reads. However, long reads usually have a high per-base error rate. This led to the development of hybrid assembly approaches that use both short- and long-read data as input. The goal of the thesis is to test different hybrid assembly approaches on real data and to evaluate the assemblies with various metrics.
  • Implementation and improvement of MEGAN's taxonomic binning algorithms in Python - Tobias Lass
  • Implementation and improvement of MEGAN's functional binning algorithms in Python - Indira Echsler


