Selective inference for testing trees and edges in hierarchical clustering and phylogeny

Seminar iTHEMS Biology Seminar

Date: December 9 (Thu) 10:00 - 11:00, 2021 (JST)
Speaker: Hidetoshi Shimodaira (Professor, Graduate School of Informatics, Kyoto University / Team Leader, Mathematical Statistics Team, RIKEN Center for Advanced Intelligence Project (AIP))
Venue: via Zoom
Language: English

Bootstrap resampling is quite useful for computing “confidence values” or “p-values” of trees and edges. However, they are biased and may lead to false positives (too many wrong discoveries) or false negatives (too few correct discoveries) depending on the “curvature” of the boundary surface of a hypothesis region in the data space. In addition, we face the issue of selection bias because we tend to use the dataset twice for hypothesis selection and its evaluation. I will explain these two types of bias and show methods to adjust the confidence values.

Reference

Hidetoshi Shimodaira and Yoshikazu Terada, Selective Inference for Testing Trees and Edges in Phylogenetics, Front. Ecol. Evol., 24 (2019), doi: 10.3389/fevo.2019.00174

Selective inference for testing trees and edges in hierarchical clustering and phylogeny

Reference

Related News

iTHEMS Biology Seminar by Prof. Hidetoshi Shimodaira on December 9, 2021