December 9 at 10:00 - 11:00, 2021 (JST)
Prof. Hidetoshi Shimodaira (Professor, Graduate School of Informatics, Kyoto University / Team Leader, Mathematical Statistics Team, RIKEN Center for Advanced Intelligence Project (AIP)) Edit
via Zoom

Bootstrap resampling is quite useful for computing “confidence values” or “p-values” of trees and edges. However, they are biased and may lead to false positives (too many wrong discoveries) or false negatives (too few correct discoveries) depending on the “curvature” of the boundary surface of a hypothesis region in the data space. In addition, we face the issue of selection bias because we tend to use the dataset twice for hypothesis selection and its evaluation. I will explain these two types of bias and show methods to adjust the confidence values.


  1. Hidetoshi Shimodaira and Yoshikazu Terada, Selective Inference for Testing Trees and Edges in Phylogenetics, Front. Ecol. Evol., 24 (2019), doi: 10.3389/fevo.2019.00174