The sample complexity of species tree estimation: How many genes does it take to infer a species tree?
- Date
- February 19 (Thu) 13:00 - 14:00, 2026 (JST)
- Speaker
-
- Max Hill (Assistant Professor, University of Hawaiʻi, USA)
- Venue
- Seminar Room #359 (Main Venue)
- via Zoom
- Language
- English
- Host
- Sungsik Kong
In this talk, I will discuss the problem of inferring an evolutionary tree from DNA sequence data. The main focus will be on the sample complexity of this problem---i.e., the question of how much data is required to achieve high probability of correct inference. After introducing a standard stochastic model of gene and DNA evolution, I will highlight some surprising features of DNA sequence data that complicate inference. Finally, I will present an impossibility result which takes the form of an information-theoretic lower bound on the minimum amount of data needed for accurate inference when genes exhibit variation in mutation rates. No prior knowledge of phylogenetics or information theory is assumed. Based on joint work with Sebastien Roch.
This is a closed event for scientists. Non-scientists are not allowed to attend. If you are not a member or related person and would like to attend, please contact us using the inquiry form. Please note that the event organizer or speaker must authorize your request to attend.