12–17 Jul 2026
University of Graz
Europe/Vienna timezone

Ladderpath: A Compression-Based Hierarchical Framework for Biological Sequence Analysis

16 Jul 2026, 18:30
2h
University of Graz

University of Graz

Poster Numerical, Computational, and Data-Driven Methods Poster Presentations

Speaker

Yu Liu (Beijing Normal University)

Description

Ladderpath is a compression-based framework, grounded in Algorithmic Information Theory, for extracting repeated, nested, and hierarchically organized structure from symbolic sequences. Rather than treating biological sequences as flat strings, it reconstructs reusable building blocks and their dependency relations, yielding interpretable representations together with quantitative measures of complexity, hierarchical reuse, and sequence distance.

This poster presents Ladderpath as a unified mathematical and computational framework for biological sequence analysis. I introduce its core formalism, including ladderons, laddergraphs, and indices of repetition and compressibility, and show how the same representation supports multiple problems in mathematical biology. For example, Ladderpath-derived distances provide an alternative to purely alignment-based similarity measures for phylogenetic and comparative sequence analysis, enabling the study of duplication, modular reuse, and evolutionary innovation; Ladderpath-based tokenization offers a compression-guided alternative to amino-acid-level and BPE-style segmentation in protein and DNA language models, with the potential to capture longer-range regularities and biologically meaningful reusable units.

Overall, the poster argues that compression-based hierarchical decomposition provides an interpretable analytical tool and a mathematical interface connecting sequence complexity, evolution, and learning.

Author

Yu Liu (Beijing Normal University)

Co-authors

Da Zhou (Xiamen University) Ziwei Dai (Southern University of Science and Technology)

Presentation materials

There are no materials yet.