Rule Extrapolation in Language Models: A Study of Compositional Generalization on OOD Prompts
Spotlight, Neural Information Processing Systems, 2024
We introduce rule extrapolation, a novel metric of OOD compositional generalisation on formal languages, and evaluate it on several AR Language model architectures. We also propose a normative model to explain OOD behaviour via simplicity bias.
Download here