Plain English Papers All LLMs use tokenization. Are we doing it totally wrong? Slashing model size by 85% while redefining how we build adaptable, efficient LLMs T-FREE challenges how we do tokenization. Image source from Vellum.ai (https://www.vellum.ai/llm-parameters/logit-bias).