Plain English Papers New attack needs just API access and $20 to extract GPT-4's hidden architecture A novel attack extracts hidden architectural details from GPT4, PaLM, and more
All LLMs use tokenization. Are we doing it totally wrong? Slashing model size by 85% while redefining how we build adaptable, efficient LLMs