North Coast Synthesis Ltd.

Better (than) tokenization with BLTs

◀ Prev | 2025-08-01, access: $ Basic | Next ▶

Video theory text LLaMA tokenization Using "patches" of input bytes, instead of a fixed token list, allows better scalability and improves performance on some tasks that are hard for token-based LLMs.