Perplexity open-sources Unigram tokeniser to cut CPU usage by 5-6x

Jessica Rajan, Published on May 29th, 2026

Perplexity open-sources Unigram tokeniser to cut CPU usage by 5-6x

Perplexity has open-sourced a rebuilt Unigram tokeniser designed to cut CPU usage by 5-6 times and improve inference efficiency for smaller AI models. The tool focuses on XLM-RoBERTa's 250,000-token vocabulary, widely used in ranking and retrieval tasks. It matches the reference implementation's output while reducing processing overhead by avoiding costly string rebuilding and hash-maps.

Read Full Article ...

© yugma 2026

Google Play and the Google Play logo are trademarks of Google LLC.

Apple and the Apple logo are trademarks of Apple Inc.