An Embarrassingly Simple Method to Mitigate Undesirable Properties of Pretrained Language Model Tokenizers (2022)

First Author: Hofmann V

Attributed to: Exaggeration, cohesion, and fragmentation in on-line forums funded by EPSRC

No abstract provided

Type: Conference/Paper/Proceeding/Abstract