Lots of apps use automatic encoding and language detection (including browsers and text editors/viewers) and thus could benefit from Chardet v7 a lot.
Chardet 7.0 has HUGE improvements in both speed and accuracy of automatic Encoding and Language detection, and appears to be significantly rewritten.
E.g. Language detection accuracy 90.9% vs 47.0%.
Citing:
“It maintains API compatibility with chardet 5.x and 6.x, but with 27x improvements to detection speed, and highly accurate support for even more encodings.
It fixes numerous longstanding issues (like poor accuracy on short strings and poor performance), and is just all-around better than previous versions of chardet in every possible respect.
It’s even faster, more memory efficient, and more accurate than charset-normalizer, which is something I’m particularly proud of.”
Just the next release (26.10), note though that chardet is actually synced as-is from debian and not maintained in Ubuntu currently so you want the new version to be in debian first (the experimental pocket would be sufficient there (unless it is overly buggy)) before it can be synced …