Update: Not only can you fix Unicode mistakes with Python, you can fix Unicode mistakes with our open source Python package, “ftfy”.

You have almost certainly seen text on a computer that looks something like this:

If numbers arenâ€™t beautiful, I donâ€™t know what is.

Somewhere, a computer got hold of a list of numbers that were intended to constitute a quotation and did something distinctly un-beautiful with it. A person reading that can deduce that it was actually supposed to say this:

If numbers aren’t beautiful, I don’t know what is.

A modern computer has the ability to display text that uses over 100,000 different characters, but unfortunately that text sometimes passes through a doddering old program that believes there are only the 256 that it can fit in a single byte. The program doesn’t even bother to check what encoding the text is in; it just uses its own favorite encoding and turns a bunch of characters into strings of completely different characters.
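The failure described above, where a program decodes bytes with its own favorite single-byte encoding, can be reproduced in a few lines of Python. This is a minimal sketch using only the standard library, not the ftfy package. It assumes the text was stored as UTF-8 and that the misbehaving program assumed Windows-1252; the post does not say which encodings were involved, so that pairing is an illustrative assumption.

```python
# The text as intended, containing a curly apostrophe (U+2019).
original = "If numbers aren\u2019t beautiful, I don\u2019t know what is."

# Step 1: the text is stored as UTF-8, where each curly apostrophe
# becomes three bytes (E2 80 99) instead of one.
utf8_bytes = original.encode("utf-8")

# Step 2: a program wrongly assumes a single-byte encoding
# (Windows-1252 is an assumption made for this sketch) and turns
# every byte into its own character.
garbled = utf8_bytes.decode("windows-1252")
print(garbled)

# Step 3: undo the mistake by reversing it: recover the original
# bytes, then decode them with the encoding they were really in.
repaired = garbled.encode("windows-1252").decode("utf-8")
print(repaired)
```

Reversing the mistake by hand only works when you already know which two encodings were confused and the garbled text still contains every original byte. Real-world text is rarely that tidy, which is why a dedicated tool is useful.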