unicode
This commit is contained in:
@@ -0,0 +1,52 @@
|
|||||||
|
% @doc
|
||||||
|
% <pre>
|
||||||
|
% T R O N A L D D U M P
|
||||||
|
%
|
||||||
|
% .-""""""""""""-.
|
||||||
|
% .-' _..------.._ '-.
|
||||||
|
% .' .' GOLDEN NFC '. '.
|
||||||
|
% / / COMB-OVER MAP \ \
|
||||||
|
% ; ; .-^^^^^^^^^^-. ; ;
|
||||||
|
% | | / THEY'RE \ | |
|
||||||
|
% | | | NOT SENDING | | |
|
||||||
|
% | | | ASCII | | |
|
||||||
|
% ; ; \_.--. .--._./ ; ;
|
||||||
|
% \ \ (o)(o) / /
|
||||||
|
% '. '. __ .' .'
|
||||||
|
% '-._ '._==_.' _.-'
|
||||||
|
% '-._____.-'
|
||||||
|
% /|||\
|
||||||
|
% / ||| \
|
||||||
|
% / ||| \
|
||||||
|
% .-------' ||| '-------.
|
||||||
|
% / THE BEST NORMALIZER \
|
||||||
|
% / VERY STABLE CODEPOINTS \
|
||||||
|
% /_________________________________\
|
||||||
|
% </pre>
|
||||||
|
%
|
||||||
|
% When unicode sends its codepoints, they're not
|
||||||
|
% sending their best. They're not sending ASCII.
|
||||||
|
% They're not sending ASCII. They're sending integers
|
||||||
|
% that have lots of problems, and they're bringing
|
||||||
|
% those problems with us. They're bringing diacritics.
|
||||||
|
% They're bringing non-idempotent lowercasing. They're
|
||||||
|
% bringing graphemes that don't correspond bijectively
|
||||||
|
% with printable characters. They're bringing RTL.
|
||||||
|
% They're bringing invisible characters. They're
|
||||||
|
% bringing characters that draw outside the character
|
||||||
|
% boundary. They're bringing variable-width
|
||||||
|
% whitespace. They're bringing control characters.
|
||||||
|
% They're bringing emojis.
|
||||||
|
%
|
||||||
|
% And some, I assume, are good characters.
|
||||||
|
%
|
||||||
|
% `SrcStr' is a unicode NFC list, not an ordinary
|
||||||
|
% string. you think a string is a list of codepoints.
|
||||||
|
%
|
||||||
|
% NOOOOO.
|
||||||
|
%
|
||||||
|
% See it's different, because that's why.
|
||||||
|
%
|
||||||
|
% This is the cost of diversity, folks.
|
||||||
|
% @end
|
||||||
|
|
||||||
Reference in New Issue
Block a user