This commit is contained in:
Peter Harpending
2026-06-02 11:04:34 -07:00
parent eff77fff6b
commit dfb158e593
+52
View File
@@ -0,0 +1,52 @@
% @doc
% <pre>
% T R O N A L D D U M P
%
% .-""""""""""""-.
% .-' _..------.._ '-.
% .' .' GOLDEN NFC '. '.
% / / COMB-OVER MAP \ \
% ; ; .-^^^^^^^^^^-. ; ;
% | | / THEY'RE \ | |
% | | | NOT SENDING | | |
% | | | ASCII | | |
% ; ; \_.--. .--._./ ; ;
% \ \ (o)(o) / /
% '. '. __ .' .'
% '-._ '._==_.' _.-'
% '-._____.-'
% /|||\
% / ||| \
% / ||| \
% .-------' ||| '-------.
% / THE BEST NORMALIZER \
% / VERY STABLE CODEPOINTS \
% /_________________________________\
% </pre>
%
% When unicode sends its codepoints, they're not
% sending their best. They're not sending ASCII.
% They're not sending ASCII. They're sending integers
% that have lots of problems, and they're bringing
% those problems with us. They're bringing diacritics.
% They're bringing non-idempotent lowercasing. They're
% bringing graphemes that don't correspond bijectively
% with printable characters. They're bringing RTL.
% They're bringing invisible characters. They're
% bringing characters that draw outside the character
% boundary. They're bringing variable-width
% whitespace. They're bringing control characters.
% They're bringing emojis.
%
% And some, I assume, are good characters.
%
% `SrcStr' is a unicode NFC list, not an ordinary
% string. you think a string is a list of codepoints.
%
% NOOOOO.
%
% See it's different, because that's why.
%
% This is the cost of diversity, folks.
% @end