Artifact
28d88faea58ce220dd955aed1e0b60745f0b3919:
Wiki page
[tDOM and characters beyond BMP] by
rolf
2013-12-25 01:36:33.
0000: 44 20 32 30 31 33 2d 31 32 2d 32 35 54 30 31 3a D 2013-12-25T01:
0010: 33 36 3a 33 33 2e 33 35 38 0a 4c 20 74 44 4f 4d 36:33.358.L tDOM
0020: 5c 73 61 6e 64 5c 73 63 68 61 72 61 63 74 65 72 \sand\scharacter
0030: 73 5c 73 62 65 79 6f 6e 64 5c 73 42 4d 50 0a 50 s\sbeyond\sBMP.P
0040: 20 63 33 63 66 33 35 30 32 63 36 63 30 31 39 63 c3cf3502c6c019c
0050: 38 66 63 63 62 62 63 62 37 37 36 36 61 61 36 32 8fccbbcb7766aa62
0060: 30 61 65 31 61 34 35 37 65 0a 55 20 72 6f 6c 66 0ae1a457e.U rolf
0070: 0a 57 20 34 36 30 0a 57 69 74 68 20 74 63 6c 20 .W 460.With tcl
0080: 62 75 69 6c 64 20 77 69 74 68 20 74 68 65 20 64 build with the d
0090: 65 66 61 75 6c 74 73 20 61 73 20 73 68 69 70 70 efaults as shipp
00a0: 65 64 2c 20 6f 6e 65 20 69 73 6e 27 74 20 61 62 ed, one isn't ab
00b0: 6c 65 20 74 6f 20 70 72 6f 63 65 73 73 20 61 6c le to process al
00c0: 6c 20 64 61 74 61 2c 20 77 68 69 63 68 20 63 61 l data, which ca
00d0: 6e 20 74 72 61 6e 73 70 6f 72 74 65 64 20 77 69 n transported wi
00e0: 74 68 20 58 4d 4c 2e 20 42 65 63 61 75 73 65 20 th XML. Because
00f0: 6f 66 20 74 68 65 20 63 68 61 72 61 74 65 72 79 of the charatery
0100: 20 65 6e 74 69 74 79 20 6d 65 63 68 61 6e 69 73 entity mechanis
0110: 6d 20 6f 66 20 58 4d 4c 2c 20 65 76 65 6e 20 63 m of XML, even c
0120: 68 61 72 61 63 74 65 72 73 20 62 65 79 6f 6e 64 haracters beyond
0130: 20 42 4d 50 20 63 61 6e 20 62 65 20 65 78 70 72 BMP can be expr
0140: 65 73 73 65 64 20 69 6e 20 37 2d 62 69 74 20 41 essed in 7-bit A
0150: 53 43 49 49 2e 20 28 49 66 20 79 6f 75 27 72 65 SCII. (If you're
0160: 20 6e 6f 74 20 73 6f 20 73 75 72 65 2c 20 77 68 not so sure, wh
0170: 61 74 20 74 68 69 73 20 58 4d 4c 20 73 70 65 65 at this XML spee
0180: 63 68 20 69 73 20 61 6c 6c 20 61 62 6f 75 74 20 ch is all about
0190: 62 75 74 20 79 6f 75 27 72 65 20 6f 6e 20 74 6f but you're on to
01a0: 70 20 77 69 74 68 20 74 63 6c 2c 20 74 68 69 6e p with tcl, thin
01b0: 6b 20 61 62 6f 75 74 20 69 74 20 61 73 20 5c 55 k about it as \U
01c0: 68 68 68 68 68 68 2e 29 20 74 44 4f 4d 20 77 69 hhhhhh.) tDOM wi
01d0: 6c 6c 20 64 69 73 67 65 73 74 20 61 6c 6c 20 55 ll disgest all U
01e0: 6e 69 63 6f 64 65 20 63 68 61 72 61 63 74 65 72 nicode character
01f0: 20 61 6e 64 20 77 69 6c 6c 20 69 74 20 67 69 76 and will it giv
0200: 65 20 62 61 63 6b 20 73 65 72 69 61 6c 69 7a 65 e back serialize
0210: 64 20 28 63 68 61 72 61 63 74 65 72 73 20 62 65 d (characters be
0220: 79 6f 6e 64 20 42 4d 4c 20 61 73 20 63 68 61 72 yond BML as char
0230: 61 63 74 65 72 73 20 65 6e 74 69 74 65 73 29 2e acters entites).
0240: 20 0d 0a 0a 5a 20 33 38 31 36 66 64 39 35 31 34 ...Z 3816fd9514
0250: 37 30 62 35 32 63 39 36 30 64 64 34 38 39 33 35 70b52c960dd48935
0260: 32 66 63 39 34 62 0a 2fc94b.