Skip to content

Commit 3710e7c

Browse files
committed
Add text about the history of coded character sets
1 parent 6a25993 commit 3710e7c

File tree

1 file changed

+4
-0
lines changed

1 file changed

+4
-0
lines changed

‎index.html

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2013,6 +2013,8 @@ <h3>Choosing character encodings</h3>
20132013
</ul>
20142014
</aside>
20152015

2016+
<p>Historically (and especially in the period before Unicode was created), there were many [=coded character sets=] in common use, with different schemes for encoding and serializing characters into the memory or storage of computer systems. In addition to standards-based schemes, such as those specified by ISO/IEC 8859, there were also many proprietary vendor or platform-specific [=character sets=], often with associated [=character encoding forms=]. When referring to the [=character encoding form=] of legacy (non-Unicode) [=coded character sets=] in this document, we mean the specific modern mappings of bytes to Unicode code points, as specified in [[Encoding]].</p>
2017+
20162018
<div class="req" id="char-use-utf8">
20172019
<p class="advisement">Use UTF-8 for all document formats, protocols, and serialization forms.</p>
20182020
</div>
@@ -2098,6 +2100,8 @@ <h3>Identifying character encodings</h3>
20982100
<div class="req" id="char_enc_rules">
20992101
<p class="advisement">If a specification is based on a format that permits [=character encodings=] other than UTF-8, the specification SHOULD restrict the [=character encoding=] to UTF-8.</p>
21002102
</div>
2103+
2104+
<p>Document formats or protocols sometimes provide support for [=legacy character encodings=]. Specifications built upon those formats, where it is feasible to do so, can specify that conformant implementations use only UTF-8.</p>
21012105

21022106
<div class="req" id="char_heuristics">
21032107
<p class="advisement">Specifications MUST NOT propose the use of heuristics to determine the encoding of data.</p>

0 commit comments

Comments
 (0)