OpenJDK / amber / amber
changeset 784:b42ef9406aae
4752069: (cs spec) BOM should not be ignored in UTF-16 charsets
Summary: API doc update regarding BOM hanlding in UTF-16 charsets
Reviewed-by: alanb
author | sherman |
---|---|
date | Wed, 25 Jun 2008 08:27:06 -0700 |
parents | b22e3837ba56 |
children | 36c29b2692f1 |
files | jdk/src/share/classes/java/nio/charset/Charset.java |
diffstat | 1 files changed, 11 insertions(+), 10 deletions(-) [+] |
line wrap: on
line diff
--- a/jdk/src/share/classes/java/nio/charset/Charset.java Mon Jun 16 10:46:22 2008 -0700 +++ b/jdk/src/share/classes/java/nio/charset/Charset.java Wed Jun 25 08:27:06 2008 -0700 @@ -188,21 +188,22 @@ * <ul> * * <li><p> When decoding, the <tt>UTF-16BE</tt> and <tt>UTF-16LE</tt> - * charsets ignore byte-order marks; when encoding, they do not write + * charsets interpret the initial byte-order marks as a <small>ZERO-WIDTH + * NON-BREAKING SPACE</small>; when encoding, they do not write * byte-order marks. </p></li> + * - * <li><p> When decoding, the <tt>UTF-16</tt> charset interprets a byte-order - * mark to indicate the byte order of the stream but defaults to big-endian - * if there is no byte-order mark; when encoding, it uses big-endian byte - * order and writes a big-endian byte-order mark. </p></li> + * <li><p> When decoding, the <tt>UTF-16</tt> charset interprets the + * byte-order mark at the beginning of the input stream to indicate the + * byte-order of the stream but defaults to big-endian if there is no + * byte-order mark; when encoding, it uses big-endian byte order and writes + * a big-endian byte-order mark. </p></li> * * </ul> * - * In any case, when a byte-order mark is read at the beginning of a decoding - * operation it is omitted from the resulting sequence of characters. Byte - * order marks occuring after the first element of an input sequence are not - * omitted since the same code is used to represent <small>ZERO-WIDTH - * NON-BREAKING SPACE</small>. + * In any case, byte order marks occuring after the first element of an + * input sequence are not omitted since the same code is used to represent + * <small>ZERO-WIDTH NON-BREAKING SPACE</small>. * * <p> Every instance of the Java virtual machine has a default charset, which * may or may not be one of the standard charsets. The default charset is