CharacterEncoding

Description

[ShutdownSurvive]
abstract class Tinman.Core.Formatting.CharacterEncoding

Derived from: ICharacterEscape
Extended by: CharacterEncodingSimple ^abstract

Base class for single-byte and multi-byte character encodings.

Public / Constants

DecodeReplacement

public constant DecodeReplacement → ((char) 0xFFFD:char)

The default replacement character for decoding: U+FFFD

SimpleHttp

[ShutdownSurvive]
public static readonly attribute SimpleHttp → (ISimpleHttpText)

A ISimpleHttpText object that uses the built-in character encoding to decode text data.

See also: ISimpleHttp

UTF_16_BE

public static readonly attribute UTF_16_BE → (CharacterEncoding)

Character encoding UTF-16 (big-endian).

UTF_16_LE

public static readonly attribute UTF_16_LE → (CharacterEncoding)

Character encoding UTF-16 (little-endian).

UTF_32_BE

public static readonly attribute UTF_32_BE → (CharacterEncoding)

Character encoding UTF-32 (big-endian).

UTF_32_LE

public static readonly attribute UTF_32_LE → (CharacterEncoding)

Character encoding UTF-32 (little-endian).

UTF_8

public static readonly attribute UTF_8 → (CharacterEncoding)

Character encoding UTF-8.

Public / Constructors

For

public static method For → (2)

name ⁱⁿ : string: The name or null.
defaultEncoding ^opt : CharacterEncoding = null: The default encoding to return in case name ⁱⁿ is not recognized. Defaults to null.
returns → CharacterEncoding: The found encoding or null.

Returns a character encoding by its name.

The given name is normalized before trying to find a character encoding: First, the name is converted to lower-case. Then, all characters that are neither letters nor digits are removed (i.e. only '0'..'9', 'a'..'z' are retained). The resulting normalized name is then tested against the following values in order to find a character encoding:

UTF_8: 'utf8'
UTF_16_BE: 'utf16be'
UTF_16_LE: 'utf16', 'utf16le'
UTF_32_BE: 'utf32be'
UTF_32_LE: 'utf32', 'utf32le'
CharacterEncodingSimple.ISO_8859_1: 'cp1252', 'iso88591', 'latin1', 'windows1252'
CharacterEncodingSimple.ASCII: 'ascii', 'usascii'
CharacterEncodingSimple.Cp437: '437', 'cp437', 'ibm437'

Public / Methods

Decode

_2 overloads

public abstract method Decode¹ → (2)

input ⁱⁿ : ByteBuffer: [not-null]
The input buffer.
replacement ^opt : int32 = CharacterEncoding.DecodeReplacement: The replacement character to use for codes that cannot be decoded.
returns → int32: The decoded Unicode character or -1 iff the given buffer does not contain any more characters.

Decodes a single Unicode character.

public abstract method Decode² → (2)

input ⁱⁿ : IDataStream: [not-null]
The input buffer.
replacement ^opt : int32 = CharacterEncoding.DecodeReplacement: The replacement character to use for codes that cannot be decoded.
returns → int32: The decoded Unicode character or -1 iff the given buffer does not contain any more characters.

Decodes a single Unicode character.

IOException: If an I/O error has occurred.

DecodeString

public method DecodeString → (2)

bytes ⁱⁿ : ByteBuffer: The encoded string.
replacement ^opt : int32 = CharacterEncoding.DecodeReplacement: The replacement character to use for codes that cannot be decoded.
returns → string: The UTF-16 code unit sequence or null iff bytes ⁱⁿ is null.

Converts the given encoded string to a UTF-16 code unit sequence.

Encode

_2 overloads

public abstract method Encode¹ → (2)

character ⁱⁿ : int32: The Unicode character.
output ⁱⁿ : ByteBuffer: [not-null]
The output buffer.
returns → int32: The number of bytes that have been written to output ⁱⁿ if there was enough space left; -n if the buffer does not have enough space left, where n is then the number of bytes that would have been written. Will be 0 iff the given character ⁱⁿ cannot be encoded.

Encodes the given Unicode character.

public abstract method Encode² → (2)

character ⁱⁿ : int32: The Unicode character.
output ⁱⁿ : IDataStream: [not-null]
The output buffer.
returns → int32: The number of bytes that have been written to output ⁱⁿ. Will be 0 iff the given character ⁱⁿ cannot be encoded.

Encodes the given Unicode character.

IOException: If an I/O error has occurred.

EncodeCount

[Pure]
public abstract method EncodeCount → (1)

character ⁱⁿ : int32: The Unicode character.
returns → int32: The number of bytes that are required to encode the given Unicode character. Will be 0 iff character ⁱⁿ cannot be encoded.

Returns the number of bytes that are required to encode the given Unicode character.

EncodeString

[OwnerReturn]
public method EncodeString → (2)

characters ⁱⁿ : string: UTF-16 code unit sequence.
bytes ^opt : ByteBuffer ^own = null: The output buffer (can be null).
returns → ByteBuffer: The resulting buffer or null iff characters ⁱⁿ is null.

Converts the given UTF-16 code unit sequence to an encoded string.

Characters that cannot be encoded will be replaced with the code for EncodeReplacement. The encoded bytes will be written to bytes ^opt beginning at the current buffer position. Before returning, this method sets the ByteBuffer.Position and ByteBuffer.Limit to the range of encoded bytes that have been output.

EncodeStringCount

[Pure]
public method EncodeStringCount → (1)

characters ⁱⁿ : string: UTF-16 code unit sequence.
returns → int32: The number of bytes that are required to encode the given Unicode character.

Returns the number of bytes that are required to encode the given UTF-16 code unit sequence.

Public / Attributes

EncodeCountRange

[Constant]
public abstract attribute EncodeCountRange → (get)

value : RangeI: [>=0]
The range of the number of bytes per character.

Returns the number of bytes per character (for valid input code points).

EncodeReplacement

[Constant]
public virtual attribute EncodeReplacement → (get)

value : char: The replacement character for encoding.

Returns the replacement character that is used for encoding when a character cannot be represented.

The default implementation returns DecodeReplacement.

EncodeReplacementCount

[Constant]
public abstract attribute EncodeReplacementCount → (get)

value : int32: [>=1]
The number of bytes.

Returns the number of bytes that are required to encode EncodeReplacement.

Name

public abstract attribute Name → (get)

value : string: [not-null]
The encoding name.

Returns the name of this character encoding.

CharacterEncoding

Description

Public / Constants

Decode​Replacement

Simple​Http

UTF_16_​BE

UTF_16_​LE

UTF_32_​BE

UTF_32_​LE