Unicode
Description
This helper class provides low-level functionality for using the Unicode standard.
Within the scope of this class, the term "character" refers to a 32-bit signed integer value and is interpreted as follows:
-
Equal to -1:
This value indicates that a character is invalid, undefined or missing. -
Less than -1:
A sequence of UTF-16 code units that represents a valid surrogate pair, see CU. -
Less than SurrogateFirst or greater than SurrogateLast, but not greater than 65535:
A single non-surrogate UTF-16 code unit, which is also a valid Unicode code point. -
Greater than or equal to SurrogateFirst and less than or equal to SurrogateLast:
A surrogate UTF-16 code unit, which is not a valid Unicode code point. -
Greater than 65535 and less than CodePoints:
A single UTF-32 code unit, which is also a valid Unicode code point. -
Greater than or equal to CodePoints:
All values in that range are defined as invalid.
Public / Methods
C
2 overloads
Constructs a string for the given character.
Constructs a UTF-16 code unit sequence from the given string position.
CPS
Converts the given character in to a Unicode code point or a UTF-16 surrogate code unit.
- See also