| java.lang.Object | |
| ↳ | java.lang.Character |
Character is the wrapper for the primitive type char. This
class also provides a number of utility methods for working with
chars.
Character data is based upon the Unicode Standard, 4.0. The Unicode specification, character tables and other information are available at http://www.unicode.org/.
Unicode characters are referred to as code points. The range of valid
code points is U+0000 to U+10FFFF. The Basic Multilingual Plane (BMP)
is the code point range U+0000 to U+FFFF. Characters above the BMP are
referred to as Supplementary Characters. On the Java platform, UTF-16
encoding and char pairs are used to represent code points in
the supplementary range. A pair of char values that represent
a supplementary character are made up of a high surrogate with a
value range of 0xD800 to 0xDBFF and a low surrogate with a value
range of 0xDC00 to 0xDFFF.
On the Java platform a char value represents either a single
BMP code point or a UTF-16 unit that's part of a surrogate pair. The
int type is used to represent all Unicode code points.
| Nested Classes | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Character.Subset | |||||||||||
| Character.UnicodeBlock | Blocks of characters, as defined by the Unicode 4.0.1 specification. | ||||||||||
| Constants | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| byte | COMBINING_SPACING_MARK | Unicode category constant Mc. | |||||||||
| byte | CONNECTOR_PUNCTUATION | Unicode category constant Pc. | |||||||||
| byte | CONTROL | Unicode category constant Cc. | |||||||||
| byte | CURRENCY_SYMBOL | Unicode category constant Sc. | |||||||||
| byte | DASH_PUNCTUATION | Unicode category constant Pd. | |||||||||
| byte | DECIMAL_DIGIT_NUMBER | Unicode category constant Nd. | |||||||||
| byte | DIRECTIONALITY_ARABIC_NUMBER | Unicode bidirectional constant AN. | |||||||||
| byte | DIRECTIONALITY_BOUNDARY_NEUTRAL | Unicode bidirectional constant BN. | |||||||||
| byte | DIRECTIONALITY_COMMON_NUMBER_SEPARATOR | Unicode bidirectional constant CS. | |||||||||
| byte | DIRECTIONALITY_EUROPEAN_NUMBER | Unicode bidirectional constant EN. | |||||||||
| byte | DIRECTIONALITY_EUROPEAN_NUMBER_SEPARATOR | Unicode bidirectional constant ES. | |||||||||
| byte | DIRECTIONALITY_EUROPEAN_NUMBER_TERMINATOR | Unicode bidirectional constant ET. | |||||||||
| byte | DIRECTIONALITY_LEFT_TO_RIGHT | Unicode bidirectional constant L. | |||||||||
| byte | DIRECTIONALITY_LEFT_TO_RIGHT_EMBEDDING | Unicode bidirectional constant LRE. | |||||||||
| byte | DIRECTIONALITY_LEFT_TO_RIGHT_OVERRIDE | Unicode bidirectional constant LRO. | |||||||||
| byte | DIRECTIONALITY_NONSPACING_MARK | Unicode bidirectional constant NSM. | |||||||||
| byte | DIRECTIONALITY_OTHER_NEUTRALS | Unicode bidirectional constant ON. | |||||||||
| byte | DIRECTIONALITY_PARAGRAPH_SEPARATOR | Unicode bidirectional constant B. | |||||||||
| byte | DIRECTIONALITY_POP_DIRECTIONAL_FORMAT | Unicode bidirectional constant PDF. | |||||||||
| byte | DIRECTIONALITY_RIGHT_TO_LEFT | Unicode bidirectional constant R. | |||||||||
| byte | DIRECTIONALITY_RIGHT_TO_LEFT_ARABIC | Unicode bidirectional constant AL. | |||||||||
| byte | DIRECTIONALITY_RIGHT_TO_LEFT_EMBEDDING | Unicode bidirectional constant RLE. | |||||||||
| byte | DIRECTIONALITY_RIGHT_TO_LEFT_OVERRIDE | Unicode bidirectional constant RLO. | |||||||||
| byte | DIRECTIONALITY_SEGMENT_SEPARATOR | Unicode bidirectional constant S. | |||||||||
| byte | DIRECTIONALITY_UNDEFINED | Unicode bidirectional constant. | |||||||||
| byte | DIRECTIONALITY_WHITESPACE | Unicode bidirectional constant WS. | |||||||||
| byte | ENCLOSING_MARK | Unicode category constant Me. | |||||||||
| byte | END_PUNCTUATION | Unicode category constant Pe. | |||||||||
| byte | FINAL_QUOTE_PUNCTUATION | Unicode category constant Pf. | |||||||||
| byte | FORMAT | Unicode category constant Cf. | |||||||||
| byte | INITIAL_QUOTE_PUNCTUATION | Unicode category constant Pi. | |||||||||
| byte | LETTER_NUMBER | Unicode category constant Nl. | |||||||||
| byte | LINE_SEPARATOR | Unicode category constant Zl. | |||||||||
| byte | LOWERCASE_LETTER | Unicode category constant Ll. | |||||||||
| byte | MATH_SYMBOL | Unicode category constant Sm. | |||||||||
| int | MAX_CODE_POINT |
Maximum code point value - |
|||||||||
| char | MAX_HIGH_SURROGATE |
Maximum value of a high surrogate or leading surrogate unit in UTF-16
encoding - |
|||||||||
| char | MAX_LOW_SURROGATE | Maximum value of a low surrogate or trailing surrogate unit in UTF-16
encoding - '?'. |
|||||||||
| int | MAX_RADIX | The maximum possible radix used for conversions between Characters and integers. | |||||||||
| char | MAX_SURROGATE |
Maximum value of a surrogate unit in UTF-16 encoding - |
|||||||||
| char | MAX_VALUE | The maximum possible Character value. | |||||||||
| int | MIN_CODE_POINT |
Minimum code point value - |
|||||||||
| char | MIN_HIGH_SURROGATE |
Minimum value of a high surrogate or leading surrogate unit in UTF-16
encoding - |
|||||||||
| char | MIN_LOW_SURROGATE |
Minimum value of a low surrogate or trailing surrogate unit in UTF-16
encoding - |
|||||||||
| int | MIN_RADIX | The minimum possible radix used for conversions between Characters and integers. | |||||||||
| int | MIN_SUPPLEMENTARY_CODE_POINT |
Minimum value of a supplementary code point - |
|||||||||
| char | MIN_SURROGATE |
Minimum value of a surrogate unit in UTF-16 encoding - |
|||||||||
| char | MIN_VALUE | The minimum possible Character value. | |||||||||
| byte | MODIFIER_LETTER | Unicode category constant Lm. | |||||||||
| byte | MODIFIER_SYMBOL | Unicode category constant Sk. | |||||||||
| byte | NON_SPACING_MARK | Unicode category constant Mn. | |||||||||
| byte | OTHER_LETTER | Unicode category constant Lo. | |||||||||
| byte | OTHER_NUMBER | Unicode category constant No. | |||||||||
| byte | OTHER_PUNCTUATION | Unicode category constant Po. | |||||||||
| byte | OTHER_SYMBOL | Unicode category constant So. | |||||||||
| byte | PARAGRAPH_SEPARATOR | Unicode category constant Zp. | |||||||||
| byte | PRIVATE_USE | Unicode category constant Co. | |||||||||
| int | SIZE |
Constant for the number of bits to represent a |
|||||||||
| byte | SPACE_SEPARATOR | Unicode category constant Zs. | |||||||||
| byte | START_PUNCTUATION | Unicode category constant Ps. | |||||||||
| byte | SURROGATE | Unicode category constant Cs. | |||||||||
| byte | TITLECASE_LETTER | Unicode category constant Lt. | |||||||||
| Class<Character> | TYPE | The char Class object. |
|||||||||
| byte | UNASSIGNED | Unicode category constant Cn. | |||||||||
| byte | UPPERCASE_LETTER | Unicode category constant Lu. | |||||||||
| Public Constructors | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
Constructs a new instance of the receiver which represents the char
valued argument.
| |||||||||||
| Public Methods | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
Calculates the number of | |||||||||||
Returns the char value which the receiver represents.
| |||||||||||
Returns the code point at the index in the | |||||||||||
Returns the code point at the index in the | |||||||||||
Returns the code point at the index in the | |||||||||||
Returns the Unicode code point that proceeds the | |||||||||||
Returns the Unicode code point that proceeds the | |||||||||||
Returns the Unicode code point that proceeds the | |||||||||||
Counts the number of Unicode code points in the subsequence of the
| |||||||||||
Counts the number of Unicode code points in the subsequence of the
| |||||||||||
Compares the receiver to the specified Character to determine the
relative ordering.
| |||||||||||
Convenient method to determine the value of character
codePoint in the supplied radix. | |||||||||||
Convenient method to determine the value of character
c in
the supplied radix. | |||||||||||
Compares the argument to the receiver, and returns true if they represent
the same object using a class specific comparison.
| |||||||||||
Returns the character which represents the value in the specified radix.
| |||||||||||
Gets the Unicode directionality of the specified character.
| |||||||||||
Gets the Unicode directionality of the specified character.
| |||||||||||
Gets the numeric value of the Unicode character.
| |||||||||||
Gets the numeric value of the Unicode character.
| |||||||||||
Gets the general Unicode category of the specified character.
| |||||||||||
Gets the general Unicode category of the specified character.
| |||||||||||
Returns an integer hash code for the receiver.
| |||||||||||
Returns whether the specified character is defined in the Unicode
specification.
| |||||||||||
Returns whether the specified character is defined in the Unicode
specification.
| |||||||||||
Returns whether the character is a digit.
| |||||||||||
Returns whether the character is a digit.
| |||||||||||
A test for determining if the | |||||||||||
Returns whether the character is an ISO control character.
| |||||||||||
Returns whether the character is an ISO control character.
| |||||||||||
Returns whether the specified character is ignorable in a Java or Unicode
identifier.
| |||||||||||
Returns whether the specified character is ignorable in a Java or Unicode
identifier.
| |||||||||||
Returns whether the character is a valid part of a Unicode identifier as
other than the first character.
| |||||||||||
Returns whether the character is a valid part of a Unicode identifier as
other than the first character.
| |||||||||||
Returns whether the character is a valid start of a Unicode identifier
| |||||||||||
Returns whether the character is a valid start of a Unicode identifier
| |||||||||||
This method is deprecated.
Use isJavaIdentifierStart(char)
| |||||||||||
This method is deprecated.
Use isJavaIdentifierPart(char)
| |||||||||||
Returns whether the character is a letter.
| |||||||||||
Returns whether the character is a letter.
| |||||||||||
Returns whether the character is a letter or a digit.
| |||||||||||
Returns whether the character is a letter or a digit.
| |||||||||||
A test for determining if the | |||||||||||
Returns whether the character is a lower case letter.
| |||||||||||
Returns whether the character is a lower case letter.
| |||||||||||
Returns whether the specified character is mirrored
| |||||||||||
Returns whether the specified character is mirrored
| |||||||||||
This method is deprecated.
Use isWhitespace(char)
| |||||||||||
Returns whether the character is a Unicode space character.
| |||||||||||
Returns whether the character is a Unicode space character.
| |||||||||||
A test for determining if the | |||||||||||
A test for determining if the | |||||||||||
Returns whether the character is a titlecase character.
| |||||||||||
Returns whether the character is a titlecase character.
| |||||||||||
Returns whether the character is valid as part of a Unicode identifier as
other than the first character.
| |||||||||||
Returns whether the character is valid as part of a Unicode identifier as
other than the first character.
| |||||||||||
Returns whether the character is a valid initial character for a Unicode
identifier.
| |||||||||||
Returns whether the character is a valid initial character for a Unicode
identifier.
| |||||||||||
Returns whether the character is an upper case letter.
| |||||||||||
Returns whether the character is an upper case letter.
| |||||||||||
A test for determining if the | |||||||||||
Returns whether the character is a whitespace character in Java.
| |||||||||||
Returns whether the character is a whitespace character in Java.
| |||||||||||
Determines the index into the | |||||||||||
Determines the index into the | |||||||||||
Reverse the order of the first and second bytes in character
| |||||||||||
Converts the Unicode code point, | |||||||||||
Converts the Unicode code point, | |||||||||||
Converts a surrogate pair into a Unicode code point. | |||||||||||
Returns the lower case equivalent for the character when the character is
an upper case letter, otherwise returns the character.
| |||||||||||
Returns the lower case equivalent for the character when the character is
an upper case letter, otherwise returns the character.
| |||||||||||
Converts the specified character to its string representation.
| |||||||||||
Returns a string containing a concise, human-readable description of the
receiver.
| |||||||||||
Returns the title case equivalent for the character, otherwise returns the
character.
| |||||||||||
Returns the title case equivalent for the character, otherwise returns the
character.
| |||||||||||
Returns the upper case equivalent for the character when the character is
a lower case letter, otherwise returns the character.
| |||||||||||
Returns the upper case equivalent for the character when the character is
a lower case letter, otherwise returns the character.
| |||||||||||
Returns a | |||||||||||
|
[Expand]
Inherited Methods | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
From class java.lang.Object
| |||||||||||
From interface java.lang.Comparable
| |||||||||||
Unicode category constant Mc.
Unicode category constant Pc.
Unicode category constant Cc.
Unicode category constant Sc.
Unicode category constant Pd.
Unicode category constant Nd.
Unicode bidirectional constant AN.
Unicode bidirectional constant BN.
Unicode bidirectional constant CS.
Unicode bidirectional constant EN.
Unicode bidirectional constant ES.
Unicode bidirectional constant ET.
Unicode bidirectional constant L.
Unicode bidirectional constant LRE.
Unicode bidirectional constant LRO.
Unicode bidirectional constant NSM.
Unicode bidirectional constant ON.
Unicode bidirectional constant B.
Unicode bidirectional constant PDF.
Unicode bidirectional constant R.
Unicode bidirectional constant AL.
Unicode bidirectional constant RLE.
Unicode bidirectional constant RLO.
Unicode bidirectional constant S.
Unicode bidirectional constant.
Unicode bidirectional constant WS.
Unicode category constant Me.
Unicode category constant Pe.
Unicode category constant Pf.
Unicode category constant Cf.
Unicode category constant Pi.
Unicode category constant Nl.
Unicode category constant Zl.
Unicode category constant Ll.
Unicode category constant Sm.
Maximum code point value - U+10FFFF.
Maximum value of a high surrogate or leading surrogate unit in UTF-16
encoding - '?'.
Maximum value of a low surrogate or trailing surrogate unit in UTF-16
encoding - '?'.
The maximum possible radix used for conversions between Characters and integers.
Maximum value of a surrogate unit in UTF-16 encoding - '?'.
The maximum possible Character value.
Minimum code point value - U+0000.
Minimum value of a high surrogate or leading surrogate unit in UTF-16
encoding - '?'.
Minimum value of a low surrogate or trailing surrogate unit in UTF-16
encoding - '?'.
The minimum possible radix used for conversions between Characters and integers.
Minimum value of a supplementary code point - U+010000.
Minimum value of a surrogate unit in UTF-16 encoding - '?'.
The minimum possible Character value.
Unicode category constant Lm.
Unicode category constant Sk.
Unicode category constant Mn.
Unicode category constant Lo.
Unicode category constant No.
Unicode category constant Po.
Unicode category constant So.
Unicode category constant Zp.
Unicode category constant Co.
Constant for the number of bits to represent a char in
two's compliment form.
Unicode category constant Zs.
Unicode category constant Ps.
Unicode category constant Cs.
Unicode category constant Lt.
Unicode category constant Cn.
Unicode category constant Lu.
Constructs a new instance of the receiver which represents the char valued argument.
| value | the char to store in the new instance. |
|---|
Calculates the number of char values required to represent
the Unicode code point. This method only tests if the
codePoint is greater than or equal to 0x10000,
in which case 2 is returned, otherwise 1.
To test if the code point is valid, use the
isValidCodePoint(int) method.
| codePoint | The code point to test. |
|---|
int value of 2 or 1.Returns the char value which the receiver represents.
Returns the code point at the index in the char[]. If
char unit at the index is a high-surrogate unit, the next
index is less than the length of the sequence and the char
unit at the next index is a low surrogate unit, then the code point
represented by the pair is returned; otherwise the char
unit at the index is returned.
| seq | The sequence of char units. |
|---|---|
| index | The index into the seq to retrieve and
convert. |
| NullPointerException | if seq is null. |
|---|---|
| IndexOutOfBoundsException | if the index is negative
or greater than or equal to seq.length(). |
Returns the code point at the index in the CharSequence.
If char unit at the index is a high-surrogate unit, the
next index is less than the length of the sequence and the
char unit at the next index is a low surrogate unit, then
the code point represented by the pair is returned; otherwise the
char unit at the index is returned.
| seq | The sequence of char units. |
|---|---|
| index | The index into the seq to retrieve and
convert. |
| NullPointerException | if seq is null. |
|---|---|
| IndexOutOfBoundsException | if the index is negative
or greater than or equal to seq.length(). |
Returns the code point at the index in the char[] that's
within the limit. If char unit at the index is a
high-surrogate unit, the next index is less than the limit
and the char unit at the next index is a low surrogate
unit, then the code point represented by the pair is returned; otherwise
the char unit at the index is returned.
| seq | The sequence of char units. |
|---|---|
| index | The index into the seq to retrieve and
convert. |
| limit | The exclusive index into the seq that marks
the end of the units that can be used. |
| NullPointerException | if seq is null. |
|---|---|
| IndexOutOfBoundsException | if the index is
negative, greater than or equal to limit,
limit is negative or limit is
greater than the length of seq. |
Returns the Unicode code point that proceeds the index in
the CharSequence. If the char unit at
index - 1 is within the low surrogate range, the value
index - 2 isn't negative and the char unit
at index - 2 is within the high surrogate range, then the
supplementary code point made up of the surrogate pair is returned;
otherwise, the char value at index - 1 is
returned.
| seq | The CharSequence to search. |
|---|---|
| index | The index into the seq. |
| NullPointerException | if seq is null. |
|---|---|
| IndexOutOfBoundsException | if index is less than 1
or greater than seq.length(). |
Returns the Unicode code point that proceeds the index in
the char[]. If the char unit at
index - 1 is within the low surrogate range, the value
index - 2 isn't negative and the char unit
at index - 2 is within the high surrogate range, then the
supplementary code point made up of the surrogate pair is returned;
otherwise, the char value at index - 1 is
returned.
| seq | The char[] to search. |
|---|---|
| index | The index into the seq. |
| NullPointerException | if seq is null. |
|---|---|
| IndexOutOfBoundsException | if index is less than 1
or greater than seq.length. |
Returns the Unicode code point that proceeds the index in
the char[] and isn't less than start. If
the char unit at index - 1 is within the
low surrogate range, the value index - 2 isn't less than
start and the char unit at
index - 2 is within the high surrogate range, then the
supplementary code point made up of the surrogate pair is returned;
otherwise, the char value at index - 1 is
returned.
| seq | The char[] to search. |
|---|---|
| index | The index into the seq. |
| NullPointerException | if seq is null. |
|---|---|
| IndexOutOfBoundsException | if index is less than or
equal to start, index is greater
than seq.length, start is not
negative and start is greater than
seq.length. |
Counts the number of Unicode code points in the subsequence of the
CharSequence, as delineated by the
beginIndex and endIndex. Any surrogate
values with missing pair values will be counted as 1 code point.
| seq | The CharSequence to look through. |
|---|---|
| beginIndex | The inclusive index to begin counting at. |
| endIndex | The exclusive index to stop counting at. |
| NullPointerException | if seq is null. |
|---|---|
| IndexOutOfBoundsException | if beginIndex is
negative, greater than seq.length() or greater
than endIndex. |
Counts the number of Unicode code points in the subsequence of the
char[], as delineated by the offset and
count. Any surrogate values with missing pair values will
be counted as 1 code point.
| seq | The char[] to look through. |
|---|---|
| offset | The inclusive index to begin counting at. |
| count | The number of char values to look through in
seq. |
| NullPointerException | if seq is null. |
|---|---|
| IndexOutOfBoundsException | if offset or
count is negative or if endIndex is
greater than seq.length. |
Compares the receiver to the specified Character to determine the relative ordering.
| c | the Character |
|---|
| NullPointerException | if c is null. |
|---|
Convenient method to determine the value of character
codePoint in the supplied radix. The value of
radix must be between MIN_RADIX and MAX_RADIX.
| codePoint | the character, including supplementary characters |
|---|---|
| radix | the radix |
Compares the argument to the receiver, and returns true if they represent the same object using a class specific comparison.
In this case, the argument must also be a Character, and the receiver and argument must represent the same char value.
| object | the object to compare with this object |
|---|
true if the object is the same as this object
false if it is different from this objectReturns the character which represents the value in the specified radix. The radix must be between MIN_RADIX and MAX_RADIX inclusive.
| digit | the integer value |
|---|---|
| radix | the radix |
Gets the Unicode directionality of the specified character.
| codePoint | the character, including supplementary characters |
|---|
Gets the Unicode directionality of the specified character.
| c | the character |
|---|
Gets the numeric value of the Unicode character.
| codePoint | the character, including supplementary characters |
|---|
Gets the numeric value of the Unicode character.
| c | the character |
|---|
Gets the general Unicode category of the specified character.
| codePoint | the character, including supplementary characters |
|---|
Gets the general Unicode category of the specified character.
| c | the character |
|---|
Returns an integer hash code for the receiver. Any two objects which
answer true when passed to equals must
answer the same value for this method.
Returns whether the specified character is defined in the Unicode specification.
| c | the character |
|---|
Returns whether the specified character is defined in the Unicode specification.
| codePoint | the character, including supplementary characters |
|---|
Returns whether the character is a digit.
| codePoint | the character, including supplementary characters |
|---|
Returns whether the character is a digit.
| c | the character |
|---|
A test for determining if the char is a high
surrogate/leading surrogate unit that's used for representing
supplementary characters in UTF-16 encoding.
| ch | The char unit to test. |
|---|
Returns whether the character is an ISO control character.
| c | the character |
|---|
true if c is an ISO control
character, otherwise false
Returns whether the character is an ISO control character.
| c | the character, including supplementary characters |
|---|
true if c is an ISO control
character, otherwise false
Returns whether the specified character is ignorable in a Java or Unicode identifier.
| codePoint | the character, including supplementary characters |
|---|
Returns whether the specified character is ignorable in a Java or Unicode identifier.
| c | the character |
|---|
Returns whether the character is a valid part of a Unicode identifier as other than the first character.
| codePoint | the character, including supplementary characters |
|---|
Returns whether the character is a valid part of a Unicode identifier as other than the first character.
| c | the character |
|---|
Returns whether the character is a valid start of a Unicode identifier
| c | the character |
|---|
Returns whether the character is a valid start of a Unicode identifier
| codePoint | the character, including supplementary characters |
|---|
This method is deprecated.Use isJavaIdentifierStart(char)
Returns whether the character is a Java letter.
This method is deprecated.Use isJavaIdentifierPart(char)
Returns whether the character is a Java letter or digit character.
Returns whether the character is a letter.
| codePoint | the character, including supplementary characters |
|---|
Returns whether the character is a letter.
| c | the character |
|---|
Returns whether the character is a letter or a digit.
| codePoint | the character, including supplementary characters |
|---|
Returns whether the character is a letter or a digit.
| c | the character |
|---|
A test for determining if the char is a high
surrogate/leading surrogate unit that's used for representing
supplementary characters in UTF-16 encoding.
| ch | The char unit to test. |
|---|
Returns whether the character is a lower case letter.
| c | the character |
|---|
Returns whether the character is a lower case letter.
| codePoint | the character, including supplementary characters |
|---|
Returns whether the specified character is mirrored
| c | the character |
|---|
Returns whether the specified character is mirrored
| codePoint | the character, including supplementary characters |
|---|
Returns whether the character is a Unicode space character. A member of one of the Unicode categories Space Separator, Line Separator, or Paragraph Separator.
| c | the character |
|---|
Returns whether the character is a Unicode space character. A member of one of the Unicode categories Space Separator, Line Separator, or Paragraph Separator.
| codePoint | the character, including supplementary characters |
|---|
A test for determining if the codePoint is within the
supplementary code point range.
| codePoint | The code point to test. |
|---|
A test for determining if the char pair is a valid
surrogate pair.
| high | The high surrogate unit to test. |
|---|---|
| low | The low surrogate unit to test. |
Returns whether the character is a titlecase character.
| codePoint | the character, including supplementary characters |
|---|
Returns whether the character is a titlecase character.
| c | the character |
|---|
Returns whether the character is valid as part of a Unicode identifier as other than the first character.
| c | the character |
|---|
Returns whether the character is valid as part of a Unicode identifier as other than the first character.
| codePoint | the character, including supplementary characters |
|---|
Returns whether the character is a valid initial character for a Unicode identifier.
| codePoint | the character, including supplementary characters |
|---|
Returns whether the character is a valid initial character for a Unicode identifier.
| c | the character |
|---|
Returns whether the character is an upper case letter.
| c | the character |
|---|
Returns whether the character is an upper case letter.
| codePoint | the character, including supplementary characters |
|---|
A test for determining if the codePoint is a valid Unicode
code point.
| codePoint | The code point to test. |
|---|
Returns whether the character is a whitespace character in Java.
| codePoint | the character, including supplementary characters |
|---|
c is a whitespace character
in Java, otherwise false.
Returns whether the character is a whitespace character in Java.
| c | the character |
|---|
c is a whitespace character
in Java, otherwise false.
Determines the index into the char[] that is offset
(measured in code points and specified by codePointOffset),
from the index argument and is within the subsequence as
delineated by start and count.
| seq | The char[] to find the index within. |
|---|---|
| start | The inclusive index that marks the beginning of the subsequence. |
| count | The number of char values to include within
the subsequence. |
| index | The index to begin from, within the char[]. |
| codePointOffset | The number of code points to look back or forwards; may be a negative or positive value. |
codePointOffset code
points from index.| NullPointerException | if seq is null. |
|---|---|
| IndexOutOfBoundsException | if start or
count is negative, start + count
greater than seq.length, index is
less than start, index is greater
than start + count or there aren't enough values
in seq after index or before
index if codePointOffset is
negative. |
Determines the index into the CharSequence that is offset
(measured in code points and specified by codePointOffset),
from the index argument.
| seq | The CharSequence to find the index within. |
|---|---|
| index | The index to begin from, within the
CharSequence. |
| codePointOffset | The number of code points to look back or forwards; may be a negative or positive value. |
codePointOffset code
points from index.| NullPointerException | if seq is null. |
|---|---|
| IndexOutOfBoundsException | if index is negative,
greater than seq.length(), there aren't enough
values in seq after index or before
index if codePointOffset is
negative. |
Reverse the order of the first and second bytes in character
| c | the character |
|---|
Converts the Unicode code point, codePoint, into a UTF-16
encoded sequence that is returned as a char[].
| codePoint | The Unicode code point to encode. |
|---|
char sequence; if code point is
a supplementary code point,
then a 2 char array is returned, otherwise a 1
char array is returned.| IllegalArgumentException | if codePoint is not a
valid Unicode code point. |
|---|
Converts the Unicode code point, codePoint, into a UTF-16
encoded sequence and copies the value(s) into the
char[] dst, starting at the index
dstIndex.
| codePoint | The Unicode code point to encode. |
|---|---|
| dst | The char[] to copy the encoded value into. |
| dstIndex | The index to start copying into dst. |
char value units copied into
dst.| IllegalArgumentException | if codePoint is not a
valid Unicode code point. |
|---|---|
| NullPointerException | if dst is null. |
| IndexOutOfBoundsException | if dstIndex is negative,
greater than or equal to dst.length or equals
dst.length - 1 when codePoint is a
supplementary code point. |
Converts a surrogate pair into a Unicode code point. This method assume that the pair are valid surrogates. If the pair are NOT valid surrogates, then the result is indeterminate. The isSurrogatePair(char, char) method should be used prior to this method to validate the pair.
| high | The high surrogate unit. |
|---|---|
| low | The low surrogate unit. |
Returns the lower case equivalent for the character when the character is an upper case letter, otherwise returns the character.
| codePoint | the character, including supplementary characters |
|---|
Returns the lower case equivalent for the character when the character is an upper case letter, otherwise returns the character.
| c | the character |
|---|
Converts the specified character to its string representation.
| value | the character |
|---|
Returns a string containing a concise, human-readable description of the receiver.
Returns the title case equivalent for the character, otherwise returns the character.
| codePoint | the character |
|---|
Returns the title case equivalent for the character, otherwise returns the character.
| c | the character |
|---|
Returns the upper case equivalent for the character when the character is a lower case letter, otherwise returns the character.
| c | the character |
|---|
Returns the upper case equivalent for the character when the character is a lower case letter, otherwise returns the character.
| codePoint | the character, including supplementary characters |
|---|
Returns a Character instance for the char
value passed. This method is preferred over the constructor, as this
method may maintain a cache of instances.
| c | The char value. |
|---|
Character instance.