Extensions and enhancements to java.text to support unicode transforms, UnicodeSet, surrogate char utilities, UCA collation, normalization, break iteration (rule and dictionary based), enhanced number format, international string searching, and arabic shaping.
See: Description
Interface Summary | |
---|---|
Replaceable |
Replaceable is an interface representing a
string of characters that supports the replacement of a range of
itself with a new string of characters. |
SymbolTable | An interface that defines both lookup protocol and parsing of symbolic names. |
Transliterator.Factory | The factory interface for transliterators. |
UForwardCharacterIterator | Interface that defines an API for forward-only iteration on text objects. |
UnicodeMatcher |
UnicodeMatcher defines a protocol for objects that can
match a range of characters in a Replaceable string. |
Class Summary | |
---|---|
ArabicShaping | Shape Arabic text on a character basis. |
ArabicShapingException | Thrown by ArabicShaping when there is a shaping error. |
BreakDictionary | This is the class that represents the list of known words used by DictionaryBasedBreakIterator. |
BreakIterator | A class that locates boundaries in text. |
CanonicalIterator | This class allows one to iterate through all the strings that are canonically equivalent to a given string. |
CharsetDetector |
CharsetDetector provides a facility for detecting the
charset or encoding of character data in an unknown format.
|
CharsetMatch | This class represents a charset that has been identified by a CharsetDetector as a possible encoding for a set of input data. |
ChineseDateFormat | A concrete DateFormat for ChineseCalendar. |
ChineseDateFormatSymbols | A subclass of DateFormatSymbols for ChineseDateFormat. |
CollationElementIterator |
|
CollationKey |
A |
CollationKey.BoundMode | Options that used in the API CollationKey.getBound() for getting a CollationKey based on the bound mode requested. |
Collator |
Collator performs locale-sensitive string comparison. |
Collator.CollatorFactory | A factory used with registerFactory to register multiple collators and provide display names for them. |
ComposedCharIter | ComposedCharIter is an iterator class that returns all of the precomposed characters defined in the Unicode standard, along with their decomposed forms. |
DateFormat | DateFormat is an abstract class for date/time formatting subclasses which formats and parses dates or time in a language-independent manner. |
DateFormatSymbols |
DateFormatSymbols is a public class for encapsulating
localizable date-time formatting data, such as the names of the
months, the names of the days of the week, and the time zone data.
|
DateTimePatternGenerator | This class provides flexible generation of date format patterns, like "yy-MM-dd". |
DateTimePatternGenerator.FormatParser | Class providing date formatting |
DateTimePatternGenerator.PatternInfo | PatternInfo supplies output parameters for add(...). |
DateTimePatternGenerator.VariableField | Utility class for FormatParser. |
DecimalFormat |
DecimalFormat is a concrete subclass of
NumberFormat that formats decimal numbers. |
DecimalFormatSymbols |
This class represents the set of symbols (such as the decimal separator, the
grouping separator, and so on) needed by DecimalFormat to format
numbers. |
DictionaryBasedBreakIterator | A subclass of RuleBasedBreakIterator that adds the ability to use a dictionary to further subdivide ranges of text beyond what is possible using just the state-table-based algorithm. |
IDNA | IDNA API implements the IDNA protocol as defined in the IDNA RFC. |
MeasureFormat | A formatter for Measure objects. |
MessageFormat |
MessageFormat provides a means to produce concatenated
messages in language-neutral way. |
Normalizer |
Unicode Normalization
Unicode normalization APInormalize transforms Unicode text into an equivalent composed or
decomposed form, allowing for easier sorting and searching of text.
|
Normalizer.Mode | Constants for normalization modes. |
Normalizer.QuickCheckResult | Result values for quickCheck(). |
NumberFormat |
NumberFormat is the abstract base class for all number
formats. |
NumberFormat.Field | [Spark/CDL] The instances of this inner class are used as attribute keys and values in AttributedCharacterIterator that NumberFormat.formatToCharacterIterator() method returns. |
NumberFormat.NumberFormatFactory | A NumberFormatFactory is used to register new number formats. |
NumberFormat.SimpleNumberFormatFactory | A NumberFormatFactory that supports a single locale. |
RawCollationKey |
Simple class wrapper to store the internal byte representation of a CollationKey. |
ReplaceableString |
ReplaceableString is an adapter class that implements the
Replaceable API around an ordinary StringBuffer .
|
RuleBasedBreakIterator | Rule Based Break Iterator This is a port of the C++ class RuleBasedBreakIterator from ICU4C. |
RuleBasedCollator |
RuleBasedCollator is a concrete subclass of Collator. |
RuleBasedNumberFormat |
A class that formats numbers according to a set of rules. |
RuleBasedTransliterator |
RuleBasedTransliterator is a transliterator
that reads a set of rules in order to determine how to perform
translations. |
SearchIterator |
SearchIterator is an abstract base class that defines a protocol for text searching. |
SimpleDateFormat |
SimpleDateFormat is a concrete class for formatting and
parsing dates in a locale-sensitive manner. |
StringCharacterIterator |
StringCharacterIterator implements the
CharacterIterater protocol for a String .
|
StringPrep | StringPrep API implements the StingPrep framework as described by RFC 3454. |
StringPrepParseException | Exception that signals an error has occurred while parsing the input to StringPrep or IDNA. |
StringSearch |
|
Transliterator |
Transliterator is an abstract class that
transliterates text from one format to another. |
Transliterator.Position | Position structure for incremental transliteration. |
UCharacterIterator | Abstract class that defines an API for iteration on text objects.This is an interface for forward and backward iteration and random access into a text object. |
UFormat | An abstract class that extends java.text.Format to provide additional ICU protocol, specifically, the getLocale() API. |
UnicodeCompressor | A compression engine implementing the Standard Compression Scheme for Unicode (SCSU) as outlined in Unicode Technical Report #6. |
UnicodeDecompressor | A decompression engine implementing the Standard Compression Scheme for Unicode (SCSU) as outlined in Unicode Technical Report #6. |
UnicodeFilter |
UnicodeFilter defines a protocol for selecting a
subset of the full range (U+0000 to U+FFFF) of Unicode characters.
|
UnicodeSet | A mutable set of Unicode characters and multicharacter strings. |
UnicodeSetIterator | UnicodeSetIterator iterates over the contents of a UnicodeSet. |
UnicodeSet.XSymbolTable | Internal class for customizing UnicodeSet parsing of properties. |
UTF16 |
Standalone utility class providing UTF16 character conversions and indexing conversions. Code that uses strings alone rarely need modification. |
UTF16.StringComparator |
UTF16 string comparator class. |