UnicodeWarning

exception UnicodeWarning Base class for warnings related to Unicode.

UnicodeError.object

object The object the codec was attempting to encode or decode.

UnicodeError.end

end The index after the last invalid data in object.

UnicodeTranslateError

exception UnicodeTranslateError Raised when a Unicode-related error occurs during translating. It is a subclass of UnicodeError.
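As a minimal sketch of the subclass relationship, the snippet below builds a UnicodeTranslateError by hand and catches it through the UnicodeError base class. The input string and the reason text are made up for illustration; real instances are raised by translation machinery, not constructed like this.

```python
# Construct the exception directly: (object, start, end, reason).
# The values here are illustrative, not produced by a real codec.
err = UnicodeTranslateError("abc", 1, 2, "illustrative reason")

try:
    raise err
except UnicodeError as caught:  # the base class catches the subclass
    caught_type = type(caught).__name__
    rejected = caught.object[caught.start:caught.end]

print(caught_type, repr(rejected))
print(issubclass(UnicodeTranslateError, UnicodeError))
```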

UnicodeError.reason

reason A string describing the specific codec error.

UnicodeError.encoding

encoding The name of the encoding that raised the error.

UnicodeError.start

start The first index of invalid data in object.

unicodedata.ucd_3_2_0

unicodedata.ucd_3_2_0 This is an object that has the same methods as the entire module, but uses the Unicode database version 3.2 instead, for applications that require this specific version of the Unicode database (such as IDNA).
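A short sketch of how the two databases can differ, assuming a Python 3 interpreter whose default database is newer than Unicode 6.1. The sample character (U+1F600 GRINNING FACE) is chosen here only because it was added well after version 3.2.

```python
import unicodedata

legacy = unicodedata.ucd_3_2_0  # same methods, Unicode 3.2 data
face = "\U0001F600"             # added to Unicode after version 3.2

print(legacy.unidata_version)        # version of the frozen database
print(unicodedata.unidata_version)   # version of the default database

# Passing a default avoids ValueError for characters with no name.
current_name = unicodedata.name(face, None)
legacy_name = legacy.name(face, None)
legacy_category = legacy.category(face)

print(current_name, legacy_name, legacy_category)
```

The character has a name in the current database but is unassigned in the 3.2 database, so the legacy lookup falls back to the supplied default.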

UnicodeError

exception UnicodeError Raised when a Unicode-related encoding or decoding error occurs. It is a subclass of ValueError. UnicodeError has attributes that describe the encoding or decoding error. For example, err.object[err.start:err.end] gives the particular invalid input that the codec failed on.

encoding The name of the encoding that raised the error.

reason A string describing the specific codec error.

object The object the codec was attempting to encode or decode.

start
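The attributes described above can be inspected as in the following sketch. The helper name invalid_slice and the sample inputs are illustrative assumptions, not part of the library.

```python
def invalid_slice(err):
    """Return the part of err.object that the codec rejected."""
    return err.object[err.start:err.end]

# Decoding: 0xff is never a valid byte in UTF-8.
try:
    b"caf\xff!".decode("utf-8")
except UnicodeDecodeError as err:
    decode_info = (err.encoding, err.start, err.end, invalid_slice(err))
    decode_reason = err.reason

# Encoding: U+00EF is outside the ASCII range.
try:
    "na\u00efve".encode("ascii")
except UnicodeEncodeError as err:
    encode_info = (err.encoding, err.start, err.end, invalid_slice(err))
    encode_reason = err.reason

print(decode_info, decode_reason)
print(encode_info, encode_reason)
```

For a decoding error, object is a bytes object, so the slice is bytes. For an encoding error, object is a str, so the slice is a str.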

unicodedata.normalize()

unicodedata.normalize(form, unistr) Return the normal form specified by form for the Unicode string unistr. Valid values for form are 'NFC', 'NFKC', 'NFD', and 'NFKD'. The Unicode standard defines various normalization forms of a Unicode string, based on the definitions of canonical equivalence and compatibility equivalence. In Unicode, several characters can be expressed in various ways. For example, the character U+00C7 (LATIN CAPITAL LETTER C WITH CEDILLA) can also be expressed as the sequence U+0043 (L
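A brief sketch of the U+00C7 example using the four form names listed above. The ligature U+FB01 is an additional sample chosen here to illustrate compatibility equivalence; it does not come from the text above.

```python
import unicodedata

composed = "\u00c7"      # single code point: C WITH CEDILLA
decomposed = "C\u0327"   # "C" followed by COMBINING CEDILLA

# The two spellings render alike but compare unequal as raw strings.
raw_equal = composed == decomposed

# NFC composes; NFD decomposes (canonical equivalence).
nfc = unicodedata.normalize("NFC", decomposed)
nfd = unicodedata.normalize("NFD", composed)

# NFKC also applies compatibility mappings; NFC leaves the ligature alone.
ligature = "\ufb01"      # LATIN SMALL LIGATURE FI
nfkc = unicodedata.normalize("NFKC", ligature)
nfc_ligature = unicodedata.normalize("NFC", ligature)

print(raw_equal, nfc == composed, nfd == decomposed, nfkc)
```

Normalizing both operands to the same form before comparing is one way to treat equivalent spellings as equal.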