Next: Ctl-Char Syntax, Previous: Basic Char Syntax, Up: Character Type [Contents][Index]
In addition to the specific escape sequences for special important control characters, Emacs provides several types of escape syntax that you can use to specify non-ASCII text characters.
Firstly, you can specify characters by their Unicode values.
?\unnnn
represents a character with Unicode code point
‘U+nnnn’, where nnnn is (by convention) a hexadecimal
number with exactly four digits. The backslash indicates that the
subsequent characters form an escape sequence, and the ‘u’
specifies a Unicode escape sequence.
There is a slightly different syntax for specifying Unicode
characters with code points higher than U+ffff
:
?\U00nnnnnn
represents the character with code point
‘U+nnnnnn’, where nnnnnn is a six-digit hexadecimal
number. The Unicode Standard only defines code points up to
‘U+10ffff’, so if you specify a code point higher than
that, Emacs signals an error.
Secondly, you can specify characters by their hexadecimal character
codes. A hexadecimal escape sequence consists of a backslash,
‘x’, and the hexadecimal character code. Thus, ‘?\x41’ is
the character A, ‘?\x1’ is the character C-a, and
?\xe0
is the character
‘a’ with grave accent.
You can use any number of hex digits, so you can represent any
character code in this way.
Thirdly, you can specify characters by their character code in
octal. An octal escape sequence consists of a backslash followed by
up to three octal digits; thus, ‘?\101’ for the character
A, ‘?\001’ for the character C-a, and ?\002
for the character C-b. Only characters up to octal code 777 can
be specified this way.
These escape sequences may also be used in strings. See Non-ASCII in Strings.
Next: Ctl-Char Syntax, Previous: Basic Char Syntax, Up: Character Type [Contents][Index]