Contents   Index   Search   Previous   Next


2.1 Character Set

1
   The only characters allowed outside of comments are the graphic_characters and format_effectors.

Syntax

2
character ::= graphic_character | format_effector | other_control_function
3
graphic_character ::= identifier_letter | digit | space_character | special_character

Static Semantics

4
   The character repertoire for the text of an Ada program consists of the collection of characters called the Basic Multilingual Plane (BMP) of the ISO 10646 Universal Multiple-Octet Coded Character Set, plus a set of format_effectors and, in comments only, a set of other_control_functions; the coded representation for these characters is implementation defined (it need not be a representation defined within ISO-10646-1).
5
   The description of the language definition in this International Standard uses the graphic symbols defined for Row 00: Basic Latin and Row 00: Latin-1 Supplement of the ISO 10646 BMP; these correspond to the graphic symbols of ISO 8859-1 (Latin-1); no graphic symbols are used in this International Standard for characters outside of Row 00 of the BMP. The actual set of graphic symbols used by an implementation for the visual representation of the text of an Ada program is not specified.
6
   The categories of characters are defined as follows:
7
   identifier_letter
upper_case_identifier_letter | lower_case_identifier_letter
8
   upper_case_identifier_letter
Any character of Row 00 of ISO 10646 BMP whose name begins ``Latin Capital Letter''.
9
   lower_case_identifier_letter
Any character of Row 00 of ISO 10646 BMP whose name begins ``Latin Small Letter''.
10
    digit
One of the characters 0, 1, 2, 3, 4, 5, 6, 7, 8, or 9.
11
    space_character
The character of ISO 10646 BMP named ``Space''.
12
    special_character
Any character of the ISO 10646 BMP that is not reserved for a control function, and is not the space_character, an identifier_letter, or a digit.
13
    format_effector
The control functions of ISO 6429 called character tabulation (HT), line tabulation (VT), carriage return (CR), line feed (LF), and form feed (FF).
14
    other_control_function
Any control function, other than a format_effector, that is allowed in a comment; the set of other_control_functions allowed in comments is implementation defined.
15
    The following names are used when referring to certain special_characters:
      symbolname      symbolname
    
         "quotation mark         :colon
         #number sign         ;semicolon
         &ampersand         <less-than sign
         'apostrophe, tick         =equals sign
         (left parenthesis         >greater-than sign
         )right parenthesis         _low line, underline
         *asterisk, multiply         |vertical line
         +plus sign         [left square bracket
         ,comma         ]right square bracket
         -hyphen-minus, minus         {left curly bracket
         .full stop, dot, point         } right curly bracket
         / solidus, divide   

Implementation Permissions

16
    In a nonstandard mode, the implementation may support a different character repertoire; in particular, the set of characters that are considered identifier_letters can be extended or changed to conform to local conventions.
NOTES
17
1  Every code position of ISO 10646 BMP that is not reserved for a control function is defined to be a graphic_character by this International Standard. This includes all code positions other than 0000 - 001F, 007F - 009F, and FFFE - FFFF.
18
2  The language does not specify the source representation of programs.

Contents   Index   Search   Previous   Next   Legal