File Content: Character Set Table

Structure

[Figure: cgt-char-set.gif]

Each record describing a member in the Character Set Table is preceded by a byte field containing the value 67 - the ASCII value of "C". This table is used by the DFA State Table to store the valid characters for each edge in the DFA state machine. The file will contain one of these records for each character set used in the table. The Table Count record, which precedes any character set records, will contain the total number of entries.

Fields

Name Type Description
Index Integer The index of this character set in the Character Set Table. The character set should be stored in the table at this index.
Characters String The characters that make up the set, sorted by their Unicode code points. Because the string is sorted, a binary search can be used to check whether a character exists in the set.
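
The two fields above can be used roughly as follows. This is a minimal sketch in Python, not part of the file format itself: it assumes the record has already been decoded into a tuple of (identifying byte, Index, Characters), and the names store_char_set and contains are hypothetical helpers introduced only for illustration.

```python
from bisect import bisect_left

CHARSET_RECORD_ID = 67  # ASCII value of "C", as described under Structure


def store_char_set(table, record):
    """Store one already-decoded record into the table at its Index.

    `record` is assumed to be (id_byte, index, characters); decoding the
    raw bytes of the file is outside the scope of this sketch.
    """
    id_byte, index, characters = record
    if id_byte != CHARSET_RECORD_ID:
        raise ValueError(f"not a character set record: {id_byte}")
    table[index] = characters


def contains(char_set, ch):
    """Binary search for `ch` in a string sorted by code point."""
    pos = bisect_left(char_set, ch)
    return pos < len(char_set) and char_set[pos] == ch


# The table size would come from the Table Count record; 2 is made up here.
table = [None] * 2
store_char_set(table, (67, 1, "0123456789"))
store_char_set(table, (67, 0, "ABCabc"))
is_digit = contains(table[1], "7")
```

Records may be stored in any order in this sketch because each one carries its own index. The binary search relies on Python comparing single-character strings by code point, which matches the sort order stated for the Characters field.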