|
|
HP-UX Reference Volume 1 of 5 > eeucset(1) |
|
NAMEeucset — sets and gets code widths for ldterm DESCRIPTIONThe eucset command sets or gets (reports) the encoding and display widths of the Extended UNIX Code (EUC) and UCS Transformation Format (UTF8) characters processed by the current input terminal. EUC is an encoding method for codesets composed of single or multiple bytes. It permits applications and the terminal hardware to use the 7-bit US ASCII code and up to three single byte or multibyte code sets simultaneously. The eucset command without any options, first tries to set the codeset to one of the four HP15 codesets. If unsuccessful, 7-bit US ASCII is used as the default codeset. This command must be used to specify any other EUC codesets, whether they are single byte or multibyte. See the WARNINGS section, for special warnings on the values of the cswidth argument. To utilize this command for UTF8, specify UTF8 with the -c option. OptionsThe eucset command recognizes the following options and arguments:
EUC Code Set ClassesEUC divides codesets into four classes. Each codeset has two characteristics: the number of bytes for encoding the characters in the codeset, and the number of display columns to display the characters in the codeset. All characters within a codeset possess the same characteristics.
The cswidth argument in the eucset command line is a character string that describes the character widths for codesets 1 through 3. This command does not allow the user to modify the settings for codeset 0. The character string is of the following format: X1[:Y1],X2[:Y2],X3[:Y3] The value X1 is the number of bytes required to encode a character in codeset class 1. Y1 is the number of display columns needed to display characters in this class. X2 is the number of bytes required to encode a character in codeset 2, not counting the SS2 byte, and Y2 is the number of display columns for codeset 2 characters. X3 is the number of bytes needed to encode characters in codeset 3, not counting the SS3 byte, and Y3 is the number of display columns required for these characters. The values for the column widths may be omitted if they are equal to the number of encoding bytes. If the encoding value of any of the EUC codesets is set to 0 (zero), this indicates that the codeset does not exist. See the WARNINGS section for special warnings on the values of the cswidth argument. If no cswidth argument is supplied, the eucset command uses the value of the CSWIDTH environment variable. If this variable is not present, the following default string is substituted: 1:1,0:0,0:0 This default string designates that the environment uses a single byte EUC codeset that has characters in the EUC codeset 1 format. If the environment uses a multibyte EUC codeset in the codeset 1 format, single byte or multibyte EUC codesets in the codeset 2 or 3 format, or both, the default setting cannot be used. EXTERNAL INFLUENCESEnvironment Variables
EXAMPLESTo display the encoding and display widths for the EUC codesets 1 to 3 in your environment, enter: eucset -p Assuming eucset has been previously used to set for ja_JP.eucJP, the entry generates the following: cswidth 2:2,1:1,2:2 To change the current settings of the encoding and display widths for the EUC characters in codesets 1 and 2 to two bytes each, enter one of the following:
To set the encoding and display widths for the EUC characters in the locale ja_JP.eucJP, enter: eucset 2:2,1:1,2:2 For zh_TW.eucTW, enter: eucset 2:2,3:2 For ko_KR.eucKR, enter: eucset 2:2 To set the codewidth to that of UTF8, enter: eucset -c UTF8 |
|