Linux Exercise: Localization and Internationalization

In your homedirectory create an empty file "nowfile" using touch. Check using date and ls -l that this file was indeed created now.
- # touch nowfile
- # date
- # ls -l nowfile

Make a list of the Locales that are currently installed on your system.
- # locale -a

Check if the Dutch ("nl_NL") locale is on your system. If it is not, install the appropriate language pack.
- # locale -a | grep nl_NL
- If not present (typically on a CentOS 8 "minimal" installation):
  # yum list langpacks-*
  # yum install langpacks-nl

Look at the current Locale. Run the date command. What is the result?
- # locale
- # date

Switch the Locale to Dutch. Again look at the date command. What is the result?
- # export LC_ALL="nl_NL"
- # date

Perform a cat of a non-existent file. In what language is the error message?
- # cat qwerty

Check the character encoding in your local terminal emulation program (e.g. PuTTy). Make sure it uses UTF-8.
- For PuTTy: Change Settings: Window; Transation. In the top pull-down menu you can choose the encoding. Select UTF-8. If you want to, you can save this configuration so that it is active the next time you use this PuTTy profile.

Verify that the hexdump command is installed. If it is not, install it. (It is normally part of the util-linux RPM.
- # which hexdump
- If hexdump is not installed:
  # yum -y install util-linux

Create a testfile "hello.ascii" with the word "Hello" in it.
- # echo "Hello" > hello.ascii

Look at the file using the hexdump -C command.
- # hexdump -C hello.ascii
  The file has a length of six bytes: Five letters (48 65 6C 6C 6F) plus the linefeed character (0A).

Check whether the hexedit command is installed. If it is not, install it. This is usually a standalone package.
- # which hexedit
- If it is not installed:
  # yum -y install hexedit

Create a file hello2.iso with contents "Hello". Use the hexedit command to edit this file: You are going to replace the second character ("e", or ASCII character 65), with the letter "é" (ISO-8859-1 character E9). So after editing file should contain these six bytes: 48 E9 6C 6C 6F 0A.
- # echo "Hello" > hello2.iso
- # hexedit hello2.iso
  In hexedit you type the hexadecimal code of the bytes you will want in your file. The left column is the position in the file, the middle column is the hexadecimal representation, and the right column shows the ASCII representation - if available. Use Ctrl-X to save the file and exit hexedit.

Try to convert the file to ASCII. Does this work?
- # iconv -f iso_8859-1 -t ascii hello2.iso -o hello2.ascii
  This command will fail, because there is no ASCII representation of the character é.

(Optional) Repeat the previous steps, but this time use the Euro-symbol (€). The Euro-symbol has Unicode code point U+20AC and is UTF-8 encoded as E2 82 AC (three bytes).
The Euro symbol is not part of ISO-8859 codepage 1, so this conversion will go wrong. But it is part of the ISO-8859 codepage 15, where it has hexadecimal value A4. To view the symbol correctly if you view the euro.iso file with cat, you will have to set your SSH client to use ISO-8859-15. If you don't do this, then the symbol will be shown as the universal currency sign, which looks like a square with wings. (This is character A4 in ISO-8859-1, and has Unicode code point U+00A4.)

End of exercise