Exploring and Mastering the Full Go Featureset

Unicode and UTF-8 strings

Learn about strings encodings in Go.

We'll cover the following...

Unicode and UTF-8
String encoding in Go
Rune type
String length
String index operator vs. for … range

Unicode and UTF-8

Unicode and UTF-8 are hairy subjects.

Let’s have a quick recap of Unicode and UTF-8:

Unicode is an international encoding standard for use with different languages and scripts, by which each letter, digit, or symbol is assigned a unique numeric value that applies across different platforms and programs. Essentially it’s a big table of “code points”. It contains most (but not all) of the characters of all languages. Each code point is an index in that table which you can sometimes see specified with the U+ notation such as U+0041 for letter A.
Usually code point means a character, for instance, the

...