What is UTF-8?
UTF-8 is one of the most widely used character encoding standards.
Earlier character encodings like ASCII were only capable of representing a small set of characters.
Strings are complicated. Different programming languages make different choices on how to present this complexity to the programmer.