...

/

Double-quoted Strings Are Binaries

Double-quoted Strings Are Binaries

Learn about double-quoted strings and different Elixir libraries.

We'll cover the following...

Introduction

Unlike single-quoted strings, the contents of a double-quoted string (dqs) are stored as a consecutive sequence of bytes in UTF-8 encoding. Clearly, this is more efficient in terms of memory and certain forms of access, but it does have two implications.

  • First, because UTF-8 characters can take more than a single byte to represent, the size of the binary is not necessarily the length of the string.
     iex> dqs = "∂x/∂y" 
     "∂x/∂y"
     iex> String.length dqs 
     5
     iex> byte_size dqs
     9
     iex> String.at(dqs, 0) 
     "∂"
     iex> String.codepoints(dqs) 
     ["∂", "x", "/", "∂", "y"] 
     iex> String.split(dqs, "/") 
     ["∂x", "∂y"]
    
  • Second, because we’re no longer using lists, we need to learn and work with the binary syntax alongside the list syntax in your code.

Strings and Elixir libraries

When Elixir library documentation uses the word “string” (and most of the time it uses the word “binary”), it means double-quoted strings. The String module defines functions that work with double-quoted strings. Let’s cover some of them below with examples.

  • at(str, offset) returns the grapheme at the given offset ...