
Character Set / Collation

Character Sets, Collations, Unicode, Encoding

utf8

  • Deprecated alias for utf8mb3 (which is itself deprecated)
  • Does not include supplementary characters, which need 4 bytes in UTF-8 (emoji, for example)
  • Maximum of 3 bytes per character

utf8mb4

Ref: The utf8mb4 Character Set

  • Recommended
  • Includes all utf8 (utf8mb3) characters
  • Also includes supplementary characters, which need 4 bytes
  • Maximum of 4 bytes per character
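
The byte limits above can be checked outside the database. The following is a minimal Python sketch, not MySQL code: the helper names `utf8_len` and `fits_utf8mb3` are made up for illustration, and it only measures how many bytes each character takes in UTF-8.

```python
# Hypothetical helpers for illustration; they measure UTF-8 byte lengths only.

def utf8_len(ch: str) -> int:
    """Number of bytes the character occupies when encoded as UTF-8."""
    return len(ch.encode("utf-8"))


def fits_utf8mb3(text: str) -> bool:
    """True if every character needs at most 3 UTF-8 bytes."""
    return all(utf8_len(ch) <= 3 for ch in text)


if __name__ == "__main__":
    # One sample each for 1, 2, 3 and 4 bytes.
    for ch in ["A", "Ä", "€", "😀"]:
        print(repr(ch), utf8_len(ch), "fits utf8mb3:", fits_utf8mb3(ch))
```

A string for which `fits_utf8mb3` returns False contains at least one 4-byte character, which is the case that calls for utf8mb4.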

_general_ci

Ref: Unicode Character Sets

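  • Less correct: compares one character against one character only, so it has no expansions
  • Faster than _unicode_ci
  • Treats the following as equal:
Ä = A
Ö = O
Ü = U
ß = s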

_unicode_ci

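  • More correct: based on the Unicode Collation Algorithm and supports expansions, where one character compares equal to a sequence of characters
  • Slower than _general_ci
  • Treats the following as equal:
Ä = A
Ö = O
Ü = U
ß = ss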
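
The difference in how ß is handled can be imitated with a toy Python sketch. This is not MySQL's collation algorithm: `general_ci_key` and `unicode_ci_key` are hypothetical functions that reproduce only the equalities listed for the two collation families, on the assumption that accents are ignored and that ß maps to `s` in one case and `ss` in the other.

```python
import unicodedata


def strip_accents(text: str) -> str:
    """Decompose characters (NFD) and drop combining marks, so 'Ä' becomes 'A'."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(c for c in decomposed if not unicodedata.combining(c))


def general_ci_key(text: str) -> str:
    """Toy stand-in for _general_ci: one-to-one mapping, ß -> s."""
    return strip_accents(text.replace("ß", "s")).upper()


def unicode_ci_key(text: str) -> str:
    """Toy stand-in for _unicode_ci: expansion allowed, ß -> ss."""
    return strip_accents(text.replace("ß", "ss")).upper()


if __name__ == "__main__":
    for a, b in [("Ä", "A"), ("ß", "s"), ("ß", "ss"), ("Straße", "Strasse")]:
        print(a, b,
              "general:", general_ci_key(a) == general_ci_key(b),
              "unicode:", unicode_ci_key(a) == unicode_ci_key(b))
```

In this sketch "Straße" and "Strasse" compare equal only under the `unicode_ci_key` stand-in, which is the practical consequence of ß = ss versus ß = s.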