MySQL answer: utf8_unicode_ci vs. utf8_general_ci.
Collation controls sorting and comparison behavior. Unicode rationalizes the character set but doesn’t, on its own, rationalize sorting behavior for all the various languages it supports. utf8_general_ci (ci = case insensitive) is apparently a bit faster but sloppier, and is only really appropriate for English-language data sets.
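
To make the difference concrete, here is a minimal sketch based on the German "ß" example from the MySQL manual. It assumes a connection using the utf8 character set; the `people` table and its `name` column are hypothetical names used only for illustration.

```sql
-- Assumption: the connection character set is utf8.
SET NAMES utf8;

-- Under utf8_unicode_ci, the manual documents 'ß' as equal to 'ss'.
SELECT 'ß' = 'ss' COLLATE utf8_unicode_ci AS unicode_eszett_eq_ss;

-- Under utf8_general_ci, the manual documents 'ß' as equal to 's' instead.
SELECT 'ß' = 's' COLLATE utf8_general_ci AS general_eszett_eq_s;

-- Hypothetical table: pick the collation when the table is defined...
CREATE TABLE people (
  name VARCHAR(100)
) CHARACTER SET utf8 COLLATE utf8_unicode_ci;

-- ...or override it for a single query.
SELECT name FROM people ORDER BY name COLLATE utf8_general_ci;
```

Setting the collation on the table or column keeps sorting consistent across queries, while a per-query COLLATE clause is handy for comparing the two behaviors on your own data before committing to one.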