Otherwise, all input is assumed to be ASCII by default, which causes UTF-8 characters to be replaced by question marks. "C.UTF-8" the C locale with the UTF-8 codeset.