在 php 中将 UTF-16LE 转换为 UTF-8
我使用 iconv
php 函数,但有些字符不能正确转换:
I use iconv
php function but some characters doesn't convert correctly:
...
$s = iconv('UTF-16', 'UTF-8', $s);
...
$s = iconv('UTF-16//IGNORE', 'UTF-8', $s);
...
$s = iconv('UTF-16LE', 'UTF-8', $s);
...
$s = iconv('UTF-16LE//IGNORE', 'UTF-8', $s);
...
我也尝试了 mb_convert_encoding
函数,但无法解决我的问题.
I also try mb_convert_encoding
function but can't solve my problem.
示例文本文件:9px.ir/utf8-16LE.rar
A sample text file: 9px.ir/utf8-16LE.rar
推荐答案
iconv
支持UTF-16LE
编码.
您可以使用它来将编码从 UTF-16LE
转置为 UTF-8
:
You can use it to transpose the encoding from UTF-16LE
to UTF-8
:
$result = iconv($in_charset = 'UTF-16LE' , $out_charset = 'UTF-8' , $str);
if (false === $result)
{
throw new Exception('Input string could not be converted.');
}
参见iconv
文档.
我只是想知道 UTF 中的所有代码点是否可用-16LE
在 UTF-8
中可用.但我认为这应该适合您的情况.
I'm just wondering if all code-points available in UTF-16LE
are available in UTF-8
. But I assume that this should fit in your case.
我无法在自己的盒子上重现该问题,但在 另一个框我遇到了这个通知:
I was not able to reproduce the problem on a box of my own, but on another box I ran into this notice:
注意:iconv() [function.iconv]: 字符集错误,从 UTF-16LE' 到
UTF-8' 的转换是不允许的 ...
Notice: iconv() [function.iconv]: Wrong charset, conversion from
UTF-16LE' to
UTF-8' is not allowed in ...
看起来并不是所有的 iconv
版本都能真正将 UTF-16LE
转换为 UTF-8
.
Looks like that not all iconv
versions can actually convert UTF-16LE
to UTF-8
.
使用 mb_convert_encoding
Docs 相反,至少在这种情况下是这样 (演示):
It might be a workaround to use mb_convert_encoding
Docs instead, at least it was in this case (Demo):
$result = mb_convert_encoding($str , 'UTF-8' , 'UTF-16LE');
相关文章