|
|
|
Re: utf8_encode(utf8_decode(string)) doesn't return same string [message #180483 is a reply to message #180482] |
Thu, 21 February 2013 13:39   |
Giacomo M
Messages: 15 Registered: February 2013
Karma: 0
|
Junior Member |
add to buddy list ignore all messages by this user
|
|
Il 21/02/2013 16:58, The Natural Philosopher ha scritto:
> "Please note that utf8_decode simply converts a string encoded in UTF-8
> to ISO-8859-1. A more appropriate name for it would be utf8_to_iso88591.
> If your text is already encoded in ISO-8859-1, you do not need this
> function. If you don't want to use ISO-8859-1, you do not need this
> function.
>
> Note that UTF-8 can represent many more characters than ISO-8859-1.
> Trying to convert a UTF-8 string that contains characters that can't be
> represented in ISO-8859-1 to ISO-8859-1 will garble your text and/or
> cause characters to go missing. Trying to convert text that is not
> encoded in UTF-8 using this function will most likely garble the text."
Not my case. I first encode an ISO string in UTF, after i decode the
same string. Why I get differet strings?
Thanks.
--
Giacomo
|
|
|
Re: utf8_encode(utf8_decode(string)) doesn't return same string [message #180484 is a reply to message #180483] |
Thu, 21 February 2013 14:08   |
|
In article <kg5plp$jk2$1(at)speranza(dot)aioe(dot)org>,
Giacomo M <kylnas(at)tiscali(dot)it> wrote:
> Il 21/02/2013 16:58, The Natural Philosopher ha scritto:
>> "Please note that utf8_decode simply converts a string encoded in UTF-8
>> to ISO-8859-1. A more appropriate name for it would be utf8_to_iso88591.
>> If your text is already encoded in ISO-8859-1, you do not need this
>> function. If you don't want to use ISO-8859-1, you do not need this
>> function.
>>
>> Note that UTF-8 can represent many more characters than ISO-8859-1.
>> Trying to convert a UTF-8 string that contains characters that can't be
>> represented in ISO-8859-1 to ISO-8859-1 will garble your text and/or
>> cause characters to go missing. Trying to convert text that is not
>> encoded in UTF-8 using this function will most likely garble the text."
>
> Not my case. I first encode an ISO string in UTF, after i decode the
> same string. Why I get differet strings?
Your title doesn't say that. It says you decode first, then encode.
--
Tim
"That excessive bail ought not to be required, nor excessive fines imposed,
nor cruel and unusual punishments inflicted" -- Bill of Rights 1689
|
|
|
Re: utf8_encode(utf8_decode(string)) doesn't return same string [message #180485 is a reply to message #180484] |
Thu, 21 February 2013 14:13   |
Giacomo M
Messages: 15 Registered: February 2013
Karma: 0
|
Junior Member |
add to buddy list ignore all messages by this user
|
|
Il 21/02/2013 20:08, Tim Streater ha scritto:
> In article <kg5plp$jk2$1(at)speranza(dot)aioe(dot)org>,
> Giacomo M <kylnas(at)tiscali(dot)it> wrote:
>
>> Il 21/02/2013 16:58, The Natural Philosopher ha scritto:
>>> "Please note that utf8_decode simply converts a string encoded in UTF-8
>>> to ISO-8859-1. A more appropriate name for it would be
>> utf8_to_iso88591.
>>> If your text is already encoded in ISO-8859-1, you do not need this
>>> function. If you don't want to use ISO-8859-1, you do not need this
>>> function.
>>>
>>> Note that UTF-8 can represent many more characters than ISO-8859-1.
>>> Trying to convert a UTF-8 string that contains characters that can't be
>>> represented in ISO-8859-1 to ISO-8859-1 will garble your text and/or
>>> cause characters to go missing. Trying to convert text that is not
>>> encoded in UTF-8 using this function will most likely garble the text."
>>
>> Not my case. I first encode an ISO string in UTF, after i decode the
>> same string. Why I get differet strings?
>
> Your title doesn't say that. It says you decode first, then encode.
Is it not the same?
I have an ISO-8859-1 xml file that I have to load with simplexml, so I
first decode it. After that, I want to encode to return to ISO-8859-1.
But I get different string.
--
Giacomo
|
|
|
Re: utf8_encode(utf8_decode(string)) doesn't return same string [message #180490 is a reply to message #180485] |
Thu, 21 February 2013 17:29  |
|
In article <kg5rke$pp4$1(at)speranza(dot)aioe(dot)org>,
Giacomo M <kylnas(at)tiscali(dot)it> wrote:
> Il 21/02/2013 20:08, Tim Streater ha scritto:
>> In article <kg5plp$jk2$1(at)speranza(dot)aioe(dot)org>,
>> Giacomo M <kylnas(at)tiscali(dot)it> wrote:
>>
>>> Il 21/02/2013 16:58, The Natural Philosopher ha scritto:
>>>> "Please note that utf8_decode simply converts a string encoded in UTF-8
>>>> to ISO-8859-1. A more appropriate name for it would be
>>> utf8_to_iso88591.
>>>> If your text is already encoded in ISO-8859-1, you do not need this
>>>> function. If you don't want to use ISO-8859-1, you do not need this
>>>> function.
>>>>
>>>> Note that UTF-8 can represent many more characters than ISO-8859-1.
>>>> Trying to convert a UTF-8 string that contains characters that can't be
>>>> represented in ISO-8859-1 to ISO-8859-1 will garble your text and/or
>>>> cause characters to go missing. Trying to convert text that is not
>>>> encoded in UTF-8 using this function will most likely garble the text."
>>>
>>> Not my case. I first encode an ISO string in UTF, after i decode the
>>> same string. Why I get differet strings?
>>
>> Your title doesn't say that. It says you decode first, then encode.
>
> Is it not the same?
>
> I have an ISO-8859-1 xml file that I have to load with simplexml, so I
> first decode it. After that, I want to encode to return to ISO-8859-1.
> But I get different string.
If you want to do ISO -> UTF8 -> ISO then I would expect to do:
$iso = utf8_decode (utf8_encode ($iso));
according to the documentation, anyway.
--
Tim
"That excessive bail ought not to be required, nor excessive fines imposed,
nor cruel and unusual punishments inflicted" -- Bill of Rights 1689
|
|
|