FUDforum
Fast Uncompromising Discussions. FUDforum will get your users talking.

maillist import from html to utf-8 incorrect [message #158521] Sun, 01 March 2009 12:19
Peter Vendike (Denmark)
Messages: 65
Registered: February 2009
Location: Denmark
Karma: 0
Member
Translator
This problem affects maillist.php in at least versions 1.73 to 1.77.

I am looking at the Danish special letters (æ, ø, å).

My messages are saved in UTF-8 format.
Importing quoted-printable text/plain messages to UTF-8 works fine.

But in text/html messages, the quoted-printable characters are converted to 8-bit symbols (ISO-8859, I think).

After stripping the HTML tags, shouldn't maillist.php use the same (new?) procedure as for text/plain?


Re: maillist import from html to utf-8 incorrect [message #158522 is a reply to message #158521] Sun, 01 March 2009 12:46
naudefj (South Africa)
Messages: 3624
Registered: December 2004
Karma: 17
Senior Member
Administrator
Core Developer
Not sure I understand. Here's the difference between 1.73 and 1.75.

Your theme's charset and database charset may also play a role.
Re: maillist import from html to utf-8 incorrect [message #158525 is a reply to message #158522] Sun, 01 March 2009 14:22
Peter Vendike (Denmark)
OK, it's not specific to HTML. Using maillist.php v1.73:


This message was imported correctly from Gmail:

Subject: =?UTF-8?B?RndkOiBkYW5za2UgdGVnbiDDpiDDuCDDpQ==?=
From: Peter Vendike <@mygmailadress.com>
To: Peter Vendike <@myotheradress.dk>
Content-Type: text/plain;
  charset=UTF-8
Content-Transfer-Encoding: quoted-printable
Status: R
X-Status: NC
X-KMail-EncryptionState:
X-KMail-SignatureState:
X-KMail-MDN-Sent:

---------- Forwarded message ----------
From: Peter Vendike <v(at)mygmailadress(dot)com>
Date: Sun, 1 Mar 2009 19:57:01 +0100
Subject: danske tegn =C3=A6 =C3=B8 =C3=A5
To: forum <import(at)myforum(dot)com>

De danske bogstaver =C3=A6 =C3=B8 =C3=A5



This one was not imported correctly, neither the subject nor the text:

From: Peter Vendike <@myotheraddress.dk>
To: import(at)myforum(dot)com
Subject: test af =?iso-8859-1?b?5vjl?=
Date: Sun, 1 Mar 2009 19:54:11 +0100
User-Agent: KMail/1.9.5
MIME-Version: 1.0
Content-Type: text/plain;
  charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable
Content-Disposition: inline
Message-Id: <200903011954(dot)11779(dot)xxxx(at)xxxx(dot)dk>
Status: RO
X-Status: RSC
X-KMail-EncryptionState:
X-KMail-SignatureState:
X-KMail-MDN-Sent:

dette er en test af danske tegn
The danish letters: =E6 =F8 =E5 =C6 =D8 =C5
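
For reference, the conversion that both sample messages need can be sketched in a few lines. This is a minimal Python sketch, not the code in maillist.php (which is PHP); the function names `decode_subject` and `body_to_utf8` are made up for illustration. The idea is to undo the transfer encoding first and then convert from the charset the message itself declares, instead of assuming one.

```python
import quopri
from email.header import decode_header, make_header


def decode_subject(raw_subject: str) -> str:
    """Decode RFC 2047 encoded-words (=?charset?b?...?=) into a Unicode string."""
    return str(make_header(decode_header(raw_subject)))


def body_to_utf8(raw_body: bytes, declared_charset: str) -> bytes:
    """Undo quoted-printable, then convert from the declared charset to UTF-8."""
    decoded_bytes = quopri.decodestring(raw_body)   # =E6 becomes the single byte 0xE6
    text = decoded_bytes.decode(declared_charset)   # interpret bytes using the header's charset
    return text.encode("utf-8")                     # store as UTF-8


# Subject and body lines copied from the two sample messages in this post.
utf8_subject = decode_subject("=?UTF-8?B?RndkOiBkYW5za2UgdGVnbiDDpiDDuCDDpQ==?=")
latin1_subject = decode_subject("test af =?iso-8859-1?b?5vjl?=")

utf8_body = body_to_utf8(b"De danske bogstaver =C3=A6 =C3=B8 =C3=A5", "utf-8")
latin1_body = body_to_utf8(b"The danish letters: =E6 =F8 =E5 =C6 =D8 =C5", "iso-8859-1")

# Print the UTF-8 bytes so the two-byte sequences (C3 A6, C3 B8, C3 A5) are visible.
print(utf8_subject.encode("utf-8"))
print(latin1_subject.encode("utf-8"))
print(utf8_body)
print(latin1_body)
```

With this approach the Gmail message and the KMail message end up with the same UTF-8 byte sequences for æ, ø and å, because the `charset` parameter from each message's own headers decides how the decoded bytes are interpreted.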


Re: maillist import from html to utf-8 incorrect [message #158526 is a reply to message #158525] Sun, 01 March 2009 14:58
naudefj (South Africa)
Do you think this is a new problem? I don't think the previous release could've loaded both these messages successfully either.
Re: maillist import from html to utf-8 incorrect [message #158527 is a reply to message #158526] Sun, 01 March 2009 15:18
Peter Vendike (Denmark)
No, I think it's the same with all versions I tried, so the problem must be old.


naudefj wrote on Sun, 01 March 2009 20:58
Do you think this is a new problem? I don't think the previous release could've loaded both these messages successfully either.

Re: maillist import from html to utf-8 incorrect [message #158534 is a reply to message #158527] Mon, 02 March 2009 06:31
Peter Vendike (Denmark)
Reading and trying to understand the program file "maillist.php", I believe the text conversion is done by one or more functions from the include directory, but as I'm not a programmer it would take me quite some time to understand what to change.


Some kind of character translation table is selected wrongly, but it is not so bad, as the translation is "nearly" right, as I can see by inspecting the forum message file "msg_10000" with a hex editor. In the "mistaken" translations the characters are still there, only as 8-bit instead of UTF-8.
When the forum is set up for UTF-8, it should not be possible for this function (which I don't know) to use an 8-bit translation table.
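
The hex-editor observation can be reproduced directly: a message file that is meant to be UTF-8 but contains lone bytes such as E6 or F8 fails strict UTF-8 validation. Below is a minimal Python sketch, assuming the stray bytes are ISO-8859-1 as in the KMail sample; `is_valid_utf8`, `ensure_utf8` and the `fallback_charset` parameter are hypothetical names, not part of FUDforum.

```python
def is_valid_utf8(data: bytes) -> bool:
    """Return True if the bytes decode cleanly as UTF-8."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


def ensure_utf8(data: bytes, fallback_charset: str = "iso-8859-1") -> bytes:
    """Leave valid UTF-8 untouched; otherwise convert from an assumed 8-bit charset."""
    if is_valid_utf8(data):
        return data
    return data.decode(fallback_charset).encode("utf-8")


# What the hex editor shows for a wrongly stored message: single bytes E6, F8, E5.
stored_wrong = b"The danish letters: \xe6 \xf8 \xe5"
# What a UTF-8 forum should contain instead: C3 A6, C3 B8, C3 A5.
stored_right = "The danish letters: \u00e6 \u00f8 \u00e5".encode("utf-8")

print(is_valid_utf8(stored_wrong))                # False
print(is_valid_utf8(stored_right))                # True
print(ensure_utf8(stored_wrong) == stored_right)  # True
```

A validity check like this is only a safety net. Some 8-bit byte sequences happen to be valid UTF-8, so converting from the charset declared in the message headers is the more reliable route.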