From: "Thomas Heller" <thomas.heller@ion-tof.com>
Date: Thu, 10 Jan 2002 22:21:27 +0100
Subject: Re: [Python-Dev] unicode/string asymmetries

From: "Martin v. Loewis" <martin@v.loewis.de>
> > >    unicode("some string", "unicode-escape")
> [...]
> > For example the copyright symbol "©" (repr("©") gives "\xa9").
> > Now I want to convert this string to unicode.
> > u"©" works fine, unicode(variable) gives an ASCII decoding error.
>
> As I said: unicode-escape is the precise encoding that is used to
> parse Unicode strings from source files. It interprets all bytes above
> 128 as Latin-1.
>
I must apologize, because first it didn't seem to work:

>>> print unicode("\xa9", "unicode-escape")

Traceback (most recent call last):
  File "<stdin>", line 1, in ?
UnicodeError: ASCII encoding error: ordinal not in range(128)
>>>

but then I found out that the result simply cannot be printed out,
while the repr of it can be:

>>> unicode("\xa9", "unicode-escape")
u'\xa9'
>>>

Thanks,

Thomas


_______________________________________________
Python-Dev mailing list
Python-Dev@python.org
http://mail.python.org/mailman/listinfo/python-dev