author Leonard Richardson <leonard.richardson@canonical.com> 2012-04-26 12:39:44 -0400
committer Leonard Richardson <leonard.richardson@canonical.com> 2012-04-26 12:39:44 -0400
commit dffee46db6c0f932ddae300c72c43b7512ea4de2
tree 5dff4056f801223d25cb9e618bc4b17addf43c2b /TODO.txt
parent a8a8fd33ac48b983feff293b0bddb8f365976632
Fixed a bug in decoding data that contained a byte-order mark, such as data encoded in UTF-16LE. [bug=988980]
Diffstat (limited to 'TODO.txt')
-rw-r--r-- TODO.txt 5
1 files changed, 5 insertions, 0 deletions
diff --git a/TODO.txt b/TODO.txt
index c9f9baa..b8dbfd2 100644
--- a/TODO.txt
+++ b/TODO.txt
@@ -11,6 +11,11 @@ which worries me and has resulted in a number of bugs.
markup_attr_map can be optimized since it's always a map now.
+Upon encountering UTF-16LE data or some other uncommon serialization
+of Unicode, UnicodeDammit will convert the data to Unicode, then
+encode it at UTF-8. This is wasteful because it will just get decoded
+back to Unicode.
+
CDATA
-----
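
The round trip described in the new TODO entry can be illustrated with a minimal standard-library sketch. This is not Beautiful Soup's actual code: the function names `decode_via_utf8` and `decode_direct` are hypothetical, and the sketch assumes the input starts with a UTF-16LE byte-order mark. It only shows that the intermediate UTF-8 encode/decode pair adds work without changing the result.

```python
import codecs


def decode_direct(data: bytes) -> str:
    """Strip a UTF-16LE byte-order mark if present, then decode once."""
    bom = codecs.BOM_UTF16_LE
    if data.startswith(bom):
        data = data[len(bom):]
    return data.decode("utf-16-le")


def decode_via_utf8(data: bytes) -> str:
    """Hypothetical model of the wasteful path from the TODO entry:
    decode to Unicode, encode as UTF-8, then decode the UTF-8 again."""
    text = decode_direct(data)          # bytes -> Unicode
    as_utf8 = text.encode("utf-8")      # Unicode -> UTF-8 bytes (extra step)
    return as_utf8.decode("utf-8")      # UTF-8 bytes -> Unicode (extra step)


if __name__ == "__main__":
    # Sample input: a UTF-16LE byte-order mark followed by UTF-16LE data.
    sample = codecs.BOM_UTF16_LE + "caf\u00e9".encode("utf-16-le")
    print(decode_direct(sample) == decode_via_utf8(sample))
```

Both functions return the same string, so the two extra steps in `decode_via_utf8` are pure overhead for input that has already been converted to Unicode.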