author | Stefan Behnel <stefan_ml@behnel.de> | 2012-10-09 09:09:44 +0200 |
---|---|---|
committer | Stefan Behnel <stefan_ml@behnel.de> | 2012-10-09 09:09:44 +0200 |
commit | fc88bcb54ea99ae7c5202b7b625db344554f10d3 (patch) | |
tree | 6c5dbac677d66b3a5a7c714f23f3964463028e50 /IDEAS.txt | |
parent | c4a56945badac3b61e5129141482475a112c6b3d (diff) | |
download | python-lxml-fc88bcb54ea99ae7c5202b7b625db344554f10d3.tar.gz |
cleaned up TODO and IDEAS text files
Diffstat (limited to 'IDEAS.txt')
-rw-r--r-- | IDEAS.txt | 13 |
1 files changed, 12 insertions, 1 deletions
@@ -4,7 +4,7 @@ Things to try out when life permits
 * zlib-based parsing/serialising of compressed in-memory data
 
   * requires a libxml2 I/O OutputBuffer with appropriate I/O functions
-    that call into the lzma compression routines
+    that call into the zlib compression routines
 
 * lzma-based parsing/serialising of compressed in-memory data
 
@@ -41,3 +41,14 @@ Things to try out when life permits
   * Disadvantage: the tree has to be built manually. In the current
     SAX based implementation, libxml2 does it for us.
 
+
+* rewrite iterparse() to accept a parser as argument instead of being
+  one
+
+  * disadvantage: iterparse() can't deal with all parser options
+
+* provide an HTMLParser wrapper that handles broken encodings in broken
+  HTML better, e.g. using BeautifulSoup's "unicode dammit" analyser
+
+* expose namespace prefixes through the QName class
+
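
The first idea touched by this diff, zlib-based parsing/serialising of compressed in-memory data, can be approximated at the Python level without any new libxml2 I/O buffer. The following is only a rough sketch under stated assumptions, not what IDEAS.txt proposes to implement in C: the helper names `parse_compressed` and `serialise_compressed` and the `chunk_size` parameter are hypothetical, and the code falls back to the standard library's ElementTree (which offers the same `feed()`/`close()` parser interface) when lxml is not installed.

```python
import zlib

try:
    from lxml import etree  # used when lxml is installed
except ImportError:
    # Stdlib fallback; XMLParser.feed()/close() behave the same way here.
    import xml.etree.ElementTree as etree


def parse_compressed(data, chunk_size=4096):
    """Parse zlib-compressed XML bytes, inflating them chunk by chunk.

    Hypothetical helper: the decompressed document is fed to the parser
    incrementally, so it is never held in memory as one big string.
    """
    decompressor = zlib.decompressobj()
    parser = etree.XMLParser()
    for start in range(0, len(data), chunk_size):
        chunk = decompressor.decompress(data[start:start + chunk_size])
        if chunk:  # the decompressor may buffer and return nothing yet
            parser.feed(chunk)
    tail = decompressor.flush()
    if tail:
        parser.feed(tail)
    return parser.close()  # the root element


def serialise_compressed(root):
    """Serialise an element to XML and compress the result with zlib."""
    return zlib.compress(etree.tostring(root))
```

A round trip would then be `parse_compressed(serialise_compressed(root))`. Unlike the OutputBuffer approach described in the file, the serialising half of this sketch still builds the full uncompressed byte string before compressing it.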