author | Michael Krelin <hacker@klever.net> | 2008-01-05 21:47:04 (UTC) |
---|---|---|
committer | Michael Krelin <hacker@klever.net> | 2008-01-05 22:03:59 (UTC) |
commit | 76f52a8fd79dd12680752c017d67d4be01f0afbc (patch) (side-by-side diff) | |
tree | 42d640112a0381707d36ac3d72e48937978d911e /lib/expat.cc | |
parent | a0719fb611507d8b9962b87c600855d8837fc266 (diff) | |
download | libopkele-76f52a8fd79dd12680752c017d67d4be01f0afbc.zip libopkele-76f52a8fd79dd12680752c017d67d4be01f0afbc.tar.gz libopkele-76f52a8fd79dd12680752c017d67d4be01f0afbc.tar.bz2 |
made more robust html discovery by using htmltidy
now when parsing document that we expect might be html we also save first 16K
of the document to the buffer and if the parser choked we run the saved data
through htmltidy and feed the output to the parser again.
Signed-off-by: Michael Krelin <hacker@klever.net>
0 files changed, 0 insertions, 0 deletions