From 76f52a8fd79dd12680752c017d67d4be01f0afbc Mon Sep 17 00:00:00 2001 From: Michael Krelin Date: Sat, 05 Jan 2008 21:47:04 +0000 Subject: made more robust html discovery by using htmltidy now when parsing document that we expect might be html we also save first 16K of the document to the buffer and if the parser choked we run the saved data through htmltidy and feed the output to the parser again. Signed-off-by: Michael Krelin --- (limited to 'README') -- cgit v0.9.0.2