[Date Prev][Date Next][Thread Prev][Thread Next]
- Subject: Re: HTML Parser Recommendation
- From: Rob Kendrick <rjek@...>
- Date: Fri, 24 May 2013 09:42:16 +0100
On Fri, May 24, 2013 at 09:53:27AM +0200, steve donovan wrote:
> On Thu, May 23, 2013 at 4:20 PM, Daniel Silverstone <
> firstname.lastname@example.org> wrote:
> > Parsing HTML is very hard to get right, so you're unlikely to find a pure
> > Lua
> > version of a parser.
> If it's well-formed HTML (and this is often a big 'if') then Penlight's xml
> module has an HTML mode
One of the strengths of libhubbub is that it parses good and bad HTML.
It follows the HTML5 spec, which essentially specificies how MSIE parses