[Date Prev][Date Next][Thread Prev][Thread Next]
[Date Index]
[Thread Index]
- Subject: Re: HTML Parser Recommendation
- From: Rob Kendrick <rjek@...>
- Date: Fri, 24 May 2013 09:42:16 +0100
On Fri, May 24, 2013 at 09:53:27AM +0200, steve donovan wrote:
> On Thu, May 23, 2013 at 4:20 PM, Daniel Silverstone <
> dsilvers@digital-scurf.org> wrote:
>
> > Parsing HTML is very hard to get right, so you're unlikely to find a pure
> > Lua
> > version of a parser.
> >
>
> If it's well-formed HTML (and this is often a big 'if') then Penlight's xml
> module has an HTML mode
One of the strengths of libhubbub is that it parses good and bad HTML.
It follows the HTML5 spec, which essentially specificies how MSIE parses
broken HTML.
B.