lua-users home
lua-l archive

[Date Prev][Date Next][Thread Prev][Thread Next] [Date Index] [Thread Index]


On Fri, May 24, 2013 at 09:53:27AM +0200, steve donovan wrote:
> On Thu, May 23, 2013 at 4:20 PM, Daniel Silverstone <
> dsilvers@digital-scurf.org> wrote:
> 
> > Parsing HTML is very hard to get right, so you're unlikely to find a pure
> > Lua
> > version of a parser.
> >
> 
> If it's well-formed HTML (and this is often a big 'if') then Penlight's xml
> module has an HTML mode

One of the strengths of libhubbub is that it parses good and bad HTML.
It follows the HTML5 spec, which essentially specificies how MSIE parses
broken HTML.

B.