[Date Prev][Date Next][Thread Prev][Thread Next]
- Subject: Re: HTML Parser Recommendation
- From: Wesley Smith <wesley.hoke@...>
- Date: Fri, 24 May 2013 18:15:55 -0700
> I just finished writing a complete tokenizer in C as an almost direct
> transliteration of the HTML5 tokenizing rules. I'm confident that it can't
> be done with LPeg, not if you want to be fully standards compliant and
> handle pathological cases, such as spammers might abuse.
I'd be surprised if this was the case. Do you have a particular
example in mind?