[Date Prev][Date Next][Thread Prev][Thread Next]
- Subject: Re: Recommended way to download and parse web pages?
- From: Coda Highland <chighland@...>
- Date: Sat, 16 May 2015 04:56:14 -0700
On Sat, May 16, 2015 at 4:06 AM, Gilles <firstname.lastname@example.org> wrote:
> On Fri, 15 May 2015 21:56:51 +0000, Thijs Schreijer
> <email@example.com> wrote:
>>I think you would need a 'fetching' and a 'parsing' element. For fetching you could use Copas , which has recently gained async client support for http(s) (luasec required for the 's' part). See this example  for fetching multiple pages simultaneously/async.
>>For parsing; depends on the complexity. If it's simple, use lua patterns. Otherwise the proposed lua-gumbo seems a good fit (just read the readme, have no experience with it).
> Thanks for the infos. Are those available as LuaRocks? It's easier to
> install for newbies.
> "lua patterns" = regex?
Not exactly regex, but conceptually similar.
For your edification: http://www.lua.org/pil/20.1.html