[Date Prev][Date Next][Thread Prev][Thread Next]
[Date Index]
[Thread Index]
- Subject: Re: Recommended way to download and parse web pages?
- From: Coda Highland <chighland@...>
- Date: Sat, 16 May 2015 04:56:14 -0700
On Sat, May 16, 2015 at 4:06 AM, Gilles <codecomplete@free.fr> wrote:
> On Fri, 15 May 2015 21:56:51 +0000, Thijs Schreijer
> <thijs@thijsschreijer.nl> wrote:
>>I think you would need a 'fetching' and a 'parsing' element. For fetching you could use Copas [1], which has recently gained async client support for http(s) (luasec required for the 's' part). See this example [2] for fetching multiple pages simultaneously/async.
>>
>>For parsing; depends on the complexity. If it's simple, use lua patterns. Otherwise the proposed lua-gumbo seems a good fit (just read the readme, have no experience with it).
>>
>>Thijs
>>
>>[1] https://github.com/keplerproject/copas
>>[2] https://github.com/keplerproject/copas/blob/master/tests/testlimit.lua
>
> Thanks for the infos. Are those available as LuaRocks? It's easier to
> install for newbies.
>
> "lua patterns" = regex?
Not exactly regex, but conceptually similar.
For your edification: http://www.lua.org/pil/20.1.html
/s/ Adam