[Date Prev][Date Next][Thread Prev][Thread Next]
[Date Index]
[Thread Index]
- Subject: Re: Recommended way to download and parse web pages?
- From: Gilles <codecomplete@...>
- Date: Sat, 16 May 2015 13:06:30 +0200
On Fri, 15 May 2015 21:56:51 +0000, Thijs Schreijer
<thijs@thijsschreijer.nl> wrote:
>I think you would need a 'fetching' and a 'parsing' element. For fetching you could use Copas [1], which has recently gained async client support for http(s) (luasec required for the 's' part). See this example [2] for fetching multiple pages simultaneously/async.
>
>For parsing; depends on the complexity. If it's simple, use lua patterns. Otherwise the proposed lua-gumbo seems a good fit (just read the readme, have no experience with it).
>
>Thijs
>
>[1] https://github.com/keplerproject/copas
>[2] https://github.com/keplerproject/copas/blob/master/tests/testlimit.lua
Thanks for the infos. Are those available as LuaRocks? It's easier to
install for newbies.
"lua patterns" = regex?