What do you miss in your spiderable package?

katy_wings · March 25, 2015, 5:52pm

I am working on a revamp of the spiderable package, so I wanted to ask the community for quick feedback on the question in the title.

Currently I am working on:
Replacing PhantomJS with Zombie

Also planned are new settings that can be defined in Meteor.settings:

verbose
port
host (name or ip)
allowed bot patterns
caching, tmp lifetime
precaching (a recurring process caches URLs on a schedule)
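
To make the list above more concrete, here is a minimal sketch of how such settings could be typed and merged with defaults. Every key name and default value below is a hypothetical placeholder, not the package's actual configuration.

```typescript
// Hypothetical shape for the planned settings. None of these key names or
// defaults are confirmed by the package; they only mirror the list above.
interface SpiderableSettings {
  verbose: boolean;
  port: number;
  host: string;                 // name or IP
  allowedBotPatterns: string[]; // regex sources matched against the user agent
  cacheLifetimeMs: number;      // how long a cached tmp file stays valid
  precacheUrls: string[];       // URLs re-cached by the recurring process
}

const defaults: SpiderableSettings = {
  verbose: false,
  port: 3000,
  host: "localhost",
  allowedBotPatterns: ["googlebot", "bingbot"],
  cacheLifetimeMs: 60 * 60 * 1000,
  precacheUrls: [],
};

// Overlay user-supplied values (for example, read from Meteor.settings)
// on top of the defaults.
function resolveSettings(user: Partial<SpiderableSettings>): SpiderableSettings {
  return { ...defaults, ...user };
}

// True if the user agent matches any allowed bot pattern (case-insensitive).
function isAllowedBot(userAgent: string, settings: SpiderableSettings): boolean {
  return settings.allowedBotPatterns.some((p) => new RegExp(p, "i").test(userAgent));
}
```

With this shape, anything the app does not set falls back to the defaults, so a settings file only needs the keys it wants to change.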

serkandurusoy · March 25, 2015, 6:40pm

A seeded or (if possible) automatic pre-run and configurable caching would be two great features.

katy_wings · March 25, 2015, 6:47pm

What exactly do you mean by “seeded”?

I think I would do something like:
Google opens the site - check whether a tmp file is in the cache and not too old - output the tmp file or freshly rendered content
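
The flow above could be sketched roughly like this. It is only an illustration under stated assumptions: an in-memory object stands in for the tmp files, and `PageCache`, `render`, and `lifetimeMs` are hypothetical names, not part of the package.

```typescript
// Hypothetical in-memory stand-in for the tmp-file cache.
interface CacheEntry {
  html: string;
  storedAt: number; // epoch milliseconds
}

class PageCache {
  private entries: Record<string, CacheEntry> = Object.create(null);
  private lifetimeMs: number;

  constructor(lifetimeMs: number) {
    this.lifetimeMs = lifetimeMs;
  }

  // Return the cached HTML if it exists and is not too old;
  // otherwise render the page, store the result, and return it.
  get(url: string, render: (url: string) => string, now: number = Date.now()): string {
    const hit = this.entries[url];
    if (hit !== undefined && now - hit.storedAt < this.lifetimeMs) {
      return hit.html;
    }
    const html = render(url);
    this.entries[url] = { html, storedAt: now };
    return html;
  }
}
```

The `now` parameter is only there so the expiry logic can be exercised without waiting for real time to pass.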

simo7 · March 25, 2015, 7:22pm

Something like this? https://atmospherejs.com/chfritz/spiderable

katy_wings · March 25, 2015, 7:32pm

Why don’t you just use the package you mentioned, if it has everything you are missing? xD

serkandurusoy · March 25, 2015, 11:04pm

Oh, by seeded I mean providing a list of URLs to fetch and cache.

As opposed to automatic (crawling the whole site, starting at /).
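
A rough sketch of the difference between the two modes, assuming a hypothetical `linksOf` helper that returns the same-site links found on a page (a real version would have to render the page first):

```typescript
// Hypothetical link extractor: given a URL, return the same-site links on it.
type LinkSource = (url: string) => string[];

// Seeded: the list of URLs to fetch and cache is supplied up front.
function seededUrls(seeds: string[]): string[] {
  return seeds.slice();
}

// Automatic: breadth-first crawl from a start URL, visiting each URL once.
// The limit is an assumed safety cap so a large site cannot run forever.
function crawlUrls(start: string, linksOf: LinkSource, limit: number = 1000): string[] {
  const seen: Record<string, boolean> = Object.create(null);
  seen[start] = true;
  const queue: string[] = [start];
  const found: string[] = [];
  while (queue.length > 0 && found.length < limit) {
    const url = queue.shift() as string;
    found.push(url);
    for (const next of linksOf(url)) {
      if (!seen[next]) {
        seen[next] = true;
        queue.push(next);
      }
    }
  }
  return found;
}
```

Seeding keeps the work predictable, while crawling finds pages nobody thought to list, at the cost of rendering every page just to discover its links.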

katy_wings · March 26, 2015, 6:12am

Oh yeah, that’s a cool idea, and I see the use case.

katy_wings · April 12, 2015, 3:13pm

Just for the sake of completeness: the new package is out and you can try it out at:
https://atmospherejs.com/lufrai/spiderable2

timbrandin · April 27, 2015, 12:06pm

Has anyone been able to use this together with Meteor Up (mup) and meteorhacks:cluster?
I’m having difficulty debugging it, and I don’t get why you have to set a port.

chenroth · April 27, 2015, 12:24pm

webcomponents.js (Polymer) support