As far as I understand, requests are necessarily stored in a GenServer's state (Crawly.RequestStorageWorker) and it is not possible to plug its own storage module.
This has some disadvantages:
- it doesn't support restart of the server
- it cannot be distributed on several nodes
It would be nice if this module was configurable. That would make backends such as Mnesia or SQL databases possible, enabling distributed crawling.
As far as I understand, requests are necessarily stored in a GenServer's state (
Crawly.RequestStorageWorker) and it is not possible to plug its own storage module.This has some disadvantages:
It would be nice if this module was configurable. That would make backends such as Mnesia or SQL databases possible, enabling distributed crawling.