Sources are controlled by data/allowlist.json and data/seeds.txt.
data/allowlist.json
data/seeds.txt
The prototype intentionally ships with an empty allowlist. Add only domains that you have reviewed and intentionally want to crawl.
{ "domains": [ "example.org" ] }
Back to search