Great project. Unfortunately, the white paper does not yet give much information on how they plan to build the search eninge (other then organisations offering processing time)
Sounds like they want to use distributed crawling like Grub once did:
Which was taking up by Jimmy Wales to do Wikia Search
I like the focus on Europe, I wonder if their plans also involve Qwant:
@djoerd Or like NUTCH does https://cwiki.apache.org/confluence/display/NUTCH/Home
The hard part, however, is to avoid lying, spamming or skewing by (unintentional) malicious nodes. AFAIK this has been unsolved as of yet. Well: probably a few ICO-blockchain-projects that attempt to solve this in that space.
@berkes Does Nutch allow running independent clients?
But you're right: It is hard to manage clients that misbehave...
@djoerd «Nutch can run on a single machine, but gains a lot of its strength from running in a Hadoop cluster».
Its architecture is "clustered". Not distributed in the pure sense, but decentralized.
@berkes Nice, but the page reads a bit too much as an advertisement.
@djoerd thats your typical ICO project.
@djoerd I am happy using @Ecosia
@patricksudlow Still on Duckduckgo. Maybe I should help trees more. .
FAQ doesn't look good on mobile Firefox
The "unofficial" Information Retrieval Mastodon Instance.
Goal: Make idf.social a viable and valuable social space for anyone working in Information Retrieval and related scientific research.
Everyone welcome but expect some level of geekiness on the instance and federated timelines.