Daily Shaarli
April 12, 2020
Historically, disease outbreaks have happened at a time of social inequality and discord, says science journalist Laura Spinney
Exclusive: Documents seen by Guardian show tech firms using information to build ‘Covid-19 datastore’
Solr is the popular, blazing-fast, open source enterprise search platform built on Apache Lucene™.
Weil sie gegen die Ausgangsbeschränkungen verstoßen hatten, wurden zehn Touristen von der indischen Polizei zum Nachsitzen verdonnert: 500 Mal mussten sie sich schriftlich entschuldigen.
International relations expert warns policy failure could do lasting damage as president insults allies and undermines alliances
A research project I spent time working on during my master’s required me to scrape, index and rerank a largish number of websites. While Google would certainly offer better search results for most of the queries that we were interested in, they no longer offer a cheap and convenient way of creating custom search engines.
This need, along with the desire to own and manage my own data spurred me to set about finding a workflow for retrieving decent results for search queries made against a predefined list of websites. That workflow is described here, providing what I hope shall serve as a useful reference for how to go about setting up a small search engine using free and open-source tools.
Note:
• The instructions here assume that you use some UNIX-like operating system (Linux, MacOS, *BSD).
• Any Python code has only been tested using Python 3.7 (The clock is ticking).
• Any code provided is free to use under the MIT license.
My own dire prophecies failed to adequately predict the future and today I see him as someone far more terrifying
A Docker build for Solr, to manage the official Docker hub solr image