DawnSearch

The open source distributed web search engine that searches by meaning

What is DawnSearch?

DawnSearch aims to be an alternative to the search engines controlled by big corporations.

Privacy

This DawnSearch instance does not actively collect data on access, and does not store searches. However, some information may be temporarily stored in log files. Due to the way DawnSearch works, a processed form of your search query is sent to other instances. Do not use DawnSearch to search for any sensitive information.

Does this work as well as Google, Bing, Brave Search, etc.?

Currently, no. DawnSearch has loaded just 0.1% of one large dataset, and that dataset itself covers only part of the internet. Over the coming months the index will expand, and we will have to discover what that does to the quality of the results. As DawnSearch is an experiment, we hope to find many more improvements.

AI and statistics

DawnSearch uses AI and statistical techniques to search, which means that biases may be present. For example, certain kinds of language use may not be detected as 'English' and would then be excluded from the index. The AI model used, all-MiniLM-L6-v2, may also favor certain content over other content; whether and how it does so is currently unknown. For example, it could rank pages written by a male author higher than pages written by a female author. Such biases may come from the training data itself, or they may arise because the AI is not human and processes text differently than we do.
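
To give a rough idea of what searching by meaning involves: a model such as all-MiniLM-L6-v2 turns a piece of text into a list of numbers (384 of them for this model), and pages whose numbers point in a similar direction to those of the query are ranked higher. The sketch below is only an illustration and is not DawnSearch's actual code. The three-number vectors, the page names, and the function names are all made up for the example, and the real model is not called here.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        # A zero vector has no direction, so treat it as unrelated.
        return 0.0
    return dot / (norm_a * norm_b)


def rank_pages(query_vector, page_vectors):
    """Sort pages from most to least similar to the query vector."""
    scored = [
        (name, cosine_similarity(query_vector, vector))
        for name, vector in page_vectors.items()
    ]
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Hypothetical, hand-written vectors standing in for model output.
pages = {
    "bread-recipes": [0.9, 0.1, 0.0],
    "sourdough-guide": [0.8, 0.3, 0.1],
    "car-repair": [0.0, 0.2, 0.9],
}
query = [1.0, 0.2, 0.0]  # imagined vector for a query about baking bread

ranking = rank_pages(query, pages)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

In a ranking like this, the order depends entirely on the numbers the model produces for each text. If the model places some kinds of writing closer to a query than others, that preference shows up directly in the results.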

Open Source / Free Software

The code for this instance is available on GitHub under an open source / free software license. In short, anyone is free to modify this software, with the important note that if they give other people access, they will also have to share their modifications with them.