A recent study proves that FAROO delivers the cleanest results compared to Google, Bing, Yandex and Twitter by two orders of magnitude.
According to the results published by the study, 1 in every 7.652 results from Google is infected, but only 1 in every 114.500 results from FAROO is suspicious
That means FAROO is 73 times safer than Google. And it shows consistency by repeating the good results from the previous test in 2013.
Please note that the graphs are in logarithmic scale!
The complete study can be found here: Analysis of 160 Million Websites: Are Google and Other Search Engines Platforms for Distributing Malware?
The Web of Trust
I want to share some insights why FAROO does so much better in the test:
Most search engines are scanning the web pages for malware during the crawling & indexing process and sort out infected links. That helps to reduce the threat but has two serious drawbacks:
- The malware scanners are unable to detect new malware, before it was spotted by the company, analyzed and the scanner updated.
- There is a security gap between two consecutive scans of the page. Search engines are re-indexing and re-scanning pages that already in the index and were declared save only at certain intervals. For pages that are less popular that distance might be several month, and during this time the threat will remain undetected and the infected results are served to the user.
FAROO does not use this fragile approach of scanning web pages to detect malware. Instead FAROO relies on a concept which can be described as “web of trust”.
The web basically consists of web pages (content) connected by links (trust). For the ranking of search results we use both content of web pages and the links between them. But to determine the thrust or authority of web pages we relie solely on links.
This comprises three basic concepts:
- Trustworthy sources: The links have to come from trustworthy sources, with a limited distance within the chain of trust.
- Multiple independent sources: An information is deemed trustworthy if it is referred (linked) to by multiple independent sources
- “Time proven” links: The links have to stay online for an certain amount of time to be considered reliable (e.g. links from Wikipedia pages will be removed after short time if they have been found irrelevant or malicious).
Trust, Relevance & Completeness
Our approach does not only help to improve security but also relevance.
Most search engines are feeding the index with everything they can find, and afterwards try to find the needle in the haystack among all the irrelevant content by ranking.
The careful selection of content from reliable and trusted sources with authority, and those content referred and recommended by them ensures that there is almost only relevant content into the index in the first place.
One might think that this kind of focused crawling could lead to limited content. But that is not the case. The fact that Faroo’s index is not as comprehensive as Google’s is caused solely by its comparatively limited resources.
The “web of trust” concept does not compromise the richness and comprehensiveness of results, even not for long tail queries in expert domains.
This is because all web pages are allowed, once the web site domain has been approved by two reliable and trusted sources and has got a reference from them.