Indexers
Introduction
Yandex search engine or Yandex web search engine (in Russian Яндекс.Поиск) is a search engine. It is owned by Yandex, based in Russia. In January 2015, Yandex search engine generated 51.2% of all search traffic in Russia according to LiveInternet").[1].
In May 2010, Yandex launched Yandex.com, a platform for beta testing and improving search in languages other than Russian.[2].
The search product can be accessed from personal computers, mobile phones, tablets and other digital devices. In addition to web search, Yandex offers a wide range of specialized search services.
In 2009, Yandex launched MatrixNet, a new machine learning method that significantly improves the relevance of search results. It allows the Yandex search engine to take into account a large number of factors when making the decision about the relevance of search results.[3].
Another technology, Spectrum, was released in 2010. It allows you to infer implicit queries and return matching search results. The system automatically analyzes user searches and identifies objects such as personal names, movies or cars. The proportions of search results that respond to different user intentions are based on user demand for these results[4].
First released on July 21, 2017, the Brave web browser features Yandex as one of its default search engines "Brave (web browser)").[5].
Functionality
Overview
The search engine consists of three main components:[6].
Indexing
In general, Yandex indexes the following file types: html, pdf, rtf, doc, xls, ppt, docx, odt, odp, ods, odg, xlsx, pptx.
The search engine can also index text within Shockwave Flash objects (if the text is not placed in the image itself), if these elements are passed as a separate page, which has the MIME type application/x-shockwave-flash and files with the extension .swf[7].
Yandex has 2 scanning robots: the "main" and the "fast" one. The first is responsible for the entire Internet, the second indexes sites with frequently changing and updated information (news sites and news agencies). In 2010, the "fast" robot received a new technology called "Orange", jointly developed by Yandex divisions in California and Moscow.[7].