When initiating an international SEO strategy targeted at Russia (along with select countries in the Middle East and Eastern Europe), it becomes imperative to acknowledge the significant presence of Yandex, a major competitor to Google.
Yandex, established initially as Yandex Search in 1997 by CompTek, commands a substantial market share within Russia, surpassing Google with 47.87% as of August 2017. In contrast, globally, it maintains a smaller share of around 1.5% of total searches. Recent legal developments in Russia have also challenged Google’s dominance, potentially influencing similar actions by the European Union, particularly concerning Google’s control over Android devices.
Given these circumstances, it is prudent to gain a deeper understanding of Yandex. Like Google, Yandex has evolved since its inception to adapt to changing user behaviors and market dynamics, and to combat search result manipulation by SEO practitioners.
👉Yandex Mobile Algorithms
Yandex’s response to the burgeoning mobile market in Russia is particularly noteworthy. The country has nearly 79 million smartphone users, a figure projected to grow to over 93 million by 2021. Consequently, there has been a significant emphasis on enhancing the mobile search experience and on mobile SEO.
In May 2017, the Russian Federal Antimonopoly Service intervened, ruling against Google’s default search engine settings on Android devices, deeming them anti-competitive and detrimental to consumer interests. This decision compelled Google to open the Android platform, enabling users to select alternative search engines as their default choice.
👉Vladivostok
A pivotal moment came with the introduction of Vladivostok in February 2016, a significant update to Yandex’s core algorithm that shifted focus towards mobile user experience and content optimization.
Before this update, Yandex had been tagging mobile-friendly websites in search results since November 2015, signaling to webmasters the importance of mobile responsiveness. The Vladivostok update did not automatically penalize non-mobile-friendly websites, but it did trigger ranking fluctuations that depended on the user’s device.
Although mobile friendliness constitutes just one of the numerous ranking factors, it remains crucial for enhancing user experience and website visibility.
👉Yandex Artificial Intelligence Algorithms
Yandex’s journey into machine learning commenced in 2009 with the introduction of MatrixNet. This algorithm constructs a formula by considering numerous variables and ranking factors, adjusting their weights based on the search query, intent, and interpretations to provide more pertinent search outcomes.
MatrixNet arrived with the Snezhinsk algorithm, which prioritized non-commercial content over commercial content for brief queries with multiple interpretations. It also evolved to assess the overall value of domain ecosystems rather than individual page value.
A subsequent version, Snezhinsk 1.1, released in March 2010, focused on enhancing search result quality for location-specific queries.
👉Palekh
In 2016, Yandex launched the Palekh algorithm, which aimed to refine the core algorithm’s comprehension of long-tail search queries. Palekh utilized neural networks to decipher semantics and identify prevalent interpretations of such queries, enabling the return of relevant search results even when the exact search terms were absent from the content.
👉Korolyov
In August 2017, Yandex introduced the Korolyov update, further refining the Palekh algorithm. Korolyov’s enhancement involved a more thorough analysis of potential search results in real time, significantly accelerating the process. According to Melissa McDonald, Yandex’s International Communications Director, and Andrey Styskin, the Head of Yandex Search, Korolyov builds upon Palekh’s foundations to deliver more comprehensive and faster results.
Korolyov expands upon the foundation laid by Palekh, Yandex’s first neural network algorithm, which focused on the semantic aspect of search queries and learned from user interactions. Unlike Palekh, which primarily analyzed headlines, Korolyov can understand the contextual meaning of queries and correlate them with relevant pages.
Moreover, while Palekh was limited to analyzing 150 pages at a time, Korolyov can handle a significantly larger volume, simultaneously processing up to 200,000 pages.
Like Google’s RankBrain, Korolyov employs machine learning techniques to continuously enhance its performance with each new data point, and its findings feed into MatrixNet. Concurrently, Yandex has integrated data from its crowdsourcing platform Toloka and anonymized user data into MatrixNet, further enriching its capabilities.
Yandex expects Korolyov’s performance to benefit significantly from the vast amount of data generated by its tens of millions of users. Since the introduction of Palekh, this has already led to a 2.8% improvement in search quality. The algorithm employs advanced semantic vectors to analyze the meaning behind search queries, enabling it to better understand the intent behind user searches and match them with relevant content efficiently.
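The idea of matching queries and pages by semantic vectors can be sketched in a few lines of Python. This is only a toy illustration, not Yandex’s implementation: the three-dimensional vectors, the page names, and the rank_pages helper are all made up for the example, and a real system would obtain its vectors from a trained neural network.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # a zero vector carries no direction to compare
    return dot / (norm_a * norm_b)


def rank_pages(query_vec, pages):
    """Sort pages by similarity to the query vector, best match first."""
    scored = [(name, cosine_similarity(query_vec, vec)) for name, vec in pages.items()]
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Hypothetical vectors: in practice a neural network would produce these.
query_vector = [0.9, 0.1, 0.3]
page_vectors = {
    "page_a": [0.8, 0.2, 0.4],  # points in nearly the same direction as the query
    "page_b": [0.1, 0.9, 0.0],  # points in a very different direction
}

ranking = rank_pages(query_vector, page_vectors)
```

Because the comparison happens between vectors rather than words, a page can rank well even when it shares no exact terms with the query, which is the behavior described for Palekh and Korolyov.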
With advancements in technology, the rise of voice search, and recent regulatory developments such as the Russian Federal Antimonopoly Service’s settlement with Google, there’s a possibility that Russia’s substantial number of Android smartphone users may transition from Google to alternative search engines like Yandex in the future.
👉Andromeda
In November 2018, Yandex introduced Andromeda, a significant update following the Korolyov and Palekh updates. Andromeda extensively enhanced Yandex’s search algorithms, totaling over a thousand improvements and new features.
Key upgrades included refining the “quick answer” feature and integrating “experts” to assist users in obtaining more accurate and expedited responses to their queries. Additionally, Andromeda introduced official site badges on search result pages to help users easily identify the most relevant and reliable sources.
Another notable addition was Yandex.Collections, offering users a visual platform to save and navigate content. Users could store search results in a visual tile format, encompassing various media types such as links, images, videos, and locations. This feature enabled users to access their saved collections across mobile and desktop devices and follow public collections based on shared interests.
👉Vega
On December 17th, 2019, Yandex unveiled the Vega update, which marked a significant advancement in its search capabilities. Vega employed neural networks trained by subject matter experts to enhance search results, ensuring greater accuracy and relevance.
Furthermore, Vega implemented AI-powered algorithms to group similar web pages together, optimizing resource utilization and expanding Yandex’s search index to encompass 200 billion documents.
👉Yandex.Q
Yandex.Q, a novel question-and-answer service, was introduced by Yandex alongside the Vega update. It combines the functionality of Yandex.Experts, which debuted in the 2018 Andromeda update, and TheQuestion, a Q&A platform acquired by Yandex in early 2019.
With a repository exceeding one million questions and answers curated by subject matter experts, users can submit inquiries via the Yandex.ru search bar. Yandex.Q’s responses are prominently displayed atop search results, similar to Google’s featured snippets.
An illustrative scenario provided by Yandex showcases how users searching for information on Alexander Pushkin might encounter insights from a literary critic, or those researching seal behaviors could receive input from the head of the National Arctic and Antarctic Museum.
👉Yandex Link-Based Algorithms
Like its counterpart, Google, Yandex has grappled with link spamming and Private Blog Networks (PBNs). In response, Yandex has implemented various measures to combat manipulation and ensure the fair ranking of search results.
👉Nepot Filter
One notable initiative was the introduction of the Nepot Filter in 2005, aimed at curbing the proliferation of link exchanges and PBNs. Unlike traditional approaches focusing on link quality, this filter targeted websites engaged in the rapid accumulation of unnatural links and imposed penalties accordingly. Subsequently, in March 2008, an algorithmic update was rolled out, although its specific details remained undisclosed, suggesting a continued effort by Yandex to combat link spam.
👉Ranking Without Links
In December 2013, Alexander Sadovsky, then head of search at Yandex, announced plans for an algorithm that would not factor in links for ranking purposes. A beta version of this algorithm was subsequently launched in Moscow, focusing on sectors like real estate, tourism, and consumer household appliances. Later, at the ByNet Week conference in 2015, Sadovsky introduced a new link-based algorithm named Minusinsk.
Following this announcement, numerous websites were warned via Yandex.Webmaster to avoid link spam and other manipulative tactics aimed at influencing search outcomes. Minusinsk was implemented in three distinct phases: May 15, May 27, and June 23.
👉Yandex Local Algorithms
The approach to local SEO in Russia, particularly through Yandex, diverges from Google’s strategies for large countries like the United States. Yandex implemented significant changes to its algorithm to enhance local search relevance.
In 2006, Yandex introduced automatic geo-classification of websites, aiming to deliver more pertinent results for users seeking local information. This innovation aligned search queries more accurately with users’ local intents.
👉Arzamas & Konakovo
The first notable local algorithm, Arzamas, was introduced in April 2009, following the 2008 Magadan updates. Arzamas automatically determined regional relevance for search queries, a departure from the manual selection option provided by the 2006 update.
Through Yandex.Webmaster tools, website administrators gained the capability to specify regions, facilitating more targeted search results. Additionally, regional search result pages were launched for key cities and countries, including Moscow, Saint Petersburg, Ukraine, Belarus, and Kazakhstan.
Arzamas underwent further refinement in June and August 2009, enhancing ranking formulas for cities like Yekaterinburg. Moreover, older and trusted websites saw improved rankings for queries irrespective of location. In December 2009, the Konakovo update expanded Yandex’s local ranking considerations to encompass 1,250 cities, broadening the scope of regional relevance factors.
👉Obninsk
In September 2010, Yandex made significant enhancements to its algorithm’s capability to identify a website’s geographical region, even if the site administrator had not specified a region in Yandex.Webmaster. This update, known as the Obninsk algorithm, also targeted low-quality link spam, thereby improving search result relevance.
In 2012, Yandex expanded its local results algorithm to include image search, tailoring results based on the user’s location in Russia, Ukraine, Belarus, or Kazakhstan.
👉Yandex Content & Quality Algorithms
Like Google, Yandex has implemented various measures to ensure search results align with user search intents and provide high-quality content.
👉Unnamed 2007 Update
An unnamed update in July 2007 introduced new ranking formulas and factor weighting for single and multi-word queries, accompanied by enhanced support from the Yandex search team.
👉8 SP1
In 2008, Yandex introduced its first officially named algorithm, 8 SP1, to address the dominance of older, larger websites in search results. 8 SP1 introduced a “Trust Rank” to evaluate website credibility, reducing the reliance on backlinks.
👉Magadan
Following 8 SP1, Magadan was introduced in May 2008 to improve the interpretation of abbreviations in search queries. Web admins could test Magadan’s search results and provide feedback via buki.yandex.ru.
Magadan 2.0, released in July of the same year, incorporated considerations for content uniqueness and better differentiation between commercial and non-commercial, as well as local and national queries.
👉Nakhodka
In September of 2008, Yandex implemented Nakhodka, a refinement to search results that targeted internal pages of promoted websites, expanding beyond just the homepage. Nakhodka’s primary focus was to address the issue of cloaking more effectively.
👉AGS Filter
Around the same time, Yandex introduced the AGS filter, which was officially acknowledged in September 2008. According to Yandex, they had been developing this filter since 2006. Initially designed as a penalty mechanism, the AGS filter restricted search results to display only one to ten pages from a website. However, this restriction was lifted in 2014.
Initially, the AGS filter primarily targeted websites with low-quality or duplicate content. Subsequent updates, such as AGS 30 in December 2009, extended penalties to include sites with unique but poor-quality content lacking user value. AGS 40, introduced in November 2013, targeted websites designed solely to generate affiliate link clicks or advertising revenue, much as Google’s Fred update did in 2017.
In 2015, Yandex rolled out a significant update to AGS, focusing on websites that sell and place links.
👉Krasnodar
Another notable development occurred in 2010 with the Krasnodar update, which introduced Spectrum. Spectrum categorized queries into 60 semantic categories, adjusting the weighting and presentation of search results accordingly. This diversification aimed to provide users with a broader range of relevant search results, encompassing various types of websites beyond just commercial product pages, including eCommerce stores and informational sites like Wikipedia, blogs, and forums. This enhancement aimed to increase the likelihood of users finding the most suitable search results for their queries.
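One simple way to picture this kind of diversification is a round-robin merge across result categories. The sketch below is purely illustrative and is not how Spectrum is known to work: the category labels, scores, and the diversify function are assumptions made for the example.

```python
from collections import defaultdict, deque


def diversify(results, limit):
    """Interleave results from different categories, best-scoring category first.

    results: list of (url, category, score) tuples (hypothetical input shape).
    Returns up to `limit` URLs, taking one result per category in each round.
    """
    buckets = defaultdict(list)
    for url, category, score in results:
        buckets[category].append((score, url))

    # Within each category, the highest-scoring result comes first.
    queues = [deque(sorted(items, reverse=True)) for items in buckets.values()]
    # Categories are visited in order of their best result.
    queues.sort(key=lambda q: q[0][0], reverse=True)

    output = []
    while queues and len(output) < limit:
        remaining = []
        for q in queues:
            if len(output) < limit:
                output.append(q.popleft()[1])
            if q:
                remaining.append(q)
        queues = remaining
    return output


sample = [
    ("shop1", "commercial", 0.9),
    ("shop2", "commercial", 0.8),
    ("wiki", "informational", 0.7),
    ("forum", "forum", 0.5),
]
top_three = diversify(sample, 3)
```

With this input, a purely score-based top three would contain two shops, while the interleaved version surfaces one result from each category.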
👉Reykjavik & Kaliningrad
In 2011, the Reykjavik update altered search results based on the user’s browser language and search language preference, marking Yandex’s initial move towards personalized SERPs at the user level. Subsequently, the 2012 Kaliningrad update expanded this approach by considering a user’s search history and behavior on search results pages.
👉You Are Spammy
During the same year, Yandex implemented the “You Are Spammy” filter to address issues related to overoptimized, spam-filled, and keyword-stuffed content. Additionally, website usability gained significance, leading to the development of a dedicated algorithm focusing on e-commerce website usability, content quality, and trustworthiness following a successful beta test in the Moscow region.
👉Obtrusive & Fake Pop-Ups
In May 2012, Yandex began addressing the problem of websites employing deceptive pop-ups, including fake Windows or social media notifications. This effort was further reinforced in 2014 with updates aimed at downgrading websites featuring adult or obscene advertisements and those with aggressive and intrusive adverts within organic search results.
👉Search Result Manipulation
Like Google, Yandex adjusts its search results based on user behavior data. Towards the end of 2014, Russian link networks offering a “link strengthening” service began manipulating this user data. The manipulation involved excessive clicking on client results, or on websites within their network that linked to client websites, purportedly to enhance client site performance.
👉Introduction of the Website Quality Index and Removal of the TIC Indicator
In response, Yandex took measures in August 2018 by discontinuing the use of the TIC (Thematic Citation Index), which evaluated the quality of inbound links to a website. Instead, they introduced the website quality index (IKS in Russian). This new index incorporates data from various sources such as Metrica, Maps, and Zen, among others, to gain a deeper understanding of the website’s content and purpose.
👉Exploring the Impact of the Yandex Data Breach on SEO
The breach of data from Yandex has shed light on the inner workings of search engines, prompting a significant shift in SEO strategies. This revelation provided an unprecedented view into the factors influencing the ranking of results by a major search engine player.
The Yandex data breach exposed a wealth of information about 1,900 factors that influence search rankings. This unprecedented disclosure offered insights into aspects such as the importance of PageRank, link longevity, content relevance over time, and more.
The leak revealed that Yandex favors websites with video content, mobile-friendly designs, and user-generated reviews. A preference for .com domains and content proximity to users’ geographical locations was also highlighted.
Further examination of the leaked information emphasized the significance of URL structure, document age, content freshness, internal linking, and crawl depth in determining visibility within Yandex’s search results.
Yandex’s policies severely penalize websites that manipulate clicks or user behavior. Moreover, the effectiveness of Yandex’s machine learning models in detecting manipulative tactics like Private Blog Networks (PBNs) underscores the evolving landscape of digital marketing integrity.
The disclosure underscores the sophistication with which search engines like Yandex evaluate content quality online, prompting a reevaluation of SEO strategies across the industry. With the implications of the Yandex data breach for SEO in mind, let’s explore how these revelations might impact the current ranking landscape.
👉Revealing The Top Yandex SEO Ranking Factors
Sl No. | Yandex SEO Ranking Factor | What It Does? |
0 | PR | PageRank. The factor is remapped. |
1 | TR | |
2 | LR | Link relevance. The factor is remapped. |
3 | PrBonus | Priority bonus; priority 7 is text priority. A binary factor: 0 for all single-word queries and 1 for almost all queries of two or more words, except for a very small number of results for which no link passed the quorum and the text did not pass the quorum either. |
4 | TRp1 | Strict priority for TR (a text priority): all the words of the query appear somewhere in the document (while satisfying the query’s contextual restrictions, for example, both words DB in one sentence). |
5 | TRp2 | Phrase priority for TR (a text priority): all the words of the query appear consecutively in the document. |
6 | LRp1 | (Strict) All the words of the query appear in one link. |
7 | LRp2 | (Phrase) All the words of the query appear consecutively in one link. |
8 | TRtitle | The presence of the exact phrase (the query text) in the title (more precisely, in the first sentence of the document). Contextual restrictions and stop words are taken into account exactly as in TRp2, i.e. factor [8] is bounded above by factor [5]. |
9 | TRhr | A passage passed the quorum in which all word positions are marked as having Best_relev relevance (title or meta keywords). |
10 | Removed_10 | |
11 | News | This is a news page (determined by patterns in the URL; see http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayaformula/tekushichiekomponenty/klassificacionnye?v=tkd#h45859-3). |
12 | Shop | This is a shop (determined by patterns in the URL; see http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayafformula/tekushichiekomponenty/klassificacionnye?v=tkd#h45859-4). Not used (deprecated). |
13 | Cat | This is a catalog (determined by the characteristic ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayafformula/tekushhiekomponenty/klassificacionnye? . |
14 | YaBar | Traffic according to the Yandex Bar (data description: http://wiki.yandex-team.ru/andrejjkostjagin/yabarlog/hoststat). The factor is remapped. |
15 | Long | Long document (the longer the document, the greater the value of the factor). |
16 | TRhitw | HitWeight is a variant of text relevance in which the weights of all hits are considered equal (i.e., the boosts for the title and for word proximity are not taken into account). The corresponding hits must satisfy the restrictions of the syntactic wizard, i.e. the TRhitw factor is 0 if and only if SoftAndOk is 0. |
17 | LongQuery | The sum of the IDFs of the query words. The name does not reflect the essence: for example, for the query ‘Gadyach’ this factor will be greater than for the query ‘Moscow Peter Yekaterinburg Samara’. |
18 | PureText | Long text without links. |
19 | Root | This is the site’s home page (root). |
20 | Removed20 | |
21 | Removed21 | |
22 | Geo | Indicates a match between the user’s region and the site’s region at the country level. Binary factor: 1 = match, 0 = no match. Based on the geoclassification of sites (http://wiki.yandex-team.ru/ Yandexposisk/ Classification of Sytraitniki/ Geographic/Sospolzanievpoysk). |
23 | SubqueryThMatch | Match between the thematic spectra of the query and the document. Query topics are the output of the SubquerySearch wizard rules (http://wiki.yandex-team.ru/evgenijjkroxalev/subquery); the document’s topic is taken from the Yandex catalog. |
24 | SR | The composite Static Rank is assembled from static components according to a separate formula (http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAfermula/tekushhiekomponenty/#oftnd1). |
25 | TRref | A factor based on the number of refines. The query language has a user-refine feature (a word preceded by a percent sign). The idea is that it means something like ‘it would be good if this word were in the document’. The only valuable use of this feature known to Andrey Gulin (http://staff.yandex-team.ru/gulin) is a query of the form [%official %site name of the film]. Users are unaware of this feature because it is not described in any documentation. It is planned to remove it from the query language, but words with User_refine priority will remain in the wizard. The factor indicates the maximum number of user_refine words found simultaneously within a single hit in the quorum. Values range from 0 to 3 (anything above 3 is treated as 3). This number is mapped to the half-interval [0, 1]. |
26 | TRboost | The number by which some link factors are multiplied (namely factors 6, 7, 47, and 66) if text relevance is 0 and there are few links. |
27 | TRLRlemma | The lemma matches in text relevance. |
28 | TrafgraphOutAll_share_d | Remapped mascot feature TrafgraphOutAll_share_d |
29 | RelevSentsDssm | DSSM model trained on reformulations; it uses the sentences in the document that are relevant to the query. |
30 | FreshNewsDetectorPredict | The value of the news detector calculated in the Hippo. Always 0 when the detector value is below the threshold. |
31 | LRHitNum100 | The transformed number of query words in all links to the URL. |
32 | LRHitNumGt16 | The document has LR > 20 and the number of query words in the links is > 16; an LR-related factor. |
33 | PctLinks | For documents with a high LR, a normalized link relevance excluding proximity; for documents with a low LR, 0. |
34 | HasLR | The URL has a high LR. |
35 | LinkQuality | The quality of incoming links (the classifier of the bream). Broken; see [405]. |
36 | | The value of the CosineMatchMaxPrediction factor for the ALICEMUSIC stream |
37 | NumLinks | The number of incoming links. Remapped. |
38 | PopularQ | The popularity of the query |
39 | TRUnmapped | TR divided by the cube of the number of words in the query and transformed by the standard REMAPTR. |
40 | RusLang | The language of the document is Russian. |
41 | AddTime | The time since the page was added; a larger value means an older document. The square root of the time is taken and mapped to the interval [0, 1] so that 3+ years gives 1. |
42 | IsMainPage | If this is the main page of the owner (most often a second-level domain, for example xxxx.ru), the factor is 1. For free hosting, personal blogs, etc. (for example LiveJournal, People.ru, etc.), third-level domains (such as xxxxx.narod.ru) also get a factor of 1. |
43 | AddTimeMP | The time the main page of the owner (host?) was added. Remapped like AddTime. |
44 | AliceMusicTrackTitleAnnotationMaxValueWeighted | The value of the AnnotationMaxValueWeighted factor for the ALICEMUSIC stream |
45 | QueryURLClicksPCTR | How often they click in this URL for this request – CTR blasting for the correction factor |
46 | TextBM25 | Simple BM25 in text. |
47 | LinkBM25 | Simple BM25 for links; link weights are not taken into account. |
48 | TLBM25 | Simple BM25 in text and links at the same time. |
49 | TLp1 | All the words of the request are in the text + links. |
50 | Adv | There is advertising on the site. |
51 | YandexAdv | On the site there is an advertisement for Yandex. |
52 | NoSpam | The spam classifier for Picks from Antispam recognized the site as not (!) spam, i.e. 0 = spam, 1 = good. |
53 | TxtPair | Simple BM25 over pairs of words: we take all pairs of query words and count their occurrences in the text of the document. The weight of a pair is the sum of the weights of its words. It does not work if the query contains a stop word. |
54 | LnkPair | The same as TxtPair, but for links; link weights are not taken into account. |
55 | TxtBreak | BM25 from the number of sentences in the document in which it occurs. |
56 | TxtHead | BM25 over headings only. |
57 | TxtHiRel | BM25 over high-relevance hits only (‘significant’ text, with emphasis such as <b>, etc.). |
58 | Removed_58 | |
59 | WordCount | min(number of query words / 10, 1.f) |
60 | InvWordCount | 1 / quantity_lov_v_ |
61 | HasNoTR | The document has no TR. |
62 | HasNoLR | The document has no LR. |
63 | HasNoQueryURLShows | There is no click information for this URL and this query: 1 = the query or the query-URL pair is not in the click database, 0 = the query-URL pair is in the click database |
64 | HasNoQueryShows | There is no click information for this query: 1 = the query is not in the click database, 0 = the query is in the click database. |
65 | Hops | The number of hops from the home page to the URL (the closer to the home page, the lower the value: 0 = the home page, 1 = cannot be reached from the home page, between 0 and 1 = reachable from the home page). The normal value for the root of the host is 0.0039. |
66 | LogLR | Logarithm of LR, linearly mapped to [0, 1]. |
67 | TxtPairEx | the presence of pairs of words in the exact form |
68 | TxtBreakEx | the number of sentences in which there are many words in the exact form |
69 | TxtHeadEx | the presence of words in the header in the exact form |
70 | TxtHiRelEx | BM25 in the exact form |
71 | TxtBm25Ex | Simple BM25 in the exact form. |
72 | TxtPairSy | the presence of pairs of words taking into account synonyms (> = txtpair) |
73 | TxtBreakSy | the number of sentences in which there are many words taking into account synonyms |
74 | TxtHeadSy | the presence of words in the header, taking into account synonyms |
75 | TxtHiRelSy | BM25 taking into account synonyms |
76 | TxtBm25Sy | Simple BM25 taking into account synonyms. |
77 | QueryDOwnerClicksPCTR | How often they click in the URLs of this Domainid for this request – Ctr Domainid blasting for the correction factor |
78 | HasNoQueryDOwnerShows | There is no click information for this DomainId and this query: 1 = the query or the query-owner pair is not in the click database, 0 = the query-owner pair is in the click database |
79 | OwnerClicksPCTR | The owner’s clickability regardless of the query |
80 | Megafon | The relative frequency of the query words in the links (1 = the query words are often found in links, 0.3 = rarely). More precisely, the value of this factor is lowered when: TR = 0 && LR = 0 && (there is not a single link with all the words of the query) && (did not pass the quorum) && (at least one pair of query words is found in the text) |
81 | XLRp0 | There are all the words of the request in the links |
82 | XLRp1 | There are all the words of the request in one link |
83 | XLRp2 | There is a link that has passed quorum |
84 | XLRgood | What is the share of “good” links |
85 | XLRmanyBad | How many “bad” links (bad = DPR = 0) |
86 | XLRmaxDpr | Maximum DPR links |
87 | XLRtfidf | Ordinary TF*IDF over links. The frequency of each word in the links is multiplied by its inverse document frequency and summed over all words, then normalized by the length of the document. |
88 | XLRrelev | Link relevance by Gulin |
89 | XLRrelev200 | Link relevance by Gulin |
90 | XLRlogRelev | Link relevance by Gulin |
91 | BFexact | The exact form of all the query words is present in the text/links |
92 | BFlemma | A lemma of all the query words is present in the text/links |
93 | SoftAndOk | The document passed SoftAnd under the restrictions of the syntactic wizard. Only for documents with text relevance. For single-word queries, always 1. |
94 | NewLinkQuality | Quality classifier of incoming links 2. Broken; see [407] |
95 | Ukrainian | It is equal to one if the site has a Ukrainian geoist (i.e. 1 – Ukrainian site) |
96 | IsBlog | Page from a blog hosting platform |
97 | IsLivejournal | Page from Livejournal.com |
98 | Removed_98 | |
99 | Spam2 | Automatic spam classifier named after Alekseeva: the likelihood that the website is spam (0 = not spam, 1 = spam) |
100 | TextFeatures | The quality of the text. Computed with a rather complex formula |
101 | TextLike | Text quality (classifier Alekseeva) |
102 | Removed_102 | |
103 | Removed_103 | |
104 | YaBarCoreOwner | The core of the audience of owners according to Yandex.Mrazusing |
105 | YaBarCoreHost | The core of the audience of the hosts according to Yandex.Mrazusing |
106 | HasYaBarCore | Whether the host has an audience core |
107 | SpamKarma | Spam karma from the antispam team: the likelihood that the host is spam, based on Whois information |
108 | MusicQ | The musicality of the query. Output of Anton Konygin’s wizard. |
109 | XLExactMatches | The number of links that exactly coincide with a request |
110 | DocLen | Document length in sentences |
111 | UrlLen | The length of the URL, divided by 5 |
112 | QueryNonCommerciality | The commerciality of the query according to the dictionary of phrases from Direct: 0 = maximally commercial, 1 = minimally commercial. |
113 | HostSize | The size of the host (named after Raskovalov) in documents, without deduplication (each duplicate counts as an independent document) |
114 | IsHTML | Document type – HTML |
115 | LinkSpeed | The inverse of the variance of the times at which links containing the query words appeared |
116 | XThLRrelev | Link relevance, taking into account thematicity |
117 | XThLRrelev200 | Link relevance, taking into account thematicity |
118 | XThLRlogRelev | Link relevance, taking into account thematicity |
119 | XLerfLRrelev | Link relevance, taking into account the quality of each link |
120 | XLerfLRrelev200 | Link relevance, taking into account the quality of each link |
121 | XLerfLRlogRelev | Link relevance, taking into account the quality of each link |
122 | XLerfThLRlogRelev | Link relevance, taking into account the quality of each link and thematicity of each link |
123 | XNonCommLRlogRelev | Link relevance, taking into account the non-commerciality of each link |
124 | XNonCommThLRlogRelev | Link relevance, taking into account the non-commerciality of each link and thematicity |
125 | XNonCommLerfLRlogRelev | Link relevance, taking into account the non-commerciality of each link and the quality of each link |
126 | XNonCommLerfThLRlogRelev | Link relevance, taking into account the non-commerciality of each link, the quality of each link and thematicity |
127 | GeoCityProxim | Indicates whether the region mentioned in the query matches the region of the found sites, at the level of areas. Binary factor: 1 = match, 0 = no match. Based on ((http://wiki.yandex-team.ru/ Yandexposisk/ Classification of Sytraitniki/ Geographic/Sospolzanievpoysk geoclassification of sites)) |
128 | LinksWithWordsPercent | The percentage of incoming links with the words of the request |
129 | LinksWithAllWordsPercent | The percentage of incoming links with all the words of the request |
130 | PornoQuery | Whether the query contains any words from Yweb/Pornofilter/Porno.query. |
131 | IsPorno | Document from porn kitski |
132 | IsComm | A document from a commercial clay. Not used (deprecated) |
133 | IsFake | Fake document |
134 | IsSEO | The page title contains commercial vocabulary. Not used (deprecated) |
135 | IsWiki | Page from ru.wikipedia.org |
136 | IsEShop | Commercial page (Classifier Savina) |
137 | GeoRegionProxim | HasNoAllWordsTRSy |
138 | HasNoAllWordsTRSy | The document does not contain all the query words (matching up to synonyms) |
139 | NumWordsTRSy | The percentage of the query words found in the document (matching up to synonyms) |
140 | HasAllWordsTRSy | The document contains all the query words (matching up to synonyms) |
141 | NumWordsLR | The percentage of the query words found in the links (matching up to synonyms) |
142 | HasAllWordsLR | The links contain all the query words (matching up to synonyms) |
143 | PayDetectorPredict | The value of the commerce detector calculated in the Hippo. |
144 | TxtInvPair | TR over pairs of query words in reverse order |
145 | LnkInvPair | LR over pairs of query words in reverse order |
146 | TxtSkipPair | TR over pairs of query words separated by one word in texts |
147 | LnkSkipPair | LR over pairs of query words separated by one word in texts |
148 | NumWordsTRFm | The percentage of the query words found in the text (matching up to word form) |
149 | HasAllWordsTRFm | The document contains all the query words (matching up to word form) |
150 | QDiversity | The degree of concentration of the locations from which the query is issued |
151 | QBlog | Whether the query contains blog vocabulary |
152 | XGeoLRlogRelev | log(LR, restricted to the user’s country) |
153 | XLerfGeoLRlogRelev | log(LerfLR, restricted to the user’s country) |
154 | NonCommercialQuery | Binary non-commercial query flag: QueryNonCommerciality > 0.965. |
155 | XLExactMatchesMap | The number of links that match the query text (different remap) |
156 | XLerfNormLRlogRelev | XLerfLRlogRelev (normalized by the sum of the Lerf weights of all links rather than by the sum of their original weights) |
157 | XNonCommNormLRlogRelev | XNonCommLRlogRelev (normalized by the sum of the NonComm weights of all links rather than by the sum of their original weights) |
158 | XNonCommThNormLRlogRelev | Link relevance, taking into account the non-commerciality of each link and thematicity |
159 | XNonCommLerfNormLRlogRelev | XNonCommLerfNormLRlogRelev (normalized by the sum of the NonCommLerf weights of all links rather than by the sum of their original weights) |
160 | XNonCommLerfThNormLRlogRelev | Link relevance, taking into account the non-commerciality of each link, the quality of each link and thematicity |
161 | Nevasca1 | Content factor; not used. Host ‘goodness’ (from 0 to 1), calculated from how many hosts, and which ones, borrow content from this host. |
162 | Nevasca2 | Content factor; not used. Host ‘badness’ (from 0 to 1): proportional to the amount of secondary content on the host. |
163 | LinkAge | The average age of the links that contributed to LR: LinkAge = min(log(average link age)/7, 1); an age of 3 years maps to 1 |
164 | TLen | The length of the page text in words: TLen = map(number of words, 1/400), where map(x, y) = x*y / (1 + x*y) |
165 | IsUnreachable | The page cannot be reached by following links from the home page. |
166 | XLangLRlogRelev | LR, taking into account whether the language of the link matches that of the query |
167 | XLerfLangLRlogRelev | LR, taking into account whether the language of the link matches that of the query, and accuracy |
168 | QueryURLClicksFRC | The ratio of the number of clicks on this URL to all clicks for the query |
169 | QueryDOwnerClicksFRC | The ratio of the number of clicks on this Domainid to all clicks for the query |
170 | QueryURLClicksPCTR_copy | [Bug: a copy of factor 45] How often this URL is clicked for this query: CTR with a correction factor |
171 | DoppQueryUrlSessionClicksFRCCity | What share (on average per session) of the URLs the user clicked through to is this URL. Computed over user sessions. |
172 | QueryURLClicksPCTR_Reg | How often this URL is clicked for this query: CTR with a correction factor, by small regions from Relev_regions.web.txt |
173 | QueryDOwnerClicksPCTR_Reg | How often URLs of this Domainid are clicked for this query: Domainid CTR with a correction factor, by small regions from Relev_regions.web.txt |
174 | QueryURLClicksFRC_Reg | The ratio of the number of clicks on this URL to all clicks for the query, by small regions from Relev_regions.web.txt |
175 | QueryDOwnerClicksFRC_Reg | The ratio of the number of clicks on this Domainid to all clicks for the query, by small regions from Relev_regions.web.txt |
176 | QueryURLClicksCombo_Reg | QueryURLClicksCombo, by small regions from Relev_regions.web.txt |
177 | QueryDOwnerClicksCombo_Reg | QueryDOwnerClicksCombo, by small regions from Relev_regions.web.txt |
178 | XLRCatalogRelev | LR over catalog descriptions |
179 | XLRYaCatalogRelev | LR over descriptions in Yandex.Catalog |
180 | ExactWordOrderLen | The length of the longest match, by exact form, between the text and the query |
181 | ExactWordOrderWeight | The weight of the longest match, by exact form, between the text and the query |
182 | WordOrderLen | The length of the longest match, by lemma, between the text and the query |
183 | WordOrderWeight | The weight of the longest match, by lemma, between the text and the query |
184 | LinkMaxAge | The maximum age of a significant accumulation of links that contributed to LR |
185 | TRp1All | Variants of the relevance factors that take stop words into account |
186 | LRp1All | Variants of the relevance factors that take stop words into account |
187 | TLp1All | Variants of the relevance factors that take stop words into account |
188 | BFexactAll | Variants of the relevance factors that take stop words into account |
189 | BFlemmaAll | Variants of the relevance factors that take stop words into account |
190 | PassageLegacyTR | TR of the best passage, i.e. how good the snippet is |
191 | TxtBM25AttenSyn | Tr with discount for suggestions |
192 | MaxWordHostRank | Host rank for the most characteristic word of the query (usually the site name) |
193 | MaxWordHostClicks | Domattr click rate for the most characteristic word. For example, for all queries containing the word Wikipedia, users click on Wikipedia results. |
194 | DomPhraseRank | Hostrank by separate words |
195 | DomPhraseClickRank | Domain clickability by words |
196 | IsForum | The URL matches the forum_detector regular expression |
197 | AliceMusicTrackTitleAnnotationMatchWeightedValue | The value of the AnnotationMatchWeightedValue factor for the AliceMusic stream |
198 | IsObsolete | The URL contains an old date; used to recognize old news. The factor is 1 if the URL contains a year <= 2007. |
199 | TRWithStops | Weight of maximum coincidence of forms in the text and request |
200 | LRWithStops | Weight of maximum coincidence of forms in the text and request |
201 | HasPayments | The page mentions ‘payment by SMS’. |
202 | IsLinkPessimised | The antispam team has pessimized the site: all dynamic link factors are zeroed. Zerolnk.flt |
203 | EshopValue | How shop-like the page is |
204 | PornoValue | Pornography of the page |
205 | TrafgraphOutAll_share_m | Remapped mascot feature TrafgraphOutAll_share_m |
206 | TrafgraphOutAllSE_share_d | Remapped mascot feature TrafgraphOutAllSE_share_d |
207 | TrafgraphOutAllSE_share_m | Remapped mascot feature TrafgraphOutAllSE_share_m |
208 | NoExtClicksShare | Remapped mascot feature NoExtClicksShare |
209 | CountersSearchTraffic1 | Search traffic – transitions from search engines to the site (2nd formula) |
210 | CountersSearchTraffic2 | Search traffic – transitions from search engines to the site (2nd formula) |
211 | DomPhraseYabar | Transitions to the site from search engines by individual words, according to the bar |
212 | AliceMusicArtistNameBclmMixPlainK000001 | The BclmMixPlainK000001 factor |
213 | QueryUrlLCS | The longest common substring of the URL and the query, normalized by the URL length |
214 | OnlyUrl | All matches are in the URL only; there are no matches in the text |
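215 | GeoRelevRegionCity | Three levels of match between the geography of the user and the page: city level |
216 | GeoRelevRegionRegion | Three levels of match between the geography of the user and the page: region level |
217 | GeoRelevRegionCountry | Three levels of match between the geography of the user and the page: country level |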
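218 | XLRGeoRelevRegionCity | Three levels of match between the region of the links and the query: city level |
219 | XLRGeoRelevRegionRegion | Three levels of match between the region of the links and the query: region level |
220 | XLRGeoRelevRegionCountry | Three levels of match between the region of the links and the query: country level |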
221 | GeoCountryProxim | Geographical proximity |
222 | IsNavQuery | Whether the query is navigational, judged by clicks on the results |
223 | MaxWordHostYaBar | The most characteristic word of the request corresponding to the site, according to the bar |
224 | FirstWordHostClicks | The click rate of the host for the first word of the query. Quite often, the first (or last) word is a clear indication of the site on which the information should be sought. |
225 | AliceMusicArtistNameCMMatchTop5AvgMatch | The value of the CMMatchTop5AvgMatch factor for the AliceMusic stream |
226 | QueryDOwnerYabarVisits | QueryDOwnerYabarVisitors |
227 | QueryDOwnerYabarVisitors | QueryDOwnerYabarAvgTime |
228 | QueryDOwnerYabarAvgTime | The average, over users, of the user’s continuous active time (in seconds) on the host’s pages after arriving from the search engine for the query (the factor depends on the (query, Domattr) pair). |
229 | QueryDOwnerYabarAvgTime2 | The average, over users, of the user’s continuous active time (in seconds) on the host’s pages after arriving from the search engine for the query (the factor depends on the (query, Domattr) pair). Based on the internal Yandex.Bar/Elements/Browser counter |
230 | QueryDOwnerYabarAvgActions | The average, over users, of the number of active actions (clicks, keystrokes) during the user’s continuous stay on the host’s pages after arriving from the search engine for the query (the factor depends on the (query, Domattr) pair). Based on the internal Yandex.Bar/Elements/Browser counter |
231 | QueryUrlYabarVisits | QueryUrlYabarVisitors |
232 | QueryUrlYabarVisitors | The number of unique visitors from search engines for a specific request |
233 | QueryUrlYabarAvgTime | The average, over users, of the user’s continuous active time (in seconds) on the page after arriving from the search engine for the query (the factor depends on the (query, URL) pair). |
234 | QueryUrlYabarAvgTime2 | The average, over users, of the user’s continuous active time (in seconds) on the page after arriving from the search engine for the query (the factor depends on the (query, URL) pair). Based on the internal Yandex.Bar/Elements/Browser counter |
235 | QueryUrlYabarAvgActions | The average, over users, of the number of active actions (clicks, keystrokes) on the page after arriving from the search engine for the query (the factor depends on the (query, URL) pair) |
236 | DssmBertDistillSinsigMseBaseRegChain | A pool of logs is labeled by BERT trained on Sinsig. A DSSM model is trained on this pool using BaseregionChain |
237 | DssmBertDistillRelevanceMseBaseRegChain | A pool of PRS logs is labeled by BERT trained for relevance. A DSSM model is trained on this pool using BaseregionChain |
238 | AliceMusicArtistNamePerWordCMMaxMatchMin | The value of the PerWordCMMaxMatchMin factor for the AliceMusic stream |
239 | AliceMusicArtistNameAttenV1_Bm15_K05 | The value of the AttenV1_Bm15_K05 factor for the AliceMusic stream |
240 | AliceMusicAlbumTitleAnnotationMaxValueWeighted | The value of the AnnotationMaxValueWeighted factor for the AliceMusic stream |
241 | IsForeignQuery | Request is not in Russian |
242 | IsForeignCluster | foreign cluster document |
243 | PageRegionSizeIn | The size of the page’s region |
244 | PageRegionInvSizeIn | The factor is inversely proportional to the size of the page region |
245 | QueryRegionSize | The size of the region of the request |
246 | QueryRegionInvSize | The factor is inversely proportional to the size of the query region |
247 | GeoGeometryProxim | The geographical proximity of the user and the site |
248 | RingsHostRankBadnessOld | Characterizes promotion of the site with link rings. The value is the share of external links that belong to link rings and battleships. |
249 | YabarHostVisitors | The number of unique visitors, remapped exponentially |
250 | YabarHostSearchTraffic | The share of traffic from search engines |
251 | YabarHostInternalTraffic | The share of visits to the site that did not come via links (typed in by hand or opened from bookmarks) |
252 | YabarHostAvgTime | The average, over users, of the user’s continuous active time (in seconds) on the host’s pages |
253 | YabarHostAvgTime2 | The average, over users, of the user’s continuous active time (in seconds) on the host’s pages. Based on the internal Yandex.Bar/Elements/Browser counter |
254 | YabarHostAvgActions | The average, over users, of the number of active actions (clicks, keystrokes) during the user’s continuous stay on the host’s pages. |
255 | YabarHostBrowseRank | Implementation of the algorithm described in the article ((http://research.microsoft.com/en-us/people/tyliu/fp032-liu.pdf)) |
256 | YabarUrlVisits | URL visits according to Yandex.Bar |
257 | YabarUrlVisitors | The number of unique visitors to the URL |
258 | YabarUrlAvgTime | The average, over users, of the time a user spends on the page. Computed as the difference between consecutive transitions. |
259 | OwnerSatisfied4Rate | This is the SEA factor = s4_r/(k_r+10), where s4_r is the number of clicks longer than 180 sec and k_r is the total number of clicks. Computed taking reformulations into account. |
260 | OwnerSatisfied4Rate_Reg | This is the SEA factor = s4_r/(k_r+10), where s4_r is the number of clicks longer than 180 sec and k_r is the total number of clicks. Computed taking reformulations into account. Localized version |
261 | UrlQueryVariety | The degree of variety of the queries for which this URL is clicked |
262 | IsCommByKeywords | Commercial page by keywords. Not used (deprecated) |
263 | DocIdfSum_broken | IDF for various parts of the document; broken, not used |
264 | TitleIdfSum_broken | IDF for various parts of the document; broken, not used |
265 | HeadingIdfSum_broken | IDF for various parts of the document; broken, not used |
266 | NormalTextIdfSum_broken | IDF for various parts of the document; broken, not used |
267 | XLRVideoRelev | Link factor about the presence of a video on the page. |
268 | AuxTextBM25 | BM25 for the user region for localized queries, for the unflapped in Cuba, is a country. The texts of the queries sent for the regions can be viewed in Relev_regions.txt in the sorcerer |
269 | AuxLinkBM25 | The same for link relevance |
270 | CommLinksSEOHosts | The share of incoming paid links. An algorithm for recognizing commercial links is implemented. The factor is remapped to [0, 1] if the share of such links is 50%, otherwise it is 0. ((http://wiki.yandex-team.ru/svetlanashorina/topseolinks selection of wound sites)) |
271 | CommLinksSEOHostsPornoQuery | Previous factor multiplied by PornoQuery |
272 | CommLinksSEOHostsNonComm | CommLinksSEOHosts factor multiplied by NonCommercialQuery |
273 | TovarCategoryQuery | The query mentions a product category. Not used (deprecated) |
274 | TovarCategoryVendor | The query mentions a vendor. Not used (deprecated) |
275 | Diversity2 | Geographical distribution of the request |
276 | NightQuery | The query is issued mainly at night |
277 | MorningQuery | The query is issued mainly in the morning |
278 | DayQuery | The query is issued mainly in the afternoon |
279 | EveningQuery | The query is issued mainly in the evening |
280 | HourDiversity | How pronounced the issuing of the query is at different times of day |
281 | LCor | Characterizes the frequency of words in links. The factor is large if the word that contributed to link relevance is rare in links. |
282 | SubqueryThMatchA | Match between the thematic spectra of the query and the document. Query themes are the output of ((http://wiki.yandex-team.ru/evgenijjkroxalev/subquery the rules of the Subquerysearch wizard)). The subject of the document is determined by an automatic classifier |
283 | TRDocQuorum | The weight of the query words that occur in the text |
284 | LRDocQuorum | The weight of the query words that occur in the links |
285 | TRLRDocQuorum | The weight of the query words that occur in the text and links |
286 | OwnerSDiffClickEntropy | Entropy – distribution of clicks |
287 | OwnerSDiffShowEntropy | Entropy – distribution of shows |
288 | OwnerSDiffCSRatioEntropy | Entropy – distribution of clicks/shows. |
289 | XPornoLRlogRelev | How pornographic the document is, judged by link text |
290 | XPornoNormLRlogRelev | How pornographic the document is, judged by link text; different normalization |
291 | XPornoQuery | Classifier of porn queries; uses a different dictionary than PornoQuery |
292 | AliceMusicAlbumTitleAttenV1_Bm15_K05 | The value of the AttenV1_Bm15_K05 factor for the AliceMusic stream |
293 | GeoCountryCountryProxim | The geographical proximity of the country of the site and the country of request |
294 | UrlDomainFraction | Coverage of the domain’s trigrams by the query. (Example: Chelyabinsk lottery and Chelloto. The query is transliterated, the trigrams that are covered are found (Che, Hel, Lot, Olo), and the share of all trigrams that are covered is computed.) |
295 | UrlPathAndParamsFraction | The same as the previous factor, but for the entire URL except the domain |
296 | SpecificalQuery | The query is local-specific: it is often reformulated with the region stated explicitly. ((https://ml.yandex-team.ru/archive/thread1433892/#Message1433892 more)) |
297 | JokerLen | Text features are computed as if the page title were attached to each of its sentences, i.e. the distance between a word from the title and any other word is 1 sentence. Len is the maximum, over sentences (with the title attached), of the ratio of the number of query words found in the sentence to the length of the query. Example: [Harms Circus Vertunov] for ((http://wiki.yandex-team.ru//h.yandex.net/?http%3A%2F%2FWWWWIKILIVRES.info%2FWIKI%2F%25D0%25A6%25D %25b8%25D1%2580%25D0%25D0%25A %25BC%25D1%2581%of this document)) |
298 | JokerWeight | The ratio of the amount of IDF words in a sentence+Title to all words. |
299 | ExactJokerLen | The same as JokerLen, using exact forms |
300 | ExactJokerWeight | The same as JokerWeight, using exact forms |
301 | More120SecVisitsNotSearchShare | Remapped mascot feature More120SecVisitsNotSearchShare |
302 | LnkBreak | Analogs of the corresponding text factors for links. BM25 from the number of links in which a coincidence occurred. |
303 | LnkBm25Ex | Simple BM25 in the exact form in link texts |
304 | LnkPairSy | The presence of pairs in the links of the words, taking into account synonyms |
305 | LnkBrkSy | The number of links that passed the threshold |
306 | LnkBm25Sy | Simple BM25 by links taking into account synonyms |
307 | VideoQuery | Request about the video |
308 | OwnerClicksPCTR_Reg | The owner’s click rate regardless of the query, computed separately by region |
309 | OwnerSDiffClickEntropy_Reg | Entropy is the distribution of clicks. Regionalized |
310 | OwnerSDiffShowEntropy_Reg | Entropy is the distribution of shows. Regionalized |
311 | OwnerSDiffCSRatioEntropy_Reg | Entropy – distribution of clicks/shows. Regionalized |
312 | Adultness | equals 2 * NastyContent |
313 | HostAdultness | equals 2 * NastyContent |
314 | KCHostAdultness | always zero |
315 | IsCom | Domain in the .com zone |
316 | IsUa | Domain in the .ua zone |
317 | IsNotRu | Domain is not in the .ru zone |
318 | XLRMarketRelev | LR by links from Yandex.Market |
319 | Poetry | The poetry of the document |
320 | PoetryQuad | The maximum poetry of the quatrain |
321 | EngLang | Document language – English |
322 | Has2ExactQueryParts | The query is completely covered by two exact groups, each consisting of exact matches of consecutive query words ((http://wiki.yandex-team.ru/poiskovajaplatform/tr/coveragebygroups more on coverage by groups)) |
323 | HasLevensht1QueryFragment | There is a group of exact matches of the query words that covers the query (possibly with one word skipped, added, or replaced) |
324 | LargestSyInexactGroup | The share of the query covered by the longest group consisting of any hits (including word forms and synonyms), possibly with one word skipped, added, or replaced |
325 | TimeProfilesMatchWD | Characterizes the closeness of the time profiles of the query and the document on business days |
326 | TimeProfilesMatchWE | Characterizes the closeness of the time profiles of the query and the document on weekends |
327 | CyrLang | The language of the document is Cyrillic |
328 | GeoRegionalityU | Query factors: the output of the ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayaformula/tekushhiekomponenty/georegionality query geolocalization classifier)) |
329 | GeoRegionalityR | R: geo-relevant; regional results in the search results could be useful, but nothing more |
330 | GeoRegionalityV | V: geo-vital; regional results are of fundamental importance |
331 | UrlHasNoDigits | There are no digits in the URL |
332 | AliceMusicTrackArtistNamesAllWcmMaxMatch | The AllWcmMaxMatch factor |
333 | AliceMusicTrackAlbumTitleCosineMatchMaxPrediction | The value of the CosineMatchMaxPrediction factor for the AliceMusic stream |
334 | SynS1 | Shows how unnatural the text is from the standpoint of the Russian language: an estimate of how likely the document text is to have been generated by a synonymizer or other automatic tool. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAfermula/tekushhiekomponenty/antispam?v=1il#h58953-2 more)) |
335 | SynFLremap1 | Shows how unnatural the text is from the standpoint of the Russian language: an estimate of how likely the document text is to have been generated by a synonymizer or other automatic tool. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAfermula/tekushhiekomponenty/antispam?v=1il#h58953-2 more)) |
336 | SynFLremap2 | Shows how unnatural the text is from the standpoint of the Russian language: an estimate of how likely the document text is to have been generated by a synonymizer or other automatic tool. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAfermula/tekushhiekomponenty/antispam?v=1il#h58953-2 more)) |
337 | OwnerSessNormDuration | ND/K normalized time to click |
338 | UrlSessNormDurRate | nd/i |
339 | QueryDOwnerSessNormDuration | CONTRY / K |
340 | QueryDOwnerWeightClick | w/k |
341 | QueryDOwnerOnlyClickRate | o/i |
342 | QueryDOwnerClickSummary | Selected formula |
343 | QueryDOwnerSatisfied4Rate | r_s4b/(r_k + 10) |
344 | SyntQuality | Does the request have a complete syntactic analysis |
345 | PageDate | The date of the document as stated on the page, remapped |
346 | VisitsPVisitors | Remapped mascot feature VisitsPVisitors |
347 | RingsHostRankBadness2 | Additional factors about promotion of the site with link rings, ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAformula/tekushhiekomponenty/antispam?v=181r#h58953-4)) |
348 | RingsHostRankBadness3 | Additional factors about promotion of the site with link rings, ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAformula/tekushhiekomponenty/antispam?v=181r#h58953-4)) |
349 | RingsHostRankBadness4 | Additional factors about promotion of the site with link rings, ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAformula/tekushhiekomponenty/antispam?v=181r#h58953-4)) |
350 | HasTextPos | The document has textual relevance |
351 | QSegmentsBM25 | BM25, where the selected segments of the request act as ‘words’ |
352 | QSegmentsWeight | ‘Weight’ of the query segments in the text |
353 | SynPercentBadWordPairs | An indicator of how unnatural the text is from the standpoint of the Russian language. The number of bad word pairs in the text, mapped onto the interval [0, 1] by the formula Z/(Z+10) |
354 | SynNumBadWordPairs | The proportion of bad pairs among all pairs found in the table: Z/(X+1), where Z is the number of bad pairs in the text and X is the number of ((http://wiki.yandex-team.ru/evgenijgrechnikov/testsynonimizers of 2000-navigable)) pairs |
355 | NumLatinLetters | The number of Latin letters in the text (not counting markup), mapped into [0, 1] by the formula n/(n+100) |
356 | RingsHostRankBadness1 | Additional factors about promotion of the site with link rings, ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAformula/tekushhiekomponenty/antispam?v=181r#h58953-4)) |
357 | DocIdfSumFixed | Previous factors – fixed |
358 | TitleIdfSumFixed | Previous factors – fixed |
359 | HeadingIdfSumFixed | Previous factors – fixed |
360 | NormalTextIdfSumFixed | Previous factors – fixed |
361 | QueryURLClicksCombo | factor cunningly combined from FRC and Pseudo-CTR |
362 | QueryDOwnerClicksCombo | factor cunningly combined from FRC and Pseudo-CTR |
363 | LRAmortizedByAge | Link relevance with a penalty for old links |
364 | RusWordsInText | The number of words in the text (a word is whatever the lemmatizer selected), mapped into [0, 1] by the formula x/(x+a) |
365 | RusWordsInTitle | The number of words of the Russian language in the title |
366 | MeanWordLength | The average length of the word |
367 | PercentWordsInLinks | The percentage of the number of words inside the tag <a> .. </a> from the number of all words |
368 | PercentVisibleContent | The percentage of the number of words outside the tags (outside the brackets <>) from the number of all words |
369 | PercentFreqWords | The percentage of the words of the text that are among the 200 most frequent words of the language |
370 | PercentUsedFreqWords | The number of the language’s 500 most popular words that are used in the text, divided by 500 |
371 | TrigramsProb | Logarithm of the geometric mean of the probabilities of the trigrams in the text (the probability of a trigram is the number of its occurrences in the text divided by the number of all trigrams), mapped into [0, 1] by the formula -x (x+a) |
372 | TrigramsCondProb | Logarithm of the geometric mean of the conditional probabilities of trigrams. The conditional probability of a trigram is its probability divided by the probability of the bigram formed by its first two words |
373 | DoppDOwnerPCTR | An analogue of the QueryDOwnerClicksPCTR factor; it differs in that queries are normalized by doppelgangers (details of this normalization: ((http://staff.yandex-team.ru/finder Andrei Plakhov)); code: yandex/doppelganges) |
374 | DoppDOwnerPCTR_Reg | An analogue of the QueryDOwnerClicksPCTR factor; it differs in that queries are normalized by doppelgangers (details of this normalization: ((http://staff.yandex-team.ru/finder Andrei Plakhov)); code: yandex/doppelganges). Localized by Relev_regions.web.txt |
375 | DoppUrlPCTR | An analogue of the QueryURLClicksPCTR factor; it differs in that queries are normalized by doppelgangers (details of this normalization: ((http://staff.yandex-team.ru/finder Andrei Plakhov)); code: yandex/doppelganges) |
376 | DoppUrlPCTR_Reg | An analogue of the QueryURLClicksPCTR factor; it differs in that queries are normalized by doppelgangers (details of this normalization: ((http://staff.yandex-team.ru/finder Andrei Plakhov)); code: yandex/doppelganges). Localized by Relev_regions.web.txt |
377 | UrlBM25 | BM25 over the URL |
378 | HasBigPicture | The page has a big picture |
379 | MatrixNet | Matrixnet is applied to all factors – formula (tg_unized – to prevent the entrance to any formulas) |
380 | DaterAge | The difference between the current date and the date of the document as determined by the dater: 1 means the document date equals the current date; 0 means the document is 10 years old or more, or the date is not defined. Note: ((1 – DaterAge)*60)^2 = age of the page in days. |
381 | IsHardPessimization | Hard pessimization (AKA PR = 0), binary factor, is considered in Antispam |
382 | CInDegree1 | Host factors that detect sites inflated by links: the second and third in-degrees ((http://wiki.yandex- team.ru/jandekspoisk/kachestvopoiska/obshayafformula/ |
383 | CInDegree2 | Host factors that detect sites inflated by links: the second and third in-degrees ((http://wiki.yandex- team.ru/jandekspoisk/kachestvopoiska/obshayafformula/ |
384 | NumNonRussianLinks | The number of incoming links without Russian letters. Remapped. |
385 | TextMaxForms | The maximum number of forms over all query words: the max over all query words of number_of_forms_for_word/64 |
386 | TextWeightedForms | The sum of the numbers of forms weighted by word weights: the sum over all query words of number_of_forms_for_word/64*word_weight; remap of the form x/(1 + x). |
387 | TextForms | The unweighted sum of the numbers of forms: the sum over all query words of number_of_forms_for_word/64/number_of_words |
388 | LinkMaxForms | The maximum number of forms over all query words |
389 | LinkWeightedForms | The sum of the numbers of forms weighted by word weights |
390 | LinkForms | The unweighted sum of the numbers of forms |
391 | TR_W1 | Analogues of the factors of the same name, the weight of the word = 1 |
392 | XLR_W1 | Analogues of the factors of the same name, the weight of the word = 1 |
393 | TextBM25_Fm_W1 | Analogues of the factors of the same name, the weight of the word = 1 |
394 | TextBM25_Sy_W1 | Analogues of the factors of the same name, the weight of the word = 1 |
395 | LinkBM25_W1 | Analogues of the factors of the same name, the weight of the word = 1 |
396 | TLBM25_W1 | Analogues of the factors of the same name, the weight of the word = 1 |
397 | QSegmentsBreaks | Query segments are parts of the query that are themselves frequent queries. The factor shows how much the segments are broken up in the text: 0 means all words occur only within the identified segments; 1 means every occurrence breaks a segment |
398 | AliceMusicTrackLyricsCMMatchTop5AvgMatch | The value of the CMMatchTop5AvgMatch factor for the AliceMusic stream |
399 | NumeralsPortion | The share of different parts of speech in the text. The share of numerals (among all words whose part of speech could be recognized) |
400 | ParticlesPortion | The share of particles |
401 | AdjPronounsPortion | The share of pronominal adjectives |
402 | AdvPronounsPortion | The proportion of pronoun nouns |
403 | VerbsPortion | The share of verbs |
404 | FemAndMasNounsPortion | The share of words that can be both masculine nouns and nouns of the feminine, but not of the middle kind, among all nouns (examples: ‘hummingbirds’ are an example of an indefinite kind that can be determined in two ways, ‘Alexander’ is homonymy). |
405 | LinkQualityFixed | Quality of incoming links (hauser classifier) corrected |
406 | HasLinkQualityFixed | Whether LinkQuality was computed for this page (it is not computed if there are few links); corrected |
407 | NewLinkQualityFixed | Quality classifier of incoming links 2 corrected |
408 | IsOrg | The request is the name of the organization (example: Gazprom, Gazprom) ((http://wiki.yandex-team.ru/arsengadzhikurbanov/warees Description)) |
409 | AliceMusicArtistNameCMMatchTop5AvgMatchValue | The value of the CMMatchTop5AvgMatchValue factor for the AliceMusic stream |
410 | LongestText | The size of the largest text segment (from the factor [18] puretext) |
411 | SmartUkrainian | |
412 | SmartBelorussian | |
413 | LRWithoutRare | Link relevance without taking into account rare words |
414 | DifferentInternalLinks | The number of different internal links to the page |
415 | HasDeterminedCities | The city is defined for the site |
416 | GeoRegionalityUNew | Query factors: the result of the ((http://wiki.yandex-team.ru/poiskovajaplatforma/lingvistika/zaprosnyjefefactory/localizovannyjezaprosya query geolocalization classifier)), a new version of factors [328]-[330]. U: the region is meaningless for the query; |
417 | GeoRegionalityRNew | Query factors: the result of the ((http://wiki.yandex-team.ru/PoiskovajaPlatforma/Lingvistika/ZaprosnyjeFactory/LocalizovannyjeZaprosy query geolocalization classifier)), a new version of factors [328]-[330]. R (geo-relevant): regional results in the output could be useful, but nothing more; |
418 | GeoRegionalityVNew | Query factors: the result of the ((http://wiki.yandex-team.ru/poiskovajaplatforma/lingvistika/zaprosnyjefefactory/localizovannyjezaprosya query geolocalization classifier)), a new version of factors [328]-[330]. V: the region is of fundamental importance. |
419 | AliceMusicArtistNamePerWordCMMaxPredictionMin | The value of the PerWordCMMaxPredictionMin factor for the AliceMusic stream |
420 | UkrainPageRank | Ukrainian Page Rank |
421 | QClassDownload | Query classifier result: download |
422 | QClassBrandnames | Query classifier result: the query contains words from the corresponding dictionary. Brand names |
423 | QClassDisease | Medication Dictionary |
424 | QClassKak | question |
425 | QClassMoscow | Specific request for Moscow |
426 | QClassOAO | organization |
427 | QClassPorno | porn |
428 | QClassTravel | trips |
429 | VideoRating | The popularity of the video clip; comes from the video service |
430 | PeriodicLinkDatesPercent | The frequency of links to the site |
431 | LinkAlmostPeriod | The number of almost-periodic links |
432 | QDOwnerStatPower | The number of impressions of the owner for the query, normalized as x/(100 + x). |
433 | QUrlStatPower | The number of impressions of the URL for the query, normalized as x/(100 + x). |
434 | HasLiRuCounter | The presence of a LiveInternet counter |
435 | OwnerReqsPopularity | The popularity of the owner in queries |
436 | DssmYaMusicASREarlyBindingCe | DSSM model with early binding, trained on reformulations and fine-tuned on ASR hypotheses of music queries for Alice |
437 | DssmBertDistillSinsigCeCountryRegChain | A model trained on a PRS-Law PRS to predict BERT, trained on sinsig_ce with threshold value 0.5, using a chain of regions to the country |
438 | DssmYaMusicEarlyBindingCe | DSSM model with early binding, trained on reformulations and fine-tuned on music queries for Alice |
439 | SecondIndegDistrXi | Eleven factors based on the statistical properties of the distributions of the in-degrees of vertices linking to a fixed vertex of the host graph. ((Http://wiki.yandex-team.ru/jandekpoisk/kachestvopoiska/obshayaformula/tekushhiekmponenty/HostdDEGRE) |
440 | PiracyDetectorPredict | The value of the piracy detector computed in Begemot. |
441 | AliceMusicUrlTypeIsAlbum | Type of the canonicalized Yandex Music URL: album |
442 | FirstValidTs10Days | Computed as (10 - x), where x is the age of the document in days (continuous) relative to the validity time of the document in Samovar |
443 | HostInQuery | The host of the document is recognized in the query |
444 | VitalHostInQuery | The URL consists only of the host, which is recognized in the query |
445 | YandexNewsStoryUrl | The URL is a Yandex News story |
446 | RcSpylogUrlRationalSigmoidD1T240 | URL feature computed from rapid clicks spy_log counters with decay of 1 day |
447 | RcSpylogUrlRationalSigmoidD1T240Frozen | URL feature computed from rapid clicks spy_log counters with decay of 1 day |
448 | RcSpylogUrlRationalSigmoidD0_5T30 | URL feature computed from rapid clicks spy_log counters with decay of 0.5 days |
449 | RcSpylogUrlRationalSigmoidD0_5T30Frozen | URL feature computed from rapid clicks spy_log counters with decay of 0.5 day |
450 | Timestamp | Computed as (80 – x) / 80, where x is the age of the document in hours. The factors make sense only for the fast-robot base (the last 80 hours). Not used in ranking. Used in disconnecting. |
451 | AddTimeFull | Computed as (80 – x) / 80, where x is the age of the document in hours. The factors make sense only for the fast-robot base (the last 80 hours). Not used in ranking. Used in disconnecting. |
452 | Swbm25 | A tricky BM25 in a sliding window. The window size is set in sentences. ‘Jokers’ are used for headings and the beginning of the document. Morphological proximity and text structure are taken into account. The window weight decays with distance from the beginning of the document. |
453 | PositionLanguageModel | A factor indicating whether a good snippet can be produced. |
454 | TxtPair_W1 | Simple BM25 over word pairs: we take all pairs of query words and count their occurrences in the document text. Weight = 1. It does not work if the query contains a stop word |
455 | AuraDocLogShared | Logarithm of the number of shingles on which this document is not unique |
456 | AuraDocLogAuthor | Logarithm of the number of shingles on which this owner of the document is recognized as the author |
457 | AuraDocMeanSharedWeight | The average weight of the non-unique shingles of this document |
458 | MarketQualityRating | Mascot feature MarketQualityRating |
459 | Medical2HostQuality | Medical host quality for new marks. |
460 | Medical2HostQualityFresh | Medical host quality for new marks for experiments. |
461 | FinLawHostQuality | Finance or law host quality for new marks. |
462 | FinLawHostQualityFresh | Finance or law host quality for new marks for experiments. |
463 | SosHostQuality | Finance or law host quality for new marks. |
464 | SosHostQualityFresh | Finance or law host quality for new marks for experiments. |
465 | CsDocumentationHost | Factor for host in list of documentation cs hosts for experiments |
466 | Remved_466 | |
467 | RegHostRank | Computed in the same way as the HostRank factor, not on the whole owner graph but on its subgraph consisting of the owners in the given region. Belonging to the region is determined by TLD, or by the presence in the index of pages from this owner that the GEO or Geoa classifier assigns to this region. Mapped in the same way as the HostRank factor, from 0 to 1 with 256 gradations |
468 | RegIsWiki | A document from the language section of Wikipedia corresponding to the user region |
469 | LanguageCompliance | The language of the document corresponds to the language language |
470 | CountryPopularQ | The popularity of the request within the country |
471 | CountryQDiversity | The degree of centralization of the points from which the request is set (inside the country) |
472 | CountryQDiversity2 | Geographical distribution of the request within the country |
473 | CountryHour | The hour at which this query is issued most often |
474 | CountryHourDiversity | How strongly the query volume varies across different times of the day (inside the country) |
475 | Removed_475 | |
476 | NationalDomain | The country of the document (domain) and the country of the user coincide ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayaFormula/tekushhiekomponijafaktorov#national |
477 | IsPornoAdvert | Porn advertising is present on the page |
478 | RcSpylogUrlRationalSigmoidD3T120 | URL feature computed from rapid clicks spy_log counters with decay of 3 days |
479 | CountryQueryRegionality | Country classifier of localization – how much the request implies the context of the country |
480 | NumSlashes | The number of slashes in Url |
481 | BM25FdPR_obsolete | BM25 with different parameters for different fields, including incoming anchor text. The weight of the text of links pointing to the page is normalized depending on the delta PageRank of the links |
482 | WatchVideo | The presence of an embedded video player on the page |
483 | DownloadVideo | Video for downloading |
484 | RcSpylogUrlRationalSigmoidD3T120Frozen | URL feature computed from rapid clicks spy_log counters with decay of 3 days |
485 | RcSpylogUrlRationalSigmoidD14T300 | URL feature computed from rapid clicks spy_log counters with decay of 14 days |
486 | SubRelevance | A service factor that was needed for site search and will be needed again in the future. |
487 | GskUrlModel | The factor is calculated from the text of Url using the classifier of sequences Quality/Seq/GSK |
488 | UrlTrigrams | A model that trains each trigram on ‘+’ and ‘-’ URLs. It does not depend on the query. |
489 | RcSpylogUrlRationalSigmoidD14T300Frozen | URL feature computed from rapid clicks spy_log counters with decay of 14 days |
490 | RcSpylogAge | Age of rapid clicks spy_log update, in seconds |
491 | RcSpylogFreshness | Freshness of rapid clicks spy_log update |
492 | YmwFull | The size of the minimum piece of text, including all the words of the request found in the document. Not used now. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAformula/tekushhiekomponenty/ymw Read more)) |
493 | Bclm | Büttcher, Clarke and Lushman factor (modified) ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAformula/tekushichiekomponenty/bclm more)) |
494 | QueryCommercialityMx | A measure of how ‘commercial’ the query is. A composite factor computed by a MatrixNet formula from the purchase vocabulary in Direct, user queries, and additional intent dictionaries. For queries with an intent to buy, the factor tends to 1; for product queries, to 0.6; for queries with an intent where nothing can be bought (reviews, etc.), to 0. ((http://wiki.yandex-team.ru/Faktorydljanovogokatorazaprosov Factors of the Classifier)) ((http://wiki.yandex-team.ru/jandekspoisk/antispam/antiseo/klassifikATORCHESSKIXZAPROSOV STUNITURE OF HIM)) |
495 | FieldLM | Unigram language model. The language is modeled from the document and smoothed by a general language model. When building the document model, information about which field of the document (Title, Head or Plain Text) the query word occurred in is used |
496 | GeoCityUrlRegionCity | Match between the geography determined from the URL of the document and the city of the query (IP or LR) |
497 | GeoCityUrlRegionRegion | Match between the geography determined from the URL of the document and the region of the query (IP or LR) |
498 | GeoCityUrlRegionCountry | Match between the geography determined from the URL of the document and the country of the query (IP or LR). Relevant for Russia and Ukraine. |
499 | GeoCityUrlGeoCityCity | Match between the geography determined from the URL of the document and the city in the query (GEOCITY rule) |
500 | PayAppDetectorPredict | The value of the chopped commerce detector, computed in Begemot. |
501 | TitleTrigramsQuery | Computes the coverage of the query by letter trigrams of the document title |
502 | TitleTrigramsTitle | Computes the coverage of the document title by letter trigrams |
503 | InlinksModel | Probabilistic model built on the texts of incoming links |
504 | QueryWordSequencesTR | Counts occurrences of the following kind: a sequence of more than two query words found within one sentence; normalized by the length of the document. |
505 | QueryWordSequencesLR | Counts occurrences of the following kind: a sequence of more than two query words found within one link; normalized by the number of links. |
506 | OwnerNavQuota | The share of clicks for navigation requests |
507 | GeoRelevAlienCity | The result has a geography of the user at the city level ([415] == 1 && [215] == 0) |
508 | GeoVQueryInUserCity | Request geovitality for results from the user region |
509 | GeoVQueryInAlienCity | Query geovitality for results not from the user region |
510 | HostReliability | The share of URLs that respond without errors |
511 | DmozThemeMatchAll | Match between the thematic spectra (according to DMOZ) of the query and the document. The query theme is determined by ((http://wiki.yandex-team.ru/jandekspoisk/zarubezhnyjjinternet/dmozqueryClassifier1 the DmozTheme wizard rule)) |
512 | DmozThemeMatchBest | Match between the thematic spectra (according to DMOZ) of the query and the document. The query theme is determined by the best result of ((http://wiki.yandex-team.ru/jandekspoisk/zarubezhnyjjinternet/dmozqueryClassifier1 the DmozTheme wizard rule)). The document theme is determined by an automatic classifier |
513 | Mpsa | Evaluates the minimum distance between pairs of query words, taking into account the distance of the pair from the beginning of the document (Minimal Pair Size with Attenuation). Pairs are understood to mean all consecutive bigrams of query words. Thus, the number of pairs is equal to the number of words in the query minus 1. Accordingly, the factor makes sense only for queries consisting of more than one word. ((Http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayaformula/ Tekushhiekomponenty/MPSA MPSA)) |
514 | Bclm2 | It differs from BCLM in that the weights of all words are considered the same. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAformula/tekushhiekomponenty/bclm2 BCLM2)) |
515 | AbsolutePLM | Text relevance based on a language model, taking absolute position into account. We slide a window of 20 words over the text, build a language model on each window (that is, a probability distribution over the words of the Russian language) and compute the probability of generating the query. The model is penalized for distance from the beginning of the document. |
516 | PageRegionCoverage | |
517 | PageRegionSize | The size of the page region |
518 | PageRegionRelCoverage | |
519 | RcSpylogFreshnessAtReq | Freshness of rapid clicks spy_log update, calculated at the request time |
520 | IsGeo | Passed to the base search under the name ISGEO: the maximum weight of the geo-objects found in the query. A geo-object is understood as an object of the category GEO, Geo1, Geoaddr, Geoaddr1, Landmark, Landmark1 (see ((http://wiki.yandex-team.ru/alekseysokirko/queryobjects kaovsky allocation))). ((http://wiki.yandex-team.ru/arsengadzhikurbanov/wares Read more)) |
521 | IsMusic | Passed to the base search under the name ISMUSIC: the maximum weight of an object of the Music or Music1 category found in the query. (See ((http://wiki.yandex-team.ru/alekseysokirko/queryobjects SOM-OV))). ((http://wiki.yandex-team.ru/arsengadzhikurbanov/warees more)) |
522 | BclmLite | A modification of the BCLM2 factor, made lightweight for use in tulle. The main difference is that BCLMLite does not use absolute offsets of words relative to the beginning of the document. Instead, the factor works with ordinary positions of the form <sentence_number, position_in_sentence>. Proximity between words is taken into account only within a sentence. ((Http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayaFormula/tekushichiekomponenty/bclmlite bclmlite)) |
523 | NearbyQuery | When answering the query, results in the immediate vicinity are important ([pharmacies], [children’s clinic]) |
524 | CityQuery | When answering the query, results within the city are important (the bulk of localized queries) |
525 | AdmQuery | When answering the query, results from the user’s region or district are important ([airport], [dairy]) |
526 | NumLinksFromMP | The number of incoming links from main pages |
527 | YmwFull2 | Fixed YMWFull. It differs from the previous version only in its behavior on 2-word queries. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAformula/tekushhiekomponenty/ymw Read more)) |
528 | FullQuorum | Binary factor: every word of the query is in the text or in the links |
529 | AuxCTextBM25 | Country praets’ (AUXQC) |
530 | AuxCLinkBM25 | Country praets’ (AUXQC) |
531 | Soft404 | Page – ‘404’ (share of tokens ‘404’ in relation to the total number of tokens on the page) |
532 | RcSpylogUrlRationalSigmoidD1T240AtReq | URL feature computed at the request time from rapid clicks spy_log counters with decay of 1 day |
533 | DBM25 | BM25 in which the word weights are machine-learned |
534 | QueryWordCohesionTR | Evaluates how closely the query words are grouped together in the text of the document, regardless of their order. ((http://wiki.yandex-team.ru/sergejjkrylov/queryWordCohesionTR Description)) |
535 | OwnerSessNormDuration_Reg | ND/K normalized time to click |
536 | RcSpylogUrlRationalSigmoidD0_5T30AtReq | URL feature computed at the request time from rapid clicks spy_log counters with decay of 0.5 days |
537 | QueryDOwnerSessNormDuration_Reg | CONTRY / K |
538 | QueryDOwnerWeightClick_Reg | w/k |
539 | QueryDOwnerOnlyClickRate_Reg | o/i |
540 | QueryDOwnerClickSummary_Reg | Selected formula |
541 | QueryDOwnerSatisfied4Rate_Reg | r_s4b/(r_k + 10) |
542 | SegmentAuxAlphasInText | Number of letters in the AUX segment |
543 | SegmentAuxSpacesInText | The number of spaces in the AUX segment |
544 | SegmentContentCommasInText | The number of commas in the Content segment |
545 | IsShop | The page is a store. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAformula/tekushhiekomponenty/opisanijafaktorov#SSHOP Description)). Not used (deprecated) |
546 | XLRGeoRelevRegionNatDomain | |
547 | AuraDocLogOrigin | Logarithm of the number of shingles in the document added by the owner of the site as original texts in ((http://wiki.yandex-team.ru/jandekspoisk/jekosistema/marketingPr/webmasters/plan/vtorcontect of originality plugin)). It does not participate in the formula, it is needed to disconnect the takes |
548 | AuraDocMeanFltAuthorSource | The average filtered number of sources of authorship of the document. It does not participate in the formula, it is needed to disconnect the takes |
549 | QueryRefTrigramQuery | ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAformula/tekushhiekomponenty/opisanijafaktorov#queryreftrigrams Description)) |
550 | QueryRefTrigramReferences | ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAformula/tekushhiekomponenty/opisanijafaktorov#queryreftrigrams Description)) |
551 | IdfVariance | Variance of the IDFs of the words |
552 | UrlNGramsModel | Urlngramsmodel ranking factor in ERF |
553 | NationalLanguage | The language of the document corresponds to the country of the query |
554 | OwnerIsCommercial | |
555 | GeoCountryUrlRegionCountry | |
556 | GeoCountryUrlGeoCountry | |
557 | NumLinksFromSegmentContent | |
558 | Locm | The order of words in links. |
559 | UrlQueryVariety_Reg | The degree of variety of the queries for which this URL is clicked, computed by region |
560 | UrlSessNormDurRate_Reg | nd/i |
561 | FiltrationSegments | The share of the segments of the request present in the text |
562 | LanguageGoodForTurkey | The language of the document is one of the permissible for Turkey (Turkish, English, German, French, Arabic, Azerbaijani) or the document has zero length. In the search stage is calculated only for Isrealgeolocal requests. |
563 | DBM25_2 | Variation on the theme of ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayaformula/tekushhiekomponenty/DBM25 dBM25)). |
564 | GeoDispersion | Document links dispersion |
565 | QueryDownerEnoughClicked | Both the number of clicks on the owner and the number of clicks for the query are greater than 5 |
566 | BM25FdPRFixed | BM25FdPR with normalization by the average document length, depending on the language of the document. ((http://wiki.yandex-team.ru/bm25frework test results.)) |
567 | LanguagePopularity | The popularity of the language of the document. A number from 0 to 1. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayaformula/tekushhiekomponenty/languaguaguagepopalarity)) |
568 | QueryDOwnerWeightedSumFRCAndBM25FdPRFixed | The weighted sum of the factors QueryDownerClicksFRC and BM25FdPRFixed with weights 0.358449 and 0.184922, respectively. The ‘565’ in the name of the factor should not be taken literally; it is legacy or a typo. |
569 | QueryDOwnerWeightedSumMaxWHRAndOnlyClickRate | The weighted sum of factors 192 and 341 with weights 0.298942 and 0.454625, respectively. |
570 | RcSpylogUrlRationalSigmoidD3T120AtReq | URL feature computed at the request time from rapid clicks spy_log counters with decay of 3 days |
571 | RcSpylogUrlRationalSigmoidD14T300AtReq | URL feature computed at the request time from rapid clicks spy_log counters with decay of 14 days |
572 | Tocm | Evaluates how the positions of words in the title differ from their order in the query |
573 | RelevGeoLinksPercent | |
574 | LangDispersion | Dispersion of languages in XMAP |
575 | HasMisspell | There is a typo in the request |
576 | DBM30Smerch | Variation on the theme of ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayaformula/tekushhiekomponenty/DBM25 dBM25)). |
577 | IsUrlForClickDeboost | It is known about URL that it is shown too often with very low relevance (according to Bert and/or BM25) |
578 | UrlLinkPercent | The ratio of the number of incoming links whose text is a URL to the total number of incoming links |
579 | DssmBertDistillL2 | A pool of logs is marked with BERT trained on Sinsig. DSSM model is trained on this pool using BaseregionChain |
580 | NumNonLettersInUrl | The number of non-letter characters in the URL |
581 | UrlLen2 | The length of the URL, accurate to the character. Disabled in production. |
582 | IsHub | Hub page |
583 | StaticTitleComm | The degree of commerciality of the page title. Not used (deprecated) |
584 | StaticTitleBM25Ex | BM25 of the page title against its text |
585 | StaticTitleLRBM25 | BM25 of the page title against the texts of links to it |
586 | SeoInPayLinks | The number of COO-Thrilling links between hosts |
587 | USLongPeriodUrlMobileDt180Avg | Static URL factor for search sessions for 1600 days calculated on mobile sessions. Average dwelltime, and DwellTime is cut from the session if more than 180 seconds |
588 | USLongPeriodUrlMobileLongClickProb | Static URL factor for search sessions for 1600 days calculated on mobile sessions. The probability that a click on the URL lasts longer than 120 seconds |
589 | USLongPeriodUrlMobileLossesProb | Static URL factor for search sessions for 1600 days calculated on mobile sessions. The probability that the URL is not clicked when at least one URL below it is clicked. |
590 | USLongPeriodUrlMobileDt3600AvgReg | Static URL factor for search sessions for 1600 days calculated on mobile sessions. Average dwelltime, and DwellTime is cut from the session if more than 3600 seconds. Localization to the level of countries. |
591 | USLongPeriodUrlMobileDt180AvgReg | Static URL factor for search sessions for 1600 days calculated on mobile sessions. Average dwelltime, and DwellTime is cut from the session if more than 180 seconds. Localization to the level of countries. |
592 | HpDetectorPredict | The value of the health detector computed in Begemot. |
593 | IsFeedListing | OffersBase feature for ecoboost. |
594 | IsFeedMain | OffersBase feature for ecoboost. |
595 | IsFeedStratocaster | OffersBase feature for ecoboost. |
596 | IsFeedAny | OffersBase feature for ecoboost. |
597 | TitleInLinksTrigrams | The share of unique trigrams of the title among the trigrams of links |
598 | LinksInTitleTrigrams | The share of unique trigrams of links among the trigrams of the title |
599 | TrashAdv | The greasy of the page |
600 | MetrikaUrlVisits | Similar to YabarUrlVisits |
601 | UrlGeoAdms | The URL document corresponds to the user (http://wiki.yandex-team.ru/jandekspoisk/kacheStvopoiska/geo/regnavquerispoisk/KacheStvopoiska/GEO/RENAVAVQURIES) |
602 | UrlGeoCity | The URL of the document corresponds to the city of the user |
603 | RegNavQuery | Regional navigational query: in the user region there are one or more navigational results for it |
604 | YabarUrlLcAc | The number of sessions in which the URL was the last one, divided by the number of sessions in which the URL appeared |
605 | SOMaxSumSourceRank | The sum of the maximum values of Sourcerank’s for each incoming link, taking into account the uniqueness of the owner. |
606 | DBM35 | BM25 over texts and links with special weights for the level of match (form, lemma, synonym) |
607 | TRLRQuorumFm | The weight of the query words found in the text in the exact form |
608 | TRLRQuorumLemma | The weight of the query words found in the text up to lemma |
609 | TRLRQuorumSyn | The weight of the query words found in the text up to synonym |
610 | IsHum | Passed to the base search under the name ISHUM: the maximum weight of an object of the Hum or Hum1 category found in the query. (See ((http://wiki.yandex-team.ru/alekseysokirko/queryobjects SOM-OV))). ((http://wiki.yandex-team.ru/arsengadzhikurbanov/Wares#ishum more)) |
611 | IsText | Passed to the base search under the name ISTEXT: the maximum weight of an object of the TEXT or Text1 category found in the query. (See ((http://wiki.yandex-team.ru/alekseysokirko/queryobjects SOM-OV))). ((http://wiki.yandex-team.ru/arsengadzhikurbanov/Wares#istext more)) |
612 | IsPicture | Passed to the base search under the name Ispicture: the maximum weight of an object of the Picture or Picture1 category found in the query. (See ((http://wiki.yandex-team.ru/alekseysokirko/queryobjects SOM-OV))). ((http://wiki.yandex-team.ru/arsengadzhikurbanov/Wares#ispicture)) |
613 | MaxOne | Returns the maximum degree of household objects in the query under the name Wmaxone. (See ((http://wiki.yandex-team.ru/alekseysokirko/queryobjects SOM-OV))). ((http://wiki.yandex-team.ru/arsengadzhikurbanov/Wares#maxone more)) |
614 | MinOne | Returns the maximum degree of household objects in the query under the name Wminone. (See ((http://wiki.yandex-team.ru/alekseysokirko/queryobjects SOM-OV))). ((http://wiki.yandex-team.ru/arsengadzhikurbanov/Wares#minone more)) |
615 | OqBm25Str | BM25 on the request for Domattr index |
616 | OqBm25Lem | BM25 on the request for Domattr index |
617 | OqBm25Syn | BM25 on the request for Domattr index |
618 | OqBclmWeighted | BCLM for the Domattr Index |
619 | OqBclmPlain | BCLM on the request of the owners index |
620 | LinksAlive | Estimates whether the document is ‘alive’ from the point of view of its incoming links. |
621 | SmallWindow | The maximum total weight of the query words within a window of 50 words |
622 | MetrikaUrlVisitors | Similar to YabarUrlVisitors |
623 | MetrikaUrlAvgTime | Similar to YabarUrlAvgTime |
624 | MetrikaUrlCoreAudience | The core audience of pages that have a Metrika counter |
625 | RegexMaxClickPercent | The share of clicks on this URL among all clicks for similar queries |
626 | RegexCtr | Corrected CTR of this URL for all similar queries |
627 | DomPhraseClickRankBi | Bigram domain clickability (excluding thesaurus expansions of queries) |
628 | DomPhraseYabarBi | Visits to the site from search engines by bigrams, according to Bar data (excluding thesaurus expansions of queries) |
629 | LastWordHostClicks | The clickability of the host by the last word of the query (excluding thesaurus expansions of queries) |
630 | HostHasFeedUrls | OffersBase feature for ecoboost. |
631 | IsFeedOffer | OffersBase feature for ecoboost. |
632 | HostEcomKernel1 | Business kernel. |
633 | HostEcomKernel2 | Business kernel. |
634 | HostEcomKernel3 | Business kernel. |
635 | RcSearchBaseUrlRationalSigmoidD1TM600AtReq | URL feature computed at the request time from rapid clicks search counters with decay of 1 day |
636 | SynSetLocm | A copy of the ((http://wiki.yandex-team.ru/JandeksPoisk/KachestvoPoiska/ObshayaFormula/TekushhieKomponenty/Locm LOCM)) factor for ((http://wiki.yandex-team.ru/JandeksPoisk/KachestvoPoiska/ObshayaFormula/TekushhieKomponenty /Synset synsets)). |
637 | SynSetLinkBM25 | A copy of the LinkBM25 factor for ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayafformula/tekushhiekomponenty/synset synsets)). |
638 | RcSearchBaseUrlContrastD30Odd0_9_X_D30T1AtReq | URL feature computed at the request time from rapid clicks search counters with decay of 30 days |
639 | Removed_639 | |
640 | DmozQueryBestTheme | The most likely theme of the query, determined by ((http://wiki.yandex-team.ru/jandekspoisk/zarubezhnyjjinternet/dmozqueryClassifier1 the DmozTheme wizard rule)); only the most popular themes are taken into account (but more of them than in the DmozQueryThemes factor). The factor contains the probability that the query matches the theme, with each theme assigned its own interval on the segment [0..1] |
641 | DmozQueryThemes | The theme of the query, determined by ((http://wiki.yandex-team.ru/jandekspoisk/zarubezhnyjjinternet/dmozqueryClassifier11 the DmozTheme wizard rule)); only a few of the most popular themes are taken into account. |
642 | DiversityCategNeedPhoto | 0 or 1, depending on whether the query has a clearly expressed Need_photo intent from diversity |
643 | DiversityCategNeedMap | 0 or 1, depending on whether the query has a clearly expressed Need_map intent from diversity |
644 | LongQuerySyn | An analogue of LongQuery (the sum of the IDFs of the query words), but with ‘correct’ accounting of synonyms. Specifically, the minimum IDF (i.e. the most frequent) among the word and its synonyms is selected. |
645 | UrlHasShortCountryNameToken | Url contains a token that coincides with the short name of the user country. The factor is considered only on the EU stream. |
646 | TurkeyPageRank | Personalized Turkish Pagerank |
647 | ExpectedFound | The expected number of documents found for the query |
648 | FooterInLinksTrigrams | The share of unique trigrams of a footer fragment in trigrams of links |
649 | LinksInFooterTrigrams | The share of unique trigrams of links among a fragment of trigrams of a footer |
650 | ErratumLogQueryProbability | Double logarithm of the probability of the query under the language model of the Erratum typo service |
651 | UrlIsMarketOffer | URL is an offer in the latest version of the market base. |
652 | DBM40 | Variation of Temo ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayaformula/tekushhiekomponenty/DBM25 dBM25), cm. |
653 | Removed_653 | BM25_0 |
654 | BM25_0 | Variation on the topic BM25 |
655 | BM25_1 | Variation on the topic BM25 |
656 | BM25_0123 | Variation on the topic BM25 |
657 | QueryUrlCorrectedCtr | ‘Corrected’ clicks computed using RequestAggregateLib |
658 | QueryUrlCorrectedCtr_Reg | ‘Corrected’ clicks computed using RequestAggregateLib. Regional version |
659 | YabarUrlVisits_Reg | Regional visits to the URL according to Yandex.Bar data |
660 | MetrikaUrlHostVisitTime | The average time a user stays on the host after an external entry (from another, non-search site) from a specific URL |
661 | MetrikaUrlHostVisitDepth | The average ‘depth’ (the number of transitions within the host) of a user's stay on the host after an external entry (from another, non-search site) from a specific URL |
662 | DBMNumbers | DBM separately by numbers |
663 | DBMGeo | DBM separately by geo-objects of request |
664 | DBMSubstantive | DBM computed separately for nouns |
665 | AvgSessionLen | The average length of the logical session in which there was a request |
666 | NHopTextBclmWeighted | BCLM (weighted) over N-hop texts. |
667 | YabarUrlDownloads | Estimate of the probability of downloads from the document |
668 | Bocm | Evaluates how well the positions of words in the document's sentences correspond to the positions of words in the query. |
669 | HostUserLeakage | User outflow coefficient from the search after a visit to the site |
670 | FioMatch | The document contains a name from the request. |
671 | IsIndexPage | The page is index.(html/php/aspx?/…), without CGI parameters. Computed over all duplicates. |
672 | IsIndexPageSoft | The page is index.(html/php/aspx?/…), possibly with CGI parameters. Computed over all duplicates. |
673 | IsOwner | Whether the host is the owner, conditionally host == Owner (Host). |
674 | MinPathLen | The minimum length of Pathandquery for all half -shoes. |
675 | XLerfGeoLRlogRelevCnt | Regionalized (only links from the country of request are taken) variant of the Xlerfgeolrlogrelev factor |
676 | XNonCommLerfNormLRlogRelevCnt | Regionalized (only links from the country of request are taken) variant of the factor XNONCOMMLERFNORMLRLOGRELAV |
677 | LocmCnt | Regionalized (only links from the country of request are taken) Variant of Locm factor |
678 | XLRrelevCnt | Regionalized (only links from the country of request are taken) variant of factor xlrrelev |
679 | XLerfLRrelev200Cnt | Regionalized (only links from the country of request are taken) variant of factor Xlerflrrelev200 |
680 | NavLinear | ((http://wiki.yandex-team.ru/jandekspoisk/antispam/polunavigacionnyezaprosy#faktornnostiparyurl-zapros Classifier)) of vital [query, url] pairs; the URL is vital for the query if the factor value is > 0. |
681 | RankComGoodness | Classifier for estimates of commercial sites |
682 | HasDownloadLinkOnFile | The document has a direct link to the file |
683 | HasDownloadLinkOnFileHosting | The document has a link to filehosting |
684 | DiversityCategDownload | 0 or 1: whether the query is marked with this tag |
685 | DiversityCategReview | 0 or 1: whether the query is marked with this tag |
686 | DiversityCategWatch | 0 or 1: whether the query is marked with this tag |
687 | QrTur | The predicted share of “good” (at least two different cities and frequency >= 10) references to the query with geography in Turkey |
688 | QueryThEncyclopedic | The output of the lexical query classifier predicting the likelihood of a click on theme 3561 |
689 | QueryThVideohosting | The output of the lexical query classifier predicting the likelihood of a click on page 3973 |
690 | IsNavMxQuery | Rank ‘navigation’ |
691 | QueryUrlYabarVisits_Reg | Regional visits from search engines for a specific query |
692 | ClickedWithAnotherSEClicks | Clicks on URLs shown in the search results for queries after which users went on to search in other search engines |
693 | ShowsWithAnotherSEClicks | Impressions of URLs in the search results for queries after which users went on to search in other search engines |
694 | CommercialOwnerRank_Reg | Classifier of the commerciality of the site |
695 | HostIsMarketOffer | In the latest version of the market base there are offers from this host. |
696 | BclmMax | The proximity of the words of the request to the most difficult word. |
697 | UrlPronRegexpMatch | Url satisfies the regexp expression set in the prone |
698 | HasUserReviews | The document contains user review/comment |
699 | RegexMaxClickPercentReg | The share of clicks on this URL among all clicks for similar queries, country version; see ((http://wiki.yandex-team.ru/development/poisk/Arcadia/indexregex Indexregex)) |
700 | RegexCtrReg | Corrected CTR of this URL over all similar queries, country version; see ((http://wiki.yandex-team.ru/development/poisk/Arcadia/indexregex Indexregex)) |
701 | Found | The average number of documents found for the query |
702 | YabarWordDepthNodesGradientMin | The angle in the Depth Nodes space, counted only by words (min for all) |
703 | DBM15Wares | RankComGoodnessBar |
704 | RankComGoodnessBar | Classifier that approximates the quality of commercial sites based on user behavior data |
705 | DocCreateMonth | Document creation time with month precision: 1.0 is the current month, 0 is 10 years ago or older. Temporarily disabled |
706 | DocUpdateMonth | Document update time with month precision: 1.0 is the current month, 0 is 10 years ago or older. Temporarily disabled |
707 | XLRSourceRank | XLRMainPage |
708 | XLRMainPage | DaterStatsYearNormLikelihood |
709 | DaterStatsYearNormLikelihood | The likelihood function of the distribution of years in the document. Temporarily disabled |
710 | HostNumSovetnik | Num of Sovetnik URLS |
711 | LcmVar | Variance of the number of words in the links. |
712 | DaterStatsAverageSourceSegment | The arithmetic mean position of dates in the document. Temporarily disabled |
713 | DBM15Wares2 | Cabm |
714 | Cabm | BM with attenuation in the text of catalog links. |
715 | BeastNqUrlMeanPos | The average position of the URL for a normalized query |
716 | BeastNqOwnerMeanPos | The average position of Domattr for a normalized query |
717 | BeastUrlMeanPos | The average position of the URL over all queries |
718 | BeastHostMeanPos | The average position of the host for all requests |
719 | BeastUrlNumQueries | Number of requests for URL |
720 | BeastHostNumQueries | Number of requests for host |
721 | YabarHostBrowseRank_Reg | Implementation of the algorithm described in the paper ((http://research.microsoft.com/en-us/people/tyliu/fp032-liu.pdf)), computed by large regions (tube) |
722 | Removed_722 | SegmentWordPortionFromMainContent |
723 | SegmentWordPortionFromMainContent | The share of the words of the document from the segments with Score> 2. |
724 | UrlDomainSimilarityFixed | TotalDups |
725 | TotalDups | RankBoostGoodness |
726 | RankBoostGoodness | The rank of site quality used for boosts of the Moscow commercial formula |
727 | QueryDOwnerClicksFRCRegGeo | QueryURLClicksFRCRegGeo |
728 | QueryURLClicksFRCRegGeo | LanguageDistribution |
729 | LanguageDistribution | UrlShowsWithNextPageClicksP1 |
730 | UrlShowsWithNextPageClicksP1 | UrlShowsWithNextPageClicksP10 |
731 | UrlShowsWithNextPageClicksP10 | The factor is used in Selectionrank. TG_UNUSED: should not be included in the formulas to avoid feedback |
732 | QueryURLClicksPCTRYear | QueryURLClicksPCTRPreviousYear |
733 | QueryURLClicksPCTRPreviousYear | SmallWindowAttenuation |
734 | SmallWindowAttenuation | RcSearchBaseUrlRationalSigmoidD3T120AtReq |
735 | RcSearchBaseUrlRationalSigmoidD3T120AtReq | URL feature computed at the request time from rapid clicks search counters with decay of 3 days |
736 | OwnerCTRWithNextPageClicksP10 | CommRus |
737 | CommRus | The weight of the document on a monosyllabic dictionary of commercial vocabulary |
738 | WikiLinkCount | UrlInLinksTrigramsStatic |
739 | UrlInLinksTrigramsStatic | LinksInUrlTrigramsStatic |
740 | LinksInUrlTrigramsStatic | UkrIsQueryLang |
741 | UkrIsQueryLang | Indicates that the query is in Ukrainian |
742 | QueriesAvgCM2 | Average query commerciality |
743 | QiQueryCount | The number of requests in the group of frequency requests similar to a given |
744 | QiUrlFreqWeightedFRC | FRC groups of frequency requests similar to a given, with averaging through the sum of clicks and shows |
745 | QiUrlFreqWeightedFRCReg | FRC groups of frequency requests similar to a given, with averaging through the sum of clicks and shows, according to regional statistics |
746 | RcSearchBaseUrlRationalSigmoidD1TM600Frozen | URL feature computed from rapid clicks search frozen counters with decay of 1 day |
747 | WordHostWikiSum | The relative popularity of the Word-Host pair, where Word is a word from the title of a Wikipedia article and Host is a host referenced in that article. |
748 | RegWordHostClicksSum | The relative clickability of the CountryId-Word-Host triple according to Yandex search data. |
749 | RegWordHostYabarSum | The relative clickability of the CountryId-Word-Host triple according to data on popular search engines from the Bar and SimilarGroup. |
750 | RegexMaxClickPercentYabarReg | The share of clicks on this URL among all clicks for similar queries, computed from popular search engine data |
751 | YabarHostSurfTrDpNdLeafLn | The length of the Depth Nodes petal counted for hosts |
752 | YabarHostSurfTrNdTmGrDsp | Dispersion of the angle in the space of Nodes Time, calculated for hosts |
753 | YabarHostSurfTrNdTmLeafLn90 | 0.9 quantile of the petal length in the Nodes Time space, calculated for hosts |
754 | WordHostDownloadProbability | The probability, averaged over the query, of downloading a file from the host after a click. |
755 | NastyContent | Content ugliness factor. |
756 | SynnormURLPCTR | CTR according to click data; the query is normalized by synsets |
757 | SynnormURLPCTRReg | Regional CTR according to click data; the query is normalized by synsets |
758 | UrlQueryTrigramsStatic | Static trigram intersection of the URL and the queries through which users visited the URL. |
759 | AdvAspam | HasPornoQuery |
760 | HasPornoQuery | The result of applying the wizard's Adult rules. |
761 | QUBm15Weighted | Weighted BM15 of the query against the document's query index, i.e. the list of queries from which users navigated to the document. |
762 | WeightedSumIsIndexPageBocm | WeightedSumIsIndexPageIsNavMxQuery |
763 | WeightedSumIsIndexPageIsNavMxQuery | BrowserHostDownloadProbability |
764 | BrowserHostDownloadProbability | The probability of a download from the host after a click (according to Bar logs). |
765 | NHopChainsCountFrc | The number of chains for the query / (the number of chains in which the URL participated + the number of chains for the query). |
766 | NHopIsFinal | The number of chains in which the URL was last, normalized by the total number of chains containing this URL. |
767 | VisitsFromWiki | Number of transitions to URL from Wikipedia |
768 | RcSearchBaseUrlContrastD30Odd0_9_X_D30T1Frozen | URL feature computed from rapid clicks search frozen counters with decay of 30 days |
769 | RegBrowserUserHub | Indicator of the page acting as a hub (how many pages Bar users navigate to from it). |
770 | AuxTitleBM25 | TextBM25 computed over the title against the name of the user's region; similar to factor 268. |
771 | Bclmf | BCLM for Annotation index, doc text and links. |
772 | NoProductsProbability | DSSM prediction, from URL + title, of the probability that there is no product on the page. |
773 | PopularSEFRCBrowser | FRC Popular Search System for Browser Logs |
774 | LogCtrMean | Weighted mean of log(query_clicks)/log(query_shows) for given host. Weights are proportional to log(query_shows) + 0.2. |
775 | QueryUrlNhopTotalFrc | The number of transitions to the URL for the query found in hop chains, normalized by the total number of transitions for the query. |
776 | QueryUrlNhopIsFinal | The probability that the URL is the last one in the hop chain for the query. |
777 | OneProductProbability | DSSM prediction, from URL + title, of the probability that there is one product on the page. |
778 | ManyProductsProbability | DSSM prediction, from URL + title, of the probability that there are many products on the page. |
779 | RcSearchBaseUrlRationalSigmoidD3T120Frozen | URL feature computed from rapid clicks search frozen counters with decay of 3 days |
780 | GeoCityUrlHasCity | A city-level geo-approval is determined for the URL according to the BUKI-1125 rules |
781 | GeoCityUrlHasCountry | A country-level geo-approval is determined for the URL according to the BUKI-1125 rules |
782 | GeoRelevRegionCityGeoa | GeoRelevRegionCity factor on the Geoa attribute |
783 | GeoRelevRegionRegionGeoa | GeoRelevRegionRegion factor on the Geoa attribute |
784 | GeoGeometryProximGeoa | GeoGeometryProxim factor on the Geoa attribute |
785 | GeoRelevAlienCityGeoa | GeoRelevAlienCity factor on the Geoa attribute |
786 | GeoVQueryInUserCityGeoa | GeoVQueryInUserCity factor on the Geoa attribute |
787 | GeoVQueryInAlienCityGeoa | GeoVQueryInAlienCity factor on the Geoa attribute |
788 | PageRegionSizeGeo | PageRegionSize factor on the Geo attribute |
789 | PageRegionCoverageGeo | PageRegionCoverage factor on the Geo attribute |
790 | PageRegionCoverageAdresa | PageRegionCoverage factor on the Adresa attribute |
791 | GeoRelevRegionCityAdresa | GeoRelevRegionCity factor on the Adresa attribute |
792 | DoppQueryUrlSessionClicksFRC | The share (averaged per session) that this URL makes up of the URLs clicked for this query. Computed over user sessions. |
793 | OwnerIsActualShop | Owner is a store |
794 | OwnerIsService | Owner is a service |
795 | NHopTextBclmPlane | BCLM (plain) over N-hop texts. |
796 | SameQueryReturnFRCBrowser | FRC by transitions from requests that were set by the user several times |
797 | QueryURLISBMCTR | The average weight of impressions on the first page; a click weighs 1, a non-click weighs according to the SBM_GAMMAS table |
798 | QueryURLISBMCTRReg | The average weight of impressions on the first page; a click weighs 1, a non-click weighs according to the SBM_GAMMAS table. Regional version |
799 | RegexBeastPositionReg | Half-sum of the URL's position estimate and the median position over all similar queries, from Beast data |
800 | RcSpylogHostRationalSigmoidD3T0AtReq | Host feature computed at the request time from rapid clicks spy_log counters with decay of 3 days |
801 | RcSpylogHostRationalSigmoidD3DTM3600AtReq | Host feature computed at the request time from rapid clicks spy_log counters with decay of 3 days |
802 | RcSpylogHostRationalSigmoidD14T0AtReq | Host feature computed at the request time from rapid clicks spy_log counters with decay of 14 days |
803 | RcSpylogHostRationalSigmoidD14DTM3600AtReq | Host feature computed at the request time from rapid clicks spy_log counters with decay of 14 days |
804 | RcSpylogHostRationalSigmoidedCTRD3DT0TM3600AtReq | Host feature computed at the request time from rapid clicks spy_log counters with decay of 3 days |
805 | RcSpylogHostRationalSigmoidedCTRD14DT0TM3600AtReq | Host feature computed at the request time from rapid clicks spy_log counters with decay of 14 days |
806 | RcSpylogHostRationalSigmoidD3T0Frozen | Host feature computed from rapid clicks spy_log counters with decay of 3 days |
807 | RcSpylogHostRationalSigmoidD3DTM3600Frozen | Host feature computed from rapid clicks spy_log counters with decay of 3 days |
808 | RcSpylogHostRationalSigmoidD14T0Frozen | Host feature computed from rapid clicks spy_log counters with decay of 14 days |
809 | RcSpylogHostRationalSigmoidD14DTM3600Frozen | Host feature computed from rapid clicks spy_log counters with decay of 14 days |
810 | RcSpylogHostRationalSigmoidedCTRD3DT0TM3600Frozen | Host feature computed from rapid clicks spy_log counters with decay of 3 days |
811 | RcSpylogHostRationalSigmoidedCTRD14DT0TM3600Frozen | Host feature computed from rapid clicks spy_log counters with decay of 14 days |
812 | CommercialDssmOddLike | Finetuned reformulations DSSM to commercial clicked bargain odd-like target from visit log |
813 | DistributorHosts | Is legal video distributor |
814 | OneProductProbabilityAvg | Average value of feature OneProductProbability |
815 | ManyProductsProbabilityAvg | Average value of feature ManyProductsProbability |
816 | PayDetectorPredictAvg | Average value of feature PayDetectorPredict |
817 | OwnerIsPartner | Owner is a partner |
818 | ShopInShopUrl | The document is ShopInShop |
819 | QueryConversionDetectorPredict | The value of the conversion of the request calculated in the Hippo. |
820 | FioFromOriginalRequestBodyChain0Wcm | Factor for the personal name (FIO) from the original query, computed over the document body. Algorithm: Chain0Wcm |
821 | ProductOfferAnyAvailable | At least one offer from the parsed schema has an available status. |
822 | ProductOfferNoProducts | There is not a single offer in the parsed schema. |
823 | BadYtierUrl | The URL from Ytier is known to have low-quality content |
824 | NormYtierUrl | The URL from Ytier is known to have content of acceptable quality |
825 | GoodYtierUrl | The URL from Ytier is known to have good-quality content |
826 | BestYtierUrl | The URL from Ytier is known to have excellent-quality content |
827 | HostIsEcomPurchase | The host has an ecom purchase. |
828 | HostIsVisitLogsPurchase | The host has a purchase by Visit Log. |
829 | YandexMarketProductUrl | URL is a product on the market. |
830 | YandexMarketProductIncludeOfferidUrl | URL is a product on the market and has Offerid. |
831 | ShopInShopCPAUrl | URL is Shopinshopcpa. |
832 | ProductOfferNotAvailable | At least one offer from the parsed schema has an unavailable status. |
833 | OwnerIsEcomPurchase | Owner has an ecom purchase. |
834 | OwnerIsVisitLogsPurchase | Owner has a purchase by Visit Log. |
835 | NavParasites | DSSM Prediction of the probability of URL + Title that the document is an overlap. |
836 | OfferAvailabilityIsSetUp | In the offer from the new parser, the PartnerOfferContent Available field is set. |
837 | OfferAvailability | In the offer from the new parser, the PartnerOfferContent Available field == True. |
838 | URLClicksMaxGeoCityFRCWeight | Normalized corrected clicks count by query with user’s city(gc=) mentioned |
839 | URLClicksMaxGeoCityFRCRatio | Normalized corrected clicks maximum ratio by query with user’s city(gc=) mentioned |
840 | URLClicksMaxGeoAlienCityFRCRatio | Normalized corrected clicks maximum ratio by query with not user’s city(gc=) mentioned |
841 | FastMx | PurchaseTotalPredict |
842 | PurchaseTotalPredict | The value of Purchasetotalpredict, calculated in the Hippo. |
843 | SerpSummarySurplusPredict | The value of serpsummarysurpluspredict, calculated in the hippo. |
844 | YabarUrlRevisits | User returns to the URL |
845 | RequestWith120D3ClickPartPredict | Requestwith120d3ClickpartPredict value, calculated in the Hippo. |
846 | QueryNavParasitesDetectorPredict | The value of QueryNavParasitesDetectorPredict, calculated in the Hippo. |
847 | BrowserHostCntDwellTimeLog | Mean logarithm of user dwell time on the host, localized by country; computed from Yabar logs |
848 | BrowserHostDwellTimeRegionFrc | The ratio of dwell time on the host in this region to dwell time on the host in all regions |
849 | BrowserUrlDwellTimeRegionFrc | The ratio of dwell time on the page in this region to dwell time on the page in all regions |
850 | BrowserBookmarksUrl | The more users add to bookmarks a url, the more factor value it has |
851 | SosDssm | Prediction of the SOS.DSSM model from URL + title. |
852 | MedDssm | Prediction of the Med.DSSM model from URL + title. |
853 | FinLawDssm | Prediction of the FIN_LAW.DSSM model from URL + title. |
854 | WikiInfobox | There is a link to this URL from infoboxes on Wikipedia. |
855 | CrueltyDssm | Prediction of the Cruelty.dssm model from URL + title. |
856 | HalfEcomPredict | The value of Halfecompredict, calculated in the Hippo. |
857 | PrefixSuffixMaxClickPercentReg | A factor similar to RegexMaxClickPercentReg, but calculated using prefix-suffix generalization. |
858 | PrefixSuffixMaxClickPercentYabarReg | A factor similar to RegexMaxClickPercentYabarReg, but calculated using prefix-suffix generalization. |
859 | DssmNavigationL2 | Request and documentary navigation model. |
860 | YabarHostSurfTrNdHgGr | The average sung of inclination in the plane of the top |
861 | QueryUrlCorrectedCtrXfactor | Query-URL factor. The value is the result of collaborative filtering of data for the QueryUrlCorrectedCtr factor |
862 | FullMatrixNet | MatrixNet value of the slow-round model |
863 | FastMatrixNet | MatrixNet value of the fast ranking model |
864 | FilterMatrixNet | MatrixNet value of the filter ranking model |
865 | SmallWindowAttenuationQ | QueryDocTitleRangesMatchingScore |
866 | QueryDocTitleRangesMatchingScore | Factor over the query text and the document title: a score for how well numerical ranges in marker words match |
867 | FullPolynom | The value of Polynom Slow Round Model |
868 | FastPolynom | The value of Polynom of a quick ranking model |
869 | FilterPolynom | The value of Polynom filter ranking model |
870 | IsTranslatedDocument | Indicates that the document was produced by machine translation |
871 | MedDssmWithTrash | Prediction of the Med_with_Trash.DSSM model (medical document model with trash in the training set) from URL + title. |
872 | FinLawDssmWithTrash | Prediction of the FIN_LAW_WITH_TRASH.DSSM model (financial and legal document model with trash in the training set) from URL + title. |
873 | FioFromOriginalRequestBodyMinWindowSize | Factor for the personal name (FIO) from the original query, computed over the document body. The minimum window size that includes all the words of the query, normalized by the number of words in the query. |
874 | FioFromOriginalRequestTextCosineMatchMaxPrediction | Factor for the personal name (FIO) from the original query, computed over the document text. Algorithm: CosineMatchMaxPrediction. |
875 | AllFioFromOriginalRequestAllMaxFBodyChain0Wcm | Factor for all personal names (FIO) from the original query, aggregated over all extensions. Aggregation type for extensions: the maximum value of the factor. Computed over the document body. Algorithm: Chain0Wcm |
876 | AllFioFromOriginalRequestAllMaxFBodyMinWindowSize | Factor for all personal names (FIO) from the original query, aggregated over all extensions. Aggregation type for extensions: the maximum value of the factor. Computed over the document body. The minimum window size that includes all the words of the query, normalized by the number of words in the query. |
877 | SamplePeriodClickFrcSyn | The URL's share of the total number of URLs clicked per session for the query (Synnorm). |
878 | SamplePeriodDayFrcSyn | The average share of clicks on this URL for this query among all clicks for this query (Synnorm) during the day. |
879 | SamplePeriodDayFrc | The average share of clicks on this URL for this query among all clicks for this query (QNORM) during the day. |
880 | QiQueryUrlCorrectedCtrXfactor | QI version of factor 861. MaxValue over the set of popular similar queries. |
881 | QiQueryURLISBMCTRReg | QI version of factor 798. MaxValue over the set of popular similar queries. |
882 | AllFioFromOriginalRequestAllMaxFTextCosineMatchMaxPrediction | Factor for all personal names (FIO) from the original query, aggregated over all extensions. Aggregation type for extensions: the maximum value of the factor. Computed over the document text. Algorithm: CosineMatchMaxPrediction. |
883 | DssmPageQuality | DSSM, predicting the Page Quality rating for the document |
884 | HasTurboEcom | MemorandumUrlType |
885 | MemorandumUrlType | SamplePeriodDayFrcXfactor |
886 | SamplePeriodDayFrcXfactor | Query-URL factor. The value is the result of collaborative filtering of data for the SamplePeriodDayFrc factor |
887 | FastFilterMatrixNet | MatrixNet value of the fast filter model |
888 | FastFilterPolynom | Polynom value of the fast filter ranking model |
889 | QiSamplePeriodDayFrc | QI version of factor 879. |
890 | MetaMatrixNet | MatrixNet value on the meta search level |
891 | MetaPolynom | Polynom value on the meta search level |
892 | ShortVideo | A document is a short video (Tiktok, Reels, Shorts) |
893 | TelegramChannelWebFormat | Document-telegram channel in web format |
894 | TelegramPost | Document – post in telegram |
895 | CorrectedCtrAnnotationMatchPrediction | CorrectedCtrReg factor in the annotation index, AnnotationMatchPrediction factor |
896 | CorrectedCtrQueryMatchPrediction | CorrectedCtrReg factor in the annotation index, QueryMatchPrediction factor |
897 | CorrectedCtrValueWcmAvg | CorrectedCtrReg factor in the annotation index, ValueWcmAvg factor |
898 | CorrectedCtrBm15V4K5 | CorrectedCtrReg factor in the annotation index, Bm15V4K5 factor |
899 | IsNotCgi | Indicates the absence of the ‘?’ symbol in the URL. The factor is zero if the URL has CGI parameters (more precisely, if all duplicates have the ‘?’ symbol in the URL). |
900 | AliceClickDssm | DSSM CLOSE DISCOUNT according to data specific for Alice |
901 | TelFullAttributeTextBocm15K001 | Factor for the Tel_Full telephone attribute from the original query, computed over the document text. Word-weight aggregation algorithm: Bocm15. Normalization coefficient: 0.01. |
902 | Removed_902 | SamplePeriodDayFrcQueryMatchPrediction |
903 | SamplePeriodDayFrcQueryMatchPrediction | SamplePeriodDayFrc factor in the annotation index, QueryMatchPrediction factor |
904 | SamplePeriodDayFrcAnnotationMatchPrediction | SamplePeriodDayFrc factor in the annotation index, AnnotationMatchPrediction factor |
905 | OneClickQueryMatchPrediction | OneClick factor in the annotation index, QueryMatchPrediction factor |
906 | OneClickAnnotationMatchPrediction | OneClick factor in the annotation index, AnnotationMatchPrediction factor |
907 | OneClickBm15AK4 | OneClick factor in the annotation index, Bm15AK4 factor |
908 | OneClickBocmWeightedW1K3 | OneClick factor in the annotation index, BocmWeightedW1K3 factor |
909 | LongClickQueryMatchPrediction | LongClick factor in the annotation index, QueryMatchPrediction factor |
910 | LongClickAnnotationMatchPrediction | LongClick factor in the annotation index, AnnotationMatchPrediction factor |
911 | LongClickBm15AK4 | LongClick factor in the annotation index, Bm15AK4 factor |
912 | LongClickBocmWeightedW1K3 | LongClick factor in the annotation index, BocmWeightedW1K3 factor |
913 | SplitDwellTimeQueryMatchPrediction | SplitDwellTime factor in the annotation index, QueryMatchPrediction factor |
914 | SplitDwellTimeAnnotationMatchPrediction | SplitDwellTime factor in the annotation index, AnnotationMatchPrediction factor |
915 | BQPRQueryMatchPrediction | BQPR factor in the annotation index, QueryMatchPrediction factor |
916 | BQPRAnnotationMatchPrediction | BQPR factor in the annotation index, AnnotationMatchPrediction factor |
917 | YabarVisitsQueryMatchPrediction | YabarVisits factor in the annotation index, QueryMatchPrediction factor |
918 | YabarVisitsAnnotationMatchPrediction | YabarVisits factor in the annotation index, AnnotationMatchPrediction factor |
919 | YabarTimeQueryMatchPrediction | YabarTime factor in the annotation index, QueryMatchPrediction factor |
920 | YabarTimeAnnotationMatchPrediction | YabarTime factor in the annotation index, AnnotationMatchPrediction factor |
921 | SimpleClickQueryMatchPrediction | SimpleClick factor in the annotation index, QueryMatchPrediction factor |
922 | SimpleClickAnnotationMatchPrediction | SimpleClick factor in the annotation index, AnnotationMatchPrediction factor |
923 | LongClickBocmPlain | LongClick factor in the annotation index, BocmPlain factor |
924 | RandomLogDBM35XfactorFullMatchPrediction | The result of collaborative filtering for the FI_DBM35 factor from Random Log in the annotation index, FullMatchPrediction factor |
925 | RandomLogDBM35XfactorAnnotationMatchPrediction | The result of collaborative filtering for the FI_DBM35 factor from Random Log in the annotation index, AnnotationMatchPrediction factor |
926 | OneClickSynonymMatchPrediction | OneClick factor in the annotation index, SynonymMatchPrediction factor |
927 | OneClickFullMatchPrediction | OneClick factor in the annotation index, FullMatchPrediction factor |
928 | OneClickValueWcmAvg | OneClick factor in the annotation index, ValueWcmAvg factor |
929 | OneClickBocmWeightedMaxK1 | OneClick factor in the annotation index, BocmWeightedMaxK1 factor |
930 | OneClickBm15StrictK2 | OneClick factor in the annotation index, Bm15StrictK2 factor |
931 | OneClickBm15MaxK3 | OneClick factor in the annotation index, Bm15MaxK3 factor |
932 | OneClickBclmPlainW1K3 | OneClick factor in the annotation index, BclmPlainW1K3 factor |
933 | OneClickValueWcmMax | OneClick factor in the annotation index, ValueWcmMax factor |
934 | OneClickValueWcmPrediction | OneClick factor in the annotation index, ValueWcmPrediction factor |
935 | OneClickBclmWeightedK3 | OneClick factor in the annotation index, BclmWeightedK3 factor |
936 | BQPRBocmWeightedW1K3 | BQPR factor in the annotation index, BocmWeightedW1K3 factor |
937 | BQPRBm15StrictK2 | BQPR factor in the annotation index, Bm15StrictK2 factor |
938 | SplitDwellTimeBocmWeightedMaxK1 | SplitDwellTime factor in the annotation index, BocmWeightedMaxK1 factor |
939 | SplitDwellTimeFullMatchPrediction | SplitDwellTime factor in the annotation index, FullMatchPrediction factor |
940 | SplitDwellTimeValueWcmAvg | SplitDwellTime factor in the annotation index, ValueWcmAvg factor |
941 | CorrectedCtrBm15StrictK2 | CorrectedCtrReg factor in the annotation index, Bm15StrictK2 factor |
942 | FractionOfQueriesWithGeoPredicted | Prediction of a share of requests with geography on a bag of words built for request |
943 | IsExactUrl | The query is a URL, up to dots and whitespace characters; the IsUrl wizard rule is used |
944 | RandomLogDBM35XfactorValueWcmMax | The result of collaborative filtering for the FI_DBM35 factor from Random Log in the annotation index, ValueWcmMax factor |
945 | RandomLogDBM35XfactorValueWcmAvg | The result of collaborative filtering for the FI_DBM35 factor from Random Log in the annotation index, ValueWcmAvg factor |
946 | RandomLogDBM35XfactorBm15StrictK2 | The result of collaborative filtering for the FI_DBM35 factor from Random Log in the annotation index, Bm15StrictK2 factor |
947 | RandomLogDBM35XfactorBclmPlainW1K3 | The result of collaborative filtering for the FI_DBM35 factor from Random Log in the annotation index, BclmPlainW1K3 factor |
948 | RandomLogDBM35XfactorBclmWeightedK3 | The result of collaborative filtering for the FI_DBM35 factor from Random Log in the annotation index, BclmWeightedK3 factor |
949 | RandomLogDBM35XfactorBocmWeightedW1K3 | The result of collaborative filtering for the FI_DBM35 factor from Random Log in the annotation index, BocmWeightedW1K3 factor |
950 | CorrectedCtrXfactorAnnotationMatchPrediction | CorrectedCtrXfactor in the annotation index, AnnotationMatchPrediction factor |
951 | CorrectedCtrXfactorQueryMatchPrediction | CorrectedCtrXfactor in the annotation index, QueryMatchPrediction factor |
952 | CorrectedCtrXfactorValueWcmMax | CorrectedCtrXfactor in the annotation index, ValueWcmMax factor |
953 | CorrectedCtrXfactorValueWcmAvg | CorrectedCtrXfactor in the annotation index, ValueWcmAvg factor |
954 | CorrectedCtrXfactorBocmWeightedW1K3 | CorrectedCtrXfactor in the annotation index, BocmWeightedW1K3 factor |
955 | CorrectedCtrXfactorBclmPlainK3 | CorrectedCtrXfactor in the annotation index, BclmPlainK3 factor |
956 | CorrectedCtrXfactorBclmMixPlainW1K1 | CorrectedCtrXfactor in the annotation index, BclmMixPlainW1K1 factor |
957 | AliceTimespentSuffixSum | Predicted total time spent until the end of the session, given that this query-document pair occurs |
958 | AliceTimespent | Predicted contribution of this query-document pair to the time spent |
959 | SamplePeriodDayFrcValueWcmAvg | SamplePeriodDayFrc factor in the annotation index, ValueWcmAvg factor |
960 | SamplePeriodDayFrcBm15MaxK3 | SamplePeriodDayFrc factor in the annotation index, Bm15MaxK3 factor |
961 | SamplePeriodDayFrcBocmWeightedK3 | SamplePeriodDayFrc factor in the annotation index, BocmWeightedK3 factor |
962 | SamplePeriodDayFrcBocmDoubleK5 | SamplePeriodDayFrc factor in the annotation index, BocmDoubleK5 factor |
963 | SplitDwellTimeBm15MaxK3 | SplitDwellTime factor in the annotation index, Bm15MaxK3 factor |
964 | SimpleClickBclmWeightedK3 | SimpleClick factor in the annotation index, BclmWeightedK3 factor |
965 | AliceMaxPercentPlayed | Predicted percentage of the track length that will be played, given that this query-document pair occurs |
966 | WeightedUnMatchUrlPredictedAndUserRegion | The probability that the Yweb/Robot/urlgeo_ml region is correct, given that a city is predicted |
967 | PopularSEFRCBrowserAnnotationMatchPrediction | PopularSEFRCBrowser factor in the annotation index, AnnotationMatchPrediction factor |
968 | PopularSEFRCBrowserSynonymMatchPrediction | PopularSEFRCBrowser factor in the annotation index, SynonymMatchPrediction factor |
969 | PopularSEFRCBrowserValueWcmPrediction | PopularSEFRCBrowser factor in the annotation index, ValueWcmPrediction factor |
970 | PopularSEFRCBrowserBclmWeightedV2K3 | PopularSEFRCBrowser factor in the annotation index, BclmWeightedV2K3 factor |
971 | PopularSEFRCBrowserBclmMixPlainW1K1 | PopularSEFRCBrowser factor in the annotation index, BclmMixPlainW1K1 factor |
972 | XLRAnnotationMatchPrediction | Computed over the link index: max(sum(IDF)) over all links that are a subset of the query, divided by sum(IDF) for the query |
973 | OneClickAnnotationMatchPredictionWeighted | OneClick factor in the annotation index, AnnotationMatchPredictionWeighted factor |
974 | LongClickAnnotationMatchPredictionWeighted | LongClick factor in the annotation index, AnnotationMatchPredictionWeighted factor |
975 | YabarTimeAnnotationMatchPredictionWeighted | YabarTime factor in the annotation index, AnnotationMatchPredictionWeighted factor |
976 | PageHasMapsApi | Equals 1 if the page loads the JS API of any geodata provider |
977 | LongClickSPAnnotationMatchPrediction | LongClickSamplePeriod factor in the annotation index, AnnotationMatchPrediction factor |
978 | LongClickSPQueryMatchPrediction | LongClickSamplePeriod factor in the annotation index, QueryMatchPrediction factor |
979 | LongClickSPValueWcmAvg | LongClickSamplePeriod factor in the annotation index, ValueWcmAvg factor |
980 | LongClickSPValueWcmPrediction | LongClickSamplePeriod factor in the annotation index, ValueWcmPrediction factor |
981 | LongClickSPBclmPlainW1K3 | LongClickSamplePeriod factor in the annotation index, BclmPlainW1K3 factor |
982 | LongClickSPBclmWeightedK3 | LongClickSamplePeriod factor in the annotation index, BclmWeightedK3 factor |
983 | LongClickSPBocmWeightedW1K3 | LongClickSamplePeriod factor in the annotation index, BocmWeightedW1K3 factor |
984 | LongClickSPBclmPlainK5 | LongClickSamplePeriod factor in the annotation index, BclmPlainK5 factor |
985 | LongClickSPBclmWeightedV2K3 | LongClickSamplePeriod factor in the annotation index, BclmWeightedV2K3 factor |
986 | LongClickSPBocmDoubleK5 | LongClickSamplePeriod factor in the annotation index, BocmDoubleK5 factor |
987 | LongClickSPBm15StrictK2 | LongClickSamplePeriod factor in the annotation index, Bm15StrictK2 factor |
988 | URLClicksMaxGeoRegionFRCRatio | Normalized corrected clicks maximum ratio by query with user’s city(gc=) mentioned equal by region |
989 | URLClicksMaxGeoRegionOnlyFRCRatio | Normalized corrected clicks maximum ratio by query with user’s city(gc=) mentioned equal to user’s region |
990 | BQPRSampleWcmCoverageMax | BQPR over a sampled period. Annotation index. WcmCoverageMax factor |
991 | BQPRSampleFullMatchPrediction | BQPR over a sampled period. Annotation index. FullMatchPrediction factor |
992 | BQPRSampleAnnotationMatchPredictionWeighted | BQPR over a sampled period. Annotation index. AnnotationMatchPredictionWeighted factor |
993 | BQPRSampleValuePcmAvg | BQPR over a sampled period. Annotation index. ValuePcmAvg factor |
994 | BQPRSampleValueWcmAvg | BQPR over a sampled period. Annotation index. ValueWcmAvg factor |
995 | BQPRSampleBm15V4K8 | BQPR over a sampled period. Annotation index. Bm15V4K8 factor |
996 | BQPRSampleBocmWeightedV4K8 | BQPR over a sampled period. Annotation index. BocmWeightedV4K8 factor |
997 | BQPRSampleWcmMax | BQPR over a sampled period. Annotation index. WcmMax factor |
998 | BQPRSampleSynonymMatchPrediction | BQPR over a sampled period. Annotation index. SynonymMatchPrediction factor |
999 | BQPRSampleAnnotationMatchPrediction | BQPR over a sampled period. Annotation index. AnnotationMatchPrediction factor |
1000 | BQPRSampleSuffixMatchCount | BQPR over a sampled period. Annotation index. SuffixMatchCount factor |
1001 | BQPRSampleWcmCoveragePrediction | BQPR over a sampled period. Annotation index. WcmCoveragePrediction factor |
1002 | DoubleFrcFullMatchPrediction | DoubleFrc in the annotation index, FullMatchPrediction factor |
1003 | DoubleFrcSynonymMatchPrediction | DoubleFrc in the annotation index, SynonymMatchPrediction factor |
1004 | DoubleFrcAnnotationMatchPrediction | DoubleFrc in the annotation index, AnnotationMatchPrediction factor |
1005 | DoubleFrcAnnotationMatchPredictionWeighted | DoubleFrc in the annotation index, AnnotationMatchPredictionWeighted factor |
1006 | DoubleFrcQueryMatchPrediction | DoubleFrc in the annotation index, QueryMatchPrediction factor |
1007 | DoubleFrcValueWcmAvg | DoubleFrc in the annotation index, ValueWcmAvg factor |
1008 | DoubleFrcBocmWeightedMaxK1 | DoubleFrc in the annotation index, BocmWeightedMaxK1 factor |
1009 | DoubleFrcBm15V4K5 | DoubleFrc in the annotation index, Bm15V4K5 factor |
1010 | DoubleFrcBocmWeightedV4K5 | DoubleFrc in the annotation index, BocmWeightedV4K5 factor |
1011 | DoubleFrcBocmDoubleK1 | DoubleFrc in the annotation index, BocmDoubleK1 factor |
1012 | REMOVED_1012 | REMOVED_1013 |
1013 | REMOVED_1013 | REMOVED_1014 |
1014 | REMOVED_1014 | REMOVED_1015 |
1015 | REMOVED_1015 | REMOVED_1016 |
1016 | REMOVED_1016 | REMOVED_1017 |
1017 | REMOVED_1017 | REMOVED_1018 |
1018 | REMOVED_1018 | REMOVED_1019 |
1019 | REMOVED_1019 | REMOVED_1020 |
1020 | REMOVED_1020 | REMOVED_1021 |
1021 | REMOVED_1021 | REMOVED_1022 |
1022 | REMOVED_1022 | REMOVED_1023 |
1023 | REMOVED_1023 | XfDtShowAllMinW |
1024 | XfDtShowAllMinW | Linguistic boosting factor. Extension type: XfDtShow. Factor: minimum extension weight. |
1025 | XfDtShowAllMaxFFieldSet2Bm15FLogK0001 | Linguistic boosting factor. Extension type: XfDtShow. Factor: Bm15 over stream group 2. Maximum value of the factor across extensions. |
1026 | XfDtShowAllMaxFFieldSet3BclmWeightedFLogW0K0001 | Linguistic boosting factor. Extension type: XfDtShow. Factor: BclmWeightedFLogW0 over stream group 3. Maximum value of the factor across extensions. |
1027 | XfDtShowAllMaxFFieldSetUTBm15FLogW0 | Linguistic boosting factor. Extension type: XfDtShow. Factor: Bm15FLogW0 over URL and Title. Maximum value of the factor across extensions. |
1028 | XfDtShowAllMaxFTextCosineMatchMaxPrediction | Linguistic boosting factor. Extension type: XfDtShow. Factor: CosineMatchMaxPrediction over text and Title. Maximum value of the factor across extensions. |
1029 | XfDtShowAllMaxFUrlBm15K01 | Linguistic boosting factor. Extension type: XfDtShow. Factor: Bm15 over URL. Maximum value of the factor across extensions. |
1030 | XfDtShowAllMaxWFLongClickSPFullMatchValue | Linguistic boosting factor. Extension type: XfDtShow. Factor: FullMatchValue over the LongClickSP stream. Maximum weighted value of the factor across extensions. |
1031 | XfDtShowAllMaxWFOneClickFullMatchValue | Linguistic boosting factor. Extension type: XfDtShow. Factor: FullMatchValue over the OneClick stream. Maximum weighted value of the factor across extensions. |
1032 | XfDtShowAllSumW2FSumWFieldSet1Bm15FLogK0001 | Linguistic boosting factor. Extension type: XfDtShow. Factor: Bm15FLog over stream group 1. Weighted average of the factor multiplied by weight (\frac{\sum w_i * (w_i * f_i)}{\sum w_i}) across extensions. |
1033 | XfDtShowAllSumW2FSumWFieldSetUTBm15FLogW0 | Linguistic boosting factor. Extension type: XfDtShow. Factor: Bm15FLogW0 over URL and Title. Weighted average of the factor multiplied by weight (\frac{\sum w_i * (w_i * f_i)}{\sum w_i}) across extensions. |
1034 | XfDtShowAllSumWFSumWBodyMinWindowSize | Linguistic boosting factor. Extension type: XfDtShow. Factor: MinWindowSize in text. Weighted average of the factor across extensions. |
1035 | XfDtShowBagOfWordsFieldSetBagOfWordsOriginalRequestFractionExact | Linguistic boosting factor. Extension type: XfDtShow. Factor: OriginalRequestFractionExact over the stream group for bag-of-words factors (text, Title, annotation streams). |
1036 | XfDtShowBagOfWordsLongClickSPCosineMatchMaxPrediction | Linguistic boosting factor. Extension type: XfDtShow. Factor: bag-of-words CosineMatchMaxPrediction over the LongClickSP stream. |
1037 | XfDtShowBagOfWordsLongClickSPCosineMatchWeightedValue | Linguistic boosting factor. Extension type: XfDtShow. Factor: bag-of-words CosineMatchWeightedValue over the LongClickSP stream. |
1038 | XfDtShowBagOfWordsSimpleClickAnnotationMatchAvgValue | Linguistic boosting factor. Extension type: XfDtShow. Factor: bag-of-words AnnotationMatchAvgValue over the SimpleClick stream. |
1039 | XfDtShowBagOfWordsTitleCosineMaxMatch | Linguistic boosting factor. Extension type: XfDtShow. Factor: bag-of-words CosineMaxMatch over Title. |
1040 | XfDtShowTopMinWFFieldSet3BclmWeightedFLogW0K0001 | Linguistic boosting factor. Extension type: XfDtShow. Factor: BclmWeightedFLogW0 over stream group 3. Minimum weighted value of the factor over the top extensions. |
1041 | XfDtShowTopMinWFLongClickSPAnnotationMatchWeightedValue | Linguistic boosting factor. Extension type: XfDtShow. Factor: AnnotationMatchWeightedValue over the LongClickSP stream. Minimum weighted value of the factor over the top extensions. |
1042 | XfDtShowTopMinWFMaxWLongClickSPAnnotationMatchWeightedValue | Linguistic boosting factor. Extension type: XfDtShow. Factor: AnnotationMatchWeightedValue over the LongClickSP stream. Minimum weighted value of the factor over the top extensions, normalized by the maximum weight among the top extensions. |
1043 | XfDtShowTopSumW2FSumWBodyChain0Wcm | Linguistic boosting factor. Extension type: XfDtShow. Factor: Chain0Wcm in text. Weighted average of the factor multiplied by weight (\frac{\sum w_i * (w_i * f_i)}{\sum w_i}) over the top extensions. |
1044 | XfDtShowTopSumW2FSumWLongClickSPFullMatchValue | Linguistic boosting factor. Extension type: XfDtShow. Factor: FullMatchValue over the LongClickSP stream. Weighted average of the factor multiplied by weight (\frac{\sum w_i * (w_i * f_i)}{\sum w_i}) over the top extensions. |
1045 | XfDtShowTopSumW2FSumWOneClickFullMatchValue | Linguistic boosting factor. Extension type: XfDtShow. Factor: FullMatchValue over the OneClick stream. Weighted average of the factor multiplied by weight (\frac{\sum w_i * (w_i * f_i)}{\sum w_i}) over the top extensions. |
1046 | XfDtShowTopSumWFSumWFieldSet3BclmWeightedFLogW0K0001 | Linguistic boosting factor. Extension type: XfDtShow. Factor: BclmWeightedFLogW0 over stream group 3. Weighted average of the factor over the top extensions. |
1047 | OneClickFrcXfSpFullMatchPrediction | OneClickFrc, computed over the sampled period and collaboratively extended, FullMatchPrediction factor |
1048 | OneClickFrcXfSpAnnotationMatchPredictionWeighted | OneClickFrc, computed over the sampled period and collaboratively extended, AnnotationMatchPredictionWeighted factor |
1049 | OneClickFrcXfSpValueWcmAvg | OneClickFrc, computed over the sampled period and collaboratively extended, ValueWcmAvg factor |
1050 | OneClickFrcXfSpWcmMax | OneClickFrc, computed over the sampled period and collaboratively extended, WcmMax factor |
1051 | OneClickFrcXfSpWcmCoveragePrediction | OneClickFrc, computed over the sampled period and collaboratively extended, WcmCoveragePrediction factor |
1052 | OneClickFrcXfSpWcmCoverageMax | OneClickFrc, computed over the sampled period and collaboratively extended, WcmCoverageMax factor |
1053 | OneClickFrcXfSpPcmMax | OneClickFrc, computed over the sampled period and collaboratively extended, PcmMax factor |
1054 | OneClickFrcXfSpPrefixMatchCount | OneClickFrc, computed over the sampled period and collaboratively extended, PrefixMatchCount factor |
1055 | OneClickFrcXfSpSuffixMatchCount | OneClickFrc, computed over the sampled period and collaboratively extended, SuffixMatchCount factor |
1056 | OneClickFrcXfSpBm15V0W1K1 | OneClickFrc, computed over the sampled period and collaboratively extended, Bm15V0W1K1 factor |
1057 | IsLocalProbability | Value of the locality classifier for the query |
1058 | IsRelevLocaleRU | relev_locale == ru |
1059 | IsRelevLocaleUA | relev_locale == ua |
1060 | IsRelevLocaleBY | relev_locale == by |
1061 | IsRelevLocaleKZ | relev_locale == kz |
1062 | IsRelevLocaleTR | relev_locale == tr |
1063 | IsRelevLocaleWORLD | relev_locale == world |
1064 | QClassPornoVw | Porn query classification result from Wizard (iad_vw flag, based on Vowpal Wabbit) |
1065 | FullUrlFraction | Coverage of the URL by trigrams from the query. Analogous to the UrlDomainFraction and UrlPathAndParamsFraction factors. |
1066 | QueryDwellTimeFullMatchPrediction | QueryDwellTime, FullMatchPrediction factor |
1067 | QueryDwellTimeSynonymMatchPrediction | QueryDwellTime, SynonymMatchPrediction factor |
1068 | QueryDwellTimeAnnotationMatchPrediction | QueryDwellTime, AnnotationMatchPrediction factor |
1069 | QueryDwellTimeAnnotationMatchPredictionWeighted | QueryDwellTime, AnnotationMatchPredictionWeighted factor |
1070 | QueryDwellTimeQueryMatchPrediction | QueryDwellTime, QueryMatchPrediction factor |
1071 | QueryDwellTimeValueWcmAvg | QueryDwellTime, ValueWcmAvg factor |
1072 | QueryDwellTimeBclmPlainW1K3 | QueryDwellTime, BclmPlainW1K3 factor |
1073 | QueryDwellTimeBm15CoverageV4K3 | QueryDwellTime, Bm15CoverageV4K3 factor |
1074 | QueryDwellTimeBclmPlainK4 | QueryDwellTime, BclmPlainK4 factor |
1075 | QueryDwellTimeBocmWeightedV4K5 | QueryDwellTime, BocmWeightedV4K5 factor |
1076 | More90SecVisitsShare | The share of visits for which the time spent during the day on the host is more than 90 seconds |
1077 | More160SecVisitsShare | The share of visits for which the time spent during the day on the host is more than 160 seconds |
1078 | RankHackedNovaPhp | Rank of hacked sites |
1079 | RankAgs4 | Rank AGS4 |
1080 | MaxQsDocClassQsRankPthQuerySpam | Maximum QSRANK on the owner |
1081 | AvgQsRankOnNotSubdomainDocs | Average QSRANK on the main domain |
1082 | VisitorsReturnMonthShare | The share of users who returned within a month |
1083 | VisitorsReturnMonthNumber | The number of users returning within a month |
1084 | RankXitDoor | Rank of doorway sites |
1085 | AvgTitleCapitalLettersRatio | Share of capital letters in Title |
1086 | FromSearchShareNormalized | The share of incoming traffic from search engines among all incoming traffic |
1087 | GreenTrafficShareNormalized | The share of direct visits among all incoming traffic |
1088 | AvgQsFWnd500TOKEN | Average QSRANK in a sliding window |
1089 | MinOwnerQsRank | Minimum QSRANK |
1090 | AvgNumhops | Average HOPS |
1091 | UrlBm15K01 | Bm15K01 factor over hits from Url |
1092 | TitleBm15K01 | Bm15K01 factor over hits from Title |
1093 | TitleBocm15K001 | Bocm15K001 factor over hits from Title |
1094 | TextBm11Norm16384 | Bm11Norm16384 factor over hits from Text |
1095 | TextBocm11Norm256 | Bocm11Norm256 factor over hits from Text |
1096 | TextCosineMatchMaxPrediction | CosineMatchMaxPrediction factor over hits from Text |
1097 | FieldSet1Bm15FLogK0001 | Bm15FLogK0001 factor over hits from FieldSet1 stream |
1098 | FieldSet2Bm15FLogK0001 | Bm15FLogK0001 factor over hits from FieldSet2 stream |
1099 | FieldSet3BclmWeightedFLogW0K0001 | BclmWeightedFLogW0K0001 factor over hits from FieldSet3 stream |
1100 | FieldSetUTBm15FLogW0K00001 | Bm15FLogW0K00001 factor over hits from FieldSetUT stream |
1101 | BodyChain0Wcm | Chain0Wcm factor over hits from Body |
1102 | BodyPairMinProximity | PairMinProximity factor over hits from Body |
1103 | BodyMinWindowSize | MinWindowSize factor over hits from Body |
1104 | PopularSeFrcBrowserCosineMatchMaxPrediction | CosineMatchMaxPrediction factor over hits from PopularSeFrcBrowser stream |
1105 | DoubleFrcMixMatchWeightedValue | MixMatchWeightedValue factor over hits from DoubleFrc stream |
1106 | DoubleFrcAnnotationMaxValueWeighted | AnnotationMaxValueWeighted factor over hits from DoubleFrc stream |
1107 | DoubleFrcAnnotationMaxValue | AnnotationMaxValue factor over hits from DoubleFrc stream |
1108 | DoubleFrcAnnotationMatchWeightedValue | AnnotationMatchWeightedValue factor over hits from DoubleFrc stream |
1109 | DoubleFrcAllWcmWeightedValue | AllWcmWeightedValue factor over hits from DoubleFrc stream |
1110 | DoubleFrcAllWcmMatch95AvgValue | AllWcmMatch95AvgValue factor over hits from DoubleFrc stream |
1111 | DoubleFrcAllWcmWeightedPrediction | AllWcmWeightedPrediction factor over hits from DoubleFrc stream |
1112 | DoubleFrcAllWcmMatch80AvgValue | AllWcmMatch80AvgValue factor over hits from DoubleFrc stream |
1113 | DoubleFrcFullMatchValue | FullMatchValue factor over hits from DoubleFrc stream |
1114 | DoubleFrcFullMatchAnyValue | FullMatchAnyValue factor over hits from DoubleFrc stream |
1115 | DoubleFrcExactQueryMatchAvgValue | ExactQueryMatchAvgValue factor over hits from DoubleFrc stream |
1116 | OneClickFrcXfSpBclmMixPlainKE5 | BclmMixPlainKE5 factor over hits from OneClickFrcXfSp stream |
1117 | OneClickFrcXfSpBm15StrictAnnotationK01 | Bm15StrictAnnotationK01 factor over hits from OneClickFrcXfSp stream |
1118 | OneClickFrcXfSpAllWcmWeightedValue | AllWcmWeightedValue factor over hits from OneClickFrcXfSp stream |
1119 | OneClickFrcXfSpAllWcmWeightedPrediction | AllWcmWeightedPrediction factor over hits from OneClickFrcXfSp stream |
1120 | OneClickFrcXfSpAllWcmMatch80AvgValue | AllWcmMatch80AvgValue factor over hits from OneClickFrcXfSp stream |
1121 | OneClickFrcXfSpMixMatchWeightedValue | MixMatchWeightedValue factor over hits from OneClickFrcXfSp stream |
1122 | OneClickFrcXfSpAnnotationMatchWeightedValue | AnnotationMatchWeightedValue factor over hits from OneClickFrcXfSp stream |
1123 | OneClickFrcXfSpBclmPlaneProximity1Bm15W0Size1K0001 | BclmPlaneProximity1Bm15W0Size1K0001 factor over hits from OneClickFrcXfSp stream |
1124 | OneClickFrcXfSpBclmWeightedProximity1Bm15Size1K001 | BclmWeightedProximity1Bm15Size1K001 factor over hits from OneClickFrcXfSp stream |
1125 | BQPRSampleBclmMixPlainKE5 | BclmMixPlainKE5 factor over hits from BQPRSample stream |
1126 | BQPRSampleAllWcmWeightedValue | AllWcmWeightedValue factor over hits from BQPRSample stream |
1127 | BQPRSampleAllWcmWeightedPrediction | AllWcmWeightedPrediction factor over hits from BQPRSample stream |
1128 | BQPRSampleAllWcmMaxPrediction | AllWcmMaxPrediction factor over hits from BQPRSample stream |
1129 | BQPRSampleAllWcmMatch80AvgValue | AllWcmMatch80AvgValue factor over hits from BQPRSample stream |
1130 | BQPRSampleMixMatchWeightedValue | MixMatchWeightedValue factor over hits from BQPRSample stream |
1131 | BQPRSampleCosineMatchMaxPrediction | CosineMatchMaxPrediction factor over hits from BQPRSample stream |
1132 | BQPRSampleAnnotationMaxValueWeighted | AnnotationMaxValueWeighted factor over hits from BQPRSample stream |
1133 | BQPRSampleAnnotationMaxValue | AnnotationMaxValue factor over hits from BQPRSample stream |
1134 | BQPRSampleAnnotationMatchWeightedValue | AnnotationMatchWeightedValue factor over hits from BQPRSample stream |
1135 | BQPRSampleBocm15K001 | Bocm15K001 factor over hits from BQPRSample stream |
1136 | BQPRSampleBclmPlaneProximity1Bm15W0Size1K0001 | BclmPlaneProximity1Bm15W0Size1K0001 factor over hits from BQPRSample stream |
1137 | BQPRSampleBclmWeightedProximity1Bm15Size1K001 | BclmWeightedProximity1Bm15Size1K001 factor over hits from BQPRSample stream |
1138 | LongClickSPBclmPlaneProximity1Bm15W0Size1K0001 | BclmPlaneProximity1Bm15W0Size1K0001 factor over hits from LongClickSP stream |
1139 | LongClickSPBm15MaxAnnotationK001 | Bm15MaxAnnotationK001 factor over hits from LongClickSP stream |
1140 | LongClickSPFullMatchValue | FullMatchValue factor over hits from LongClickSP stream |
1141 | LongClickSPMixMatchWeightedValue | MixMatchWeightedValue factor over hits from LongClickSP stream |
1142 | LongClickSPCosineMatchMaxPrediction | CosineMatchMaxPrediction factor over hits from LongClickSP stream |
1143 | LongClickSPAnnotationMaxValue | AnnotationMaxValue factor over hits from LongClickSP stream |
1144 | LongClickSPAnnotationMaxValueWeighted | AnnotationMaxValueWeighted factor over hits from LongClickSP stream |
1145 | LongClickSPAnnotationMatchWeightedValue | AnnotationMatchWeightedValue factor over hits from LongClickSP stream |
1146 | LongClickSPAllWcmMatch95AvgValue | AllWcmMatch95AvgValue factor over hits from LongClickSP stream |
1147 | LongClickSPAllWcmWeightedValue | AllWcmWeightedValue factor over hits from LongClickSP stream |
1148 | LongClickSPAllWcmMaxMatch | AllWcmMaxMatch factor over hits from LongClickSP stream |
1149 | LongClickSPAllWcmWeightedPrediction | AllWcmWeightedPrediction factor over hits from LongClickSP stream |
1150 | LongClickSPBocm15K001 | Bocm15K001 factor over hits from LongClickSP stream |
1151 | LongClickSPQueryPrefixMatchOriginalWordValue | QueryPrefixMatchOriginalWordValue factor over hits from LongClickSP stream |
1152 | SamplePeriodDayFrcBclmPlaneProximity1Bm15W0Size1K0001 | BclmPlaneProximity1Bm15W0Size1K0001 factor over hits from SamplePeriodDayFrc stream |
1153 | SamplePeriodDayFrcAttenV1Bm15K05 | AttenV1Bm15K05 factor over hits from SamplePeriodDayFrc stream |
1154 | SamplePeriodDayFrcFullMatchValue | FullMatchValue factor over hits from SamplePeriodDayFrc stream |
1155 | SamplePeriodDayFrcFullMatchAnyValue | FullMatchAnyValue factor over hits from SamplePeriodDayFrc stream |
1156 | SamplePeriodDayFrcAllWcmWeightedValue | AllWcmWeightedValue factor over hits from SamplePeriodDayFrc stream |
1157 | SamplePeriodDayFrcAllWcmWeightedPrediction | AllWcmWeightedPrediction factor over hits from SamplePeriodDayFrc stream |
1158 | SamplePeriodDayFrcAllWcmMatch95AvgValue | AllWcmMatch95AvgValue factor over hits from SamplePeriodDayFrc stream |
1159 | SamplePeriodDayFrcAllWcmMatch80AvgValue | AllWcmMatch80AvgValue factor over hits from SamplePeriodDayFrc stream |
1160 | SamplePeriodDayFrcMixMatchWeightedValue | MixMatchWeightedValue factor over hits from SamplePeriodDayFrc stream |
1161 | SamplePeriodDayFrcAnnotationMatchWeightedValue | AnnotationMatchWeightedValue factor over hits from SamplePeriodDayFrc stream |
1162 | SamplePeriodDayFrcAnnotationMaxValue | AnnotationMaxValue factor over hits from SamplePeriodDayFrc stream |
1163 | SamplePeriodDayFrcAnnotationMaxValueWeighted | AnnotationMaxValueWeighted factor over hits from SamplePeriodDayFrc stream |
1164 | SamplePeriodDayFrcBocm15K001 | Bocm15K001 factor over hits from SamplePeriodDayFrc stream |
1165 | CorrectedCtrXFactorAllWcmWeightedValue | AllWcmWeightedValue factor over hits from CorrectedCtrXFactor stream |
1166 | CorrectedCtrXFactorAllWcmMaxPrediction | AllWcmMaxPrediction factor over hits from CorrectedCtrXFactor stream |
1167 | CorrectedCtrXFactorAllWcmWeightedPrediction | AllWcmWeightedPrediction factor over hits from CorrectedCtrXFactor stream |
1168 | CorrectedCtrXFactorAllWcmMatch80AvgValue | AllWcmMatch80AvgValue factor over hits from CorrectedCtrXFactor stream |
1169 | CorrectedCtrXFactorMixMatchWeightedValue | MixMatchWeightedValue factor over hits from CorrectedCtrXFactor stream |
1170 | CorrectedCtrXFactorAnnotationMatchWeightedValue | AnnotationMatchWeightedValue factor over hits from CorrectedCtrXFactor stream |
1171 | CorrectedCtrXFactorBclmPlaneProximity1Bm15W0Size1K001 | BclmPlaneProximity1Bm15W0Size1K001 factor over hits from CorrectedCtrXFactor stream |
1172 | CorrectedCtrXFactorBclmWeightedProximity1Bm15Size1K001 | BclmWeightedProximity1Bm15Size1K001 factor over hits from CorrectedCtrXFactor stream |
1173 | LongClickAllWcmMaxPrediction | AllWcmMaxPrediction factor over hits from LongClick stream |
1174 | LongClickMixMatchWeightedValue | MixMatchWeightedValue factor over hits from LongClick stream |
1175 | LongClickAnnotationMaxValueWeighted | AnnotationMaxValueWeighted factor over hits from LongClick stream |
1176 | LongClickFullMatchValue | FullMatchValue factor over hits from LongClick stream |
1177 | LongClickAnnotationMatchWeightedValue | AnnotationMatchWeightedValue factor over hits from LongClick stream |
1178 | SimpleClickAllWcmWeightedValue | AllWcmWeightedValue factor over hits from SimpleClick stream |
1179 | SimpleClickAllWcmWeightedPrediction | AllWcmWeightedPrediction factor over hits from SimpleClick stream |
1180 | SimpleClickAllWcmMaxPrediction | AllWcmMaxPrediction factor over hits from SimpleClick stream |
1181 | SimpleClickMixMatchWeightedValue | MixMatchWeightedValue factor over hits from SimpleClick stream |
1182 | SimpleClickAnnotationMatchWeightedValue | AnnotationMatchWeightedValue factor over hits from SimpleClick stream |
1183 | BrowserPageRankAnnotationMaxValueWeighted | AnnotationMaxValueWeighted factor over hits from BrowserPageRank stream |
1184 | BrowserPageRankAnnotationMatchWeightedValue | AnnotationMatchWeightedValue factor over hits from BrowserPageRank stream |
1185 | BrowserPageRankAnnotationMaxValue | AnnotationMaxValue factor over hits from BrowserPageRank stream |
1186 | BrowserPageRankBocm15K001 | Bocm15K001 factor over hits from BrowserPageRank stream |
1187 | OneClickMixMatchWeightedValue | MixMatchWeightedValue factor over hits from OneClick stream |
1188 | OneClickFullMatchValue | FullMatchValue factor over hits from OneClick stream |
1189 | OneClickAnnotationMatchWeightedValue | AnnotationMatchWeightedValue factor over hits from OneClick stream |
1190 | SplitDwellTimeAllWcmWeightedPrediction | AllWcmWeightedPrediction factor over hits from SplitDwellTime stream |
1191 | SplitDwellTimeBm15MaxAnnotationK001 | Bm15MaxAnnotationK001 factor over hits from SplitDwellTime stream |
1192 | QueryDwellTimeBclmWeightedProximity1Bm15Size1K0001 | BclmWeightedProximity1Bm15Size1K0001 factor over hits from QueryDwellTime stream |
1193 | QueryDwellTimeAttenV1Bm15K001 | AttenV1Bm15K001 factor over hits from QueryDwellTime stream |
1194 | QueryDwellTimeMixMatchWeightedValue | MixMatchWeightedValue factor over hits from QueryDwellTime stream |
1195 | QueryDwellTimeAnnotationMaxValueWeighted | AnnotationMaxValueWeighted factor over hits from QueryDwellTime stream |
1196 | QueryDwellTimeAnnotationMaxValue | AnnotationMaxValue factor over hits from QueryDwellTime stream |
1197 | QueryDwellTimeAnnotationMatchWeightedValue | AnnotationMatchWeightedValue factor over hits from QueryDwellTime stream |
1198 | QueryDwellTimeAllWcmWeightedValue | AllWcmWeightedValue factor over hits from QueryDwellTime stream |
1199 | QueryDwellTimeAllWcmMatch80AvgValue | AllWcmMatch80AvgValue factor over hits from QueryDwellTime stream |
1200 | RandomLogDBM35BclmPlaneProximity1Bm15W0Size1K0001 | BclmPlaneProximity1Bm15W0Size1K0001 factor over hits from RandomLogDBM35 stream |
1201 | RandomLogDBM35Bm15StrictAnnotationK001 | Bm15StrictAnnotationK001 factor over hits from RandomLogDBM35 stream |
1202 | RandomLogDBM35MixMatchWeightedValue | MixMatchWeightedValue factor over hits from RandomLogDBM35 stream |
1203 | RandomLogDBM35AnnotationMaxValueWeighted | AnnotationMaxValueWeighted factor over hits from RandomLogDBM35 stream |
1204 | RandomLogDBM35AnnotationMatchWeightedValue | AnnotationMatchWeightedValue factor over hits from RandomLogDBM35 stream |
1205 | RandomLogDBM35AllWcmWeightedValue | AllWcmWeightedValue factor over hits from RandomLogDBM35 stream |
1206 | RandomLogDBM35FullMatchValue | FullMatchValue factor over hits from RandomLogDBM35 stream |
1207 | RandomLogDBM35ExactQueryMatchAvgValue | ExactQueryMatchAvgValue factor over hits from RandomLogDBM35 stream |
1208 | IsRelevLocaleID | relev_locale == id |
1209 | IsMobileBeauty | Binary factor indicating whether the document is mobile-friendly. Taken from ERF. |
1210 | ForeignDomain | Equals 1 when fi_national_domain is 0 and Herf.NationalDomainid is filled in |
1211 | XfDtShowAllMaxWfQueryDwellTimeMixMatchWeightedValue | Linguistic boosting factor. Extension type: XfDtShow. Factor: MixMatchWeightedValue over the QueryDwellTime stream. Maximum weighted value of the factor across extensions. |
1212 | XfDtShowTopSumWFSumWQueryDwellTimeMixMatchWeightedValue | Linguistic boosting factor. Extension type: XfDtShow. Factor: MixMatchWeightedValue over the QueryDwellTime stream. Weighted average of the factor over the top extensions. |
1213 | XfDtShowTopMinWFQueryDwellTimeMixMatchWeightedValue | Linguistic boosting factor. Extension type: XfDtShow. Factor: MixMatchWeightedValue over the QueryDwellTime stream. Minimum weighted value of the factor over the top extensions. |
1214 | XfDtShowTopMinWFQueryDwellTimeAnnotationMatchWeightedValue | Linguistic boosting factor. Extension type: XfDtShow. Factor: AnnotationMatchWeightedValue over the QueryDwellTime stream. Minimum weighted value of the factor over the top extensions. |
1215 | XfDtShowAllMaxWFQueryDwellTimeAnnotationMatchWeightedValue | Linguistic boosting factor. Extension type: XfDtShow. Factor: AnnotationMatchWeightedValue over the QueryDwellTime stream. Maximum weighted value of the factor across extensions. |
1216 | XfDtShowTopSumWFSumWQueryDwellTimeAllWcmMatch95AvgValue | Linguistic boosting factor. Extension type: XfDtShow. Factor: AllWcmMatch95AvgValue over the QueryDwellTime stream. Weighted average of the factor over the top extensions. |
1217 | XfDtShowAllMaxWFBQPRSampleMixMatchWeightedValue | Linguistic boosting factor. Extension type: XfDtShow. Factor: MixMatchWeightedValue over the BQPRSample stream. Maximum weighted value of the factor across extensions. |
1218 | XfDtShowAllMaxWFDoubleFrcMixMatchWeightedValue | Linguistic boosting factor. Extension type: XfDtShow. Factor: MixMatchWeightedValue over the DoubleFrc stream. Maximum weighted value of the factor across extensions. |
1219 | DssmLongMiddleShortVsHardClicks | DSSM model trained on clicks. |
1220 | DssmLongVsMiddleShortNoClicks | DSSM model trained on clicks. |
1221 | DssmMiddleVsShortLongHardNoClicks | DSSM model trained on clicks. |
1222 | DssmShortVsMiddleLongHardNoClicks | DSSM model trained on clicks. |
1223 | DssmNOVsShortMiddleLongHardClicks | DSSM model trained on clicks. |
1224 | DssmLongVsShortMiddleHardClicks | DSSM model trained on clicks. |
1225 | DssmMiddleLongVsShortHardClicks | DSSM model trained on clicks. |
1226 | DssmShortMiddleLongVsHardNoClicks | DSSM model trained on clicks. |
1227 | Medical2UrlQuality | Neural model of content quality for medical subjects |
1228 | IsDesktopRequest | request came from yandsearch (rearr.is_desktop == 1) |
1229 | IsMobileRequest | request came from touchsearch (rearr.is_mobile == 1) |
1230 | IsTabletRequest | request came from padsearch (rearr.is_tablet == 1) |
1231 | RequestIsFromAndroid | request came from device with Android OS (rearr.dd_osfamily == Android) |
1232 | RequestIsFromIOS | request came from device with iOS (rearr.dd_osfamily == iOS) |
1233 | RequestIsFromWindows | request came from device with Windows OS (rearr.dd_osfamily == Windows) |
1234 | RequestIsFromOtherPlatform | request does not come from devices with Android, iOS or Windows OS (rearr.dd_osfamily != [Android, iOS, Windows]) |
1235 | EmbedVideoBroken | A broken embedded video on the page. |
1236 | CorrectedCtrLongPeriodFullMatchValue | FullMatchValue factor over hits from CorrectedCtrLongPeriod stream |
1237 | CorrectedCtrLongPeriodMixMatchWeightedValue | MixMatchWeightedValue factor over hits from CorrectedCtrLongPeriod stream |
1238 | CorrectedCtrLongPeriodAnnotationMaxValueWeighted | AnnotationMaxValueWeighted factor over hits from CorrectedCtrLongPeriod stream |
1239 | CorrectedCtrLongPeriodAnnotationMatchWeightedValue | AnnotationMatchWeightedValue factor over hits from CorrectedCtrLongPeriod stream |
1240 | CorrectedCtrLongPeriodAllWcmMatch95AvgValue | AllWcmMatch95AvgValue factor over hits from CorrectedCtrLongPeriod stream |
1241 | CorrectedCtrLongPeriodAllWcmMatch80AvgValue | AllWcmMatch80AvgValue factor over hits from CorrectedCtrLongPeriod stream |
1242 | CorrectedCtrLongPeriodAllWcmWeightedValue | AllWcmWeightedValue factor over hits from CorrectedCtrLongPeriod stream |
1243 | CorrectedCtrLongPeriodAllWcmWeightedPrediction | AllWcmWeightedPrediction factor over hits from CorrectedCtrLongPeriod stream |
1244 | Medical2UrlQualityFresh | Neural model of content quality for medical subjects (for ex -) |
1245 | NHopSumDwellTimeBclmMixPlainKE5 | BclmMixPlainKE5 factor over hits from NHopSumDwellTime stream |
1246 | NHopSumDwellTimeMatch80AvgValue | Match80AvgValue factor over hits from NHopSumDwellTime stream |
1247 | FinLawUrlQuality | Neural model of content quality for financial and legal topics |
1248 | NHopSumDwellTimeMixMatchWeightedValue | MixMatchWeightedValue factor over hits from NHopSumDwellTime stream |
1249 | FinLawUrlQualityFresh | Neural model of content quality for financial and legal topics (for exposures) |
1250 | FirstClickDtXfBclmMixPlainKE5 | BclmMixPlainKE5 factor over hits from FirstClickDtXf stream |
1251 | FirstClickDtXfFullMatchValue | FullMatchValue factor over hits from FirstClickDtXf stream |
1252 | FirstClickDtXfAnnotationMaxValueWeighted | AnnotationMaxValueWeighted factor over hits from FirstClickDtXf stream |
1253 | FirstClickDtXfAnnotationMatchWeightedValue | AnnotationMatchWeightedValue factor over hits from FirstClickDtXf stream |
1254 | FirstClickDtXfBclmPlaneProximity1Bm15W0Size1K0001 | BclmPlaneProximity1Bm15W0Size1K0001 factor over hits from FirstClickDtXf stream |
1255 | RequestWithRegionNameTextBm11Norm16384 | Linguistic boosting factor. Extension type: RequestWithRegionName. BM11 over the text and title of the document. |
1256 | RequestWithRegionNameTextCosineMatchMaxPrediction | Linguistic boosting factor. Extension type: RequestWithRegionName. CosineMatchMaxPrediction over the text and title of the document. |
1257 | RequestWithRegionNameLongClickAnnotationMatchWeightedValue | Linguistic boosting factor. Extension type: RequestWithRegionName. Factor: AnnotationMatchWeightedValue over the LongClick stream. |
1258 | RequestWithRegionNameOneClickFullMatchValue | Linguistic boosting factor. Extension type: RequestWithRegionName. Factor: FullMatchValue over the OneClick stream. |
1259 | RequestWithRegionNameOneClickAnnotationMatchValue | Linguistic boosting factor. Extension type: RequestWithRegionName. Factor: AnnotationMatchValue over the OneClick stream. |
1260 | RequestWithRegionNameLongClickSPAnnotationMatchWeightedValue | Linguistic boosting factor. Extension type: RequestWithRegionName. Factor: AnnotationMatchWeightedValue over the LongClickSP stream. |
1261 | RequestWithRegionNameLongClickSPFullMatchValue | Linguistic boosting factor. Extension type: RequestWithRegionName. Factor: FullMatchValue over the LongClickSP stream. |
1262 | RequestWithRegionNameBQPRSampleAnnotationMaxValueWeighted | Linguistic boosting factor. Extension type: RequestWithRegionName. Factor: AnnotationMaxValueWeighted over the BQPRSample stream. |
1263 | RequestWithRegionNameFieldSet1Bm15FLogK0001 | Linguistic boosting factor. Extension type: RequestWithRegionName. Factor: BM15 over stream group 1. |
1264 | RequestWithRegionNameFieldSet2Bm15FLogK0001 | Linguistic boosting factor. Extension type: RequestWithRegionName. Factor: BM15 over stream group 2. |
1265 | RequestWithRegionNameFieldSet3BclmWeightedFLogW0K0001 | Linguistic boosting factor. Extension type: RequestWithRegionName. Factor: BclmWeightedFLogW0 over stream group 3. |
1266 | RequestWithRegionNameBodyChain0Wcm | Linguistic boosting factor. Extension type: RequestWithRegionName. Chain0Wcm factor over the document text. |
1267 | QueryDocRandom | Random float in [0,1] by user request and document |
1268 | SosUrlQuality | Neural model of content quality for SOS topics |
1269 | SumFlashArea | the ratio of the total area of all Flash blocks to the screen area |
1270 | SosUrlQualityFresh | Neural model of content quality for SOS subjects (for ex -) |
1271 | UrlHostFraction | Copy of the old version of factor No. 294, added for use at the L3 stage only. Coverage of the domain's trigrams by the request. (Example: "Chelyabinsk lottery" and Chelloto. The request is transliterated, the covered trigrams are found (Che, Hel, Lot, Olo), and the share of all trigrams that are covered is computed.) |
1272 | UrlHitsCoverage | Fast version of FI_URL_DOMAIN_FRACTION |
1273 | AliceTimespentSum | Prediction of the session time, given this request-document pair. |
1274 | DssmDogL3 | Request-document DSSM, predicting the dog’s dog |
1275 | TiktokTag | The document is a TikTok /Ta selection. |
1276 | TiktokDiscovery | The document is a TikTok /Discover selection. |
1277 | TiktokMusic | The document is a TikTok /MUSI selection. |
1278 | DssmSinsigL2 | Request-document model Sinsiga. |
1279 | OriginalRequestUrlCosineMatchMaxPrediction | The factor for the original request. Computed over the tokenized URL. Algorithm: CosineMatchMaxPrediction. |
1280 | OriginalRequestUrlAttenV1Bm15K05 | The factor for the original request. Computed over the tokenized URL. The hit weight is multiplied by 1/(1 + the position of the word in the sentence). Word-weight aggregation algorithm: BM15. Normalization coefficient 0.5. |
1281 | OriginalRequestTitleBclmMixPlainKE5 | The factor for the original request. Computed over the document title. The word-weight aggregation algorithm is BclmMixPlain: a linear mixture of the annotation BCLM weights and the weighted positionless weights of the word; the resulting values are then aggregated through BM15. Normalization coefficient 10^(-5). |
1282 | OriginalRequestTitleCMMatchTop5AvgMatchValue | The factor for the original request. Computed over the document title. Algorithm: CMMatchTop5AvgMatchValue. |
1283 | OriginalRequestTitleWordCoverageForm | The factor for the original request. Computed over the document title. The degree of coverage of the request words, matched up to word form (without synonyms). |
1284 | OriginalRequestTitleAttenV1Bm15K05 | The factor for the original request. Computed over the document title. The hit weight is multiplied by 1/(1 + the position of the word in the sentence). Word-weight aggregation algorithm: BM15. Normalization coefficient 0.5. |
1285 | OriginalRequestBodyBclmMixPlainKE5 | The factor for the original request. Computed over the document body. The word-weight aggregation algorithm is BclmMixPlain: a linear mixture of the annotation BCLM weights and the weighted positionless weights of the word; the resulting values are then aggregated through BM15. Normalization coefficient 10^(-5). |
1286 | OriginalRequestBodyCosineMatchMaxPrediction | The factor for the original request. Computed over the document body. Algorithm: CosineMatchMaxPrediction. |
1287 | OriginalRequestBodyAllWcmWeightedPrediction | The factor for the original request. Computed over the document body. Algorithm: AllWcmWeightedPrediction. |
1288 | OriginalRequestBodyBocm15K001 | The factor for the original request. Computed over the document body. Word-weight aggregation algorithm: Bocm15. Normalization coefficient 0.01. |
1289 | OriginalRequestBodyQueryPartMatchSumValueAny | The factor for the original request. Computed over the document body. Algorithm: QueryPartMatchSumValueAny. |
1290 | OriginalRequestBodyWordCoverageForm | The factor for the original request. Computed over the document body. The degree of coverage of the request words, matched up to word form (without synonyms). |
1291 | OriginalRequestBodyWordCoverageExact | The factor for the original request. Computed over the document body. The degree of coverage of the request words in exact form. |
1292 | OriginalRequestBodyBm15MaxAnnotationK001 | The factor for the original request. Computed over the document body. Weight aggregation algorithm: Bm15MaxAnnotation. Normalization coefficient 0.01. |
1293 | HasCloaking | |
1294 | DssmFullSplitBert | |
1295 | SocialUrlIsVerified | The URL is a channel/post from a verified social network account |
1296 | DssmMimicrationUrl | DSSM, predicting whether the site is a facial |
1297 | Removed_1297 | |
1298 | Removed_1298 | |
1299 | MetaPolyGen8AllWcmMaxMatch | MetaPolyGen8 |
1300 | QueryDwellTimeCMMatch80AvgValue | CMMatch80AvgValue factor over hits from QueryDwellTime stream |
1301 | DoubleFrcCMMatchTop5AvgMatch | CMMatchTop5AvgMatch factor over hits from DoubleFrc stream |
1302 | OneClickFrcXfSpPerWordCMMaxMatchMin | PerWordCMMaxMatchMin factor over hits from OneClickFrcXfSp stream |
1303 | FirstClickDtXfPerWordCMMaxMatchMin | PerWordCMMaxMatchMin factor over hits from FirstClickDtXf stream |
1304 | XfDtShowAllMaxWFLongClickSPPerWordCMMaxMatchMin | Linguistic boosting factor. Extension type: XFDTSHOW. Factor: PerWordCMMaxMatchMin over the LongClickSP stream. The maximum weighted value of the factor over the extensions. |
1305 | XfDtShowAllMaxWFOneClickPerWordCMMaxMatchMin | Linguistic boosting factor. Extension type: XFDTSHOW. Factor: PerWordCMMaxMatchMin over the OneClick stream. The maximum weighted value of the factor over the extensions. |
1306 | XfDtShowTopMinWFFirstClickDtXfPerWordCMMaxMatchMin | Linguistic boosting factor. Extension type: XFDTSHOW. Factor: PerWordCMMaxMatchMin over the FirstClickDtXf stream. The minimum weighted value of the factor over the top extensions. |
1307 | Removed_1307 | |
1308 | Removed_1308 | |
1309 | DistanceToAnkara | The distance to Ankara from the city where the request was made |
1310 | DistanceToMagadan | The distance to Magadan from the city where the request was made |
1311 | Latitude | The geographic latitude of the city where the request was made |
1312 | Longitude | The geographic longitude of the city where the request was made |
1313 | LongClickMobileFullMatchValue | FullMatchValue factor over hits from LongClick stream (Mobile sessions filtered) |
1314 | LongClickMobileCosineMatchMaxPrediction | CosineMatchMaxPrediction factor over hits from LongClick stream (Mobile sessions filtered) |
1315 | LongClickMobileAnnotationMatchWeightedValue | AnnotationMatchWeightedValue factor over hits from LongClick stream (Mobile sessions filtered) |
1316 | LongClickMobileAllWcmMatch95AvgValue | AllWcmMatch95AvgValue factor over hits from LongClick stream (Mobile sessions filtered) |
1317 | LongClickMobileAllWcmWeightedValue | AllWcmWeightedValue factor over hits from LongClick stream (Mobile sessions filtered) |
1318 | LongClickMobileAllWcmWeightedPrediction | AllWcmWeightedPrediction factor over hits from LongClick stream (Mobile sessions filtered) |
1319 | LongClickMobileCMMatchTop5AvgValue | CMMatchTop5AvgValue factor over hits from LongClick stream (Mobile sessions filtered) |
1320 | LongClickMobileBm15MaxAnnotationK001 | Bm15MaxAnnotationK001 factor over hits from LongClick stream (Mobile sessions filtered) |
1321 | XfDtShowAllMaxWFLinksAllPerWordCMMaxMatchMin | Linguistic boosting factor. Extension type: XFDTSHOW. Factor: PerWordCMMaxMatchMin over incoming links. The maximum weighted value of the factor over the extensions. |
1322 | Removed_1322 | |
1323 | Removed_1323 | |
1324 | USLongPeriodUrlCtr | Static URL factor over search sessions for 1600 days. Ordinary CTR. |
1325 | USLongPeriodUrlDt3600Avg | Static URL factor over search sessions for 1600 days. Average dwell time; the dwell time is cut off if it exceeds 3600 seconds. |
1326 | Removed_1326 | |
1327 | USLongPeriodUrlDt180Avg | Static URL factor over search sessions for 1600 days. Average dwell time; the dwell time is cut off if it exceeds 180 seconds. |
1328 | USLongPeriodUrlLongClickProb | Static URL factor over search sessions for 1600 days. The probability that a click on the URL lasts more than 120 seconds. |
1329 | USLongPeriodUrlShows | Static URL factor over search sessions for 1600 days. Logarithm of the number of shows. |
1330 | Removed_1330 | |
1331 | USLongPeriodUrlWinsProb | Static URL factor over search sessions for 1600 days. The probability that the URL is clicked when at least one URL above it is not clicked. |
1332 | USLongPeriodUrlLossesProb | Static URL factor over search sessions for 1600 days. The probability that the URL is not clicked when at least one URL below it is clicked. |
1333 | USLongPeriodUrlCtrReg | Static URL factor over search sessions for 1600 days. Ordinary CTR. Localized to the country level. |
1334 | USLongPeriodUrlDt3600AvgReg | Static URL factor over search sessions for 1600 days. Average dwell time; the dwell time is cut off if it exceeds 3600 seconds. Localized to the country level. |
1335 | USLongPeriodUrlLongClickProbReg | Static URL factor over search sessions for 1600 days. The probability that a click on the URL lasts more than 120 seconds. Localized to the country level. |
1336 | USLongPeriodUrlPositionAvgReg | Static URL factor over search sessions for 1600 days. The average position of the URL across all requests. Localized to the country level. |
1337 | USLongPeriodUrlShowsReg | Static URL factor over search sessions for 1600 days. Logarithm of the number of shows. Localized to the country level. |
1338 | DssmLogDwellTimeBigrams | DSSM model trained on clicks. Takes bigrams into account. |
1339 | FirstLastClickMobileMixMatchWeightedValue | MixMatchWeightedValue factor over hits from FirstLastClick stream (Mobile sessions filtered) |
1340 | FirstLastClickMobileCosineMatchMaxPrediction | CosineMatchMaxPrediction factor over hits from FirstLastClick stream (Mobile sessions filtered) |
1341 | FirstLastClickMobileFullMatchValue | FullMatchValue factor over hits from FirstLastClick stream (Mobile sessions filtered) |
1342 | FirstLastClickMobileAllWcmMatch95AvgValue | AllWcmMatch95AvgValue factor over hits from FirstLastClick stream (Mobile sessions filtered) |
1343 | FirstLastClickMobileCMMatchTop5AvgValue | CMMatchTop5AvgValue factor over hits from FirstLastClick stream (Mobile sessions filtered) |
1344 | FirstLastClickMobileAllWcmWeightedValue | AllWcmWeightedValue factor over hits from FirstLastClick stream (Mobile sessions filtered) |
1345 | IsQvoice | Whether the request was made by voice |
1346 | AvgDTWeightedByRankMobileAllWcmWeightedValue | AllWcmWeightedValue factor over hits from AvgDTWeightedByRankMobile stream (Mobile sessions filtered) |
1347 | AvgDTWeightedByRankMobileAllWcmMatch95AvgValue | AllWcmMatch95AvgValue factor over hits from AvgDTWeightedByRankMobile stream (Mobile sessions filtered) |
1348 | AvgDTWeightedByRankMobileCMMatchTop5AvgValue | CMMatchTop5AvgValue factor over hits from AvgDTWeightedByRankMobile stream (Mobile sessions filtered) |
1349 | AvgDTWeightedByRankMobileAnnotationMatchWeightedValue | AnnotationMatchWeightedValue factor over hits from AvgDTWeightedByRankMobile stream (Mobile sessions filtered) |
1350 | AvgDTWeightedByRankMobileFullMatchValue | FullMatchValue factor over hits from AvgDTWeightedByRankMobile stream (Mobile sessions filtered) |
1351 | AvgDTWeightedByRankMobileMixMatchWeightedValue | MixMatchWeightedValue factor over hits from AvgDTWeightedByRankMobile stream (Mobile sessions filtered) |
1352 | XfDtShowTopSumW2FSumWFieldSet5AvgPerTrigramMaxValueAny | Linguistic boosting factor. Extension type: XFDTSHOW. Factor: AvgPerTrigramMaxValueAny over stream group 5. The weighted average value of the factor over the top extensions. |
1353 | CorrectedCtrLongPeriodAvgPerTrigramAvgValueAny | AvgPerTrigramAvgValueAny factor over hits from CorrectedCtrLongPeriod stream |
1354 | DssmLogDwelltimeBigramsL2 | DSSM model trained on clicks. Takes bigrams into account. Embeddings for documents are computed offline. |
1355 | RankArtroz | Rank of the quality of texts on the host. The higher the value, the greater the likelihood that the host is full of rewritten articles: poor copies of content ordered on content exchanges. Burning stronger as the before the aggregation. |
1356 | DssmBigramsQueryDerivativeMin | The minimum of the gradients under the bigram LogDwellTime model. |
1357 | DssmBigramsQueryDerivativeMax | The maximum of the gradients under the bigram LogDwellTime model. |
1358 | DssmBigramsQueryDerivativeMoment2Central | The second central moment (variance) of the gradients under the bigram LogDwellTime model. |
1359 | DssmBigramsQueryDerivativeMoment3Central | The third central moment of the gradients under the bigram LogDwellTime model. |
1360 | DssmVkPopularity | The probability that the VK.com host is popular for this request in accordance with the corresponding DSSM model. |
1361 | DssmOnlinerPopularity | The likelihood that the Onliner.by host is popular for this request according to the corresponding DSSM model. |
1362 | Removed_1362 | |
1363 | Removed_1363 | |
1364 | DssmRamblerPopularity | The probability that the Rambler.ru host is popular for this request in accordance with the corresponding DSSM model. |
1365 | DssmExpertcenPopularity | The likelihood that the ExpertCen.ru host is popular for this request in accordance with the corresponding DSSM model. |
1366 | DssmSunhomePopularity | The probability that the Sunhome.ru host is popular for this request according to the corresponding DSSM model. |
1367 | UBLongPeriodVisitsSNProb | Static URL factor in browser logs for the maximum period. The percentage of traffic from social networks in all traffic from other hosts and search. |
1368 | UBLongPeriodDirectHChildren90CntFromExtHost | Static URL factor in browser logs for the maximum period. The average number of direct descendants on the host on which users spent more than 90 seconds. A descendant is direct only if there is a link from our page to the descendant and the user followed it. |
1369 | UUBLongPeriodDepthFromExtHost | Static URL factor in browser logs for the maximum period. The average maximum depth of the tree rooted at the current URL when the URL is visited from other hosts. |
1370 | UBLongPeriodBrowseFrc | Static URL factor in browser logs for the maximum period. The ratio of the number of transitions to the page to the total number of pages visited from the SERP. The closer to 1, the more often the page was the only one opened in the session. |
1371 | UBLongPeriodAvgSearchDuration600 | Static URL factor in browser logs for the maximum period. The average duration of search sessions in which users navigated to the page from the SERP. |
1372 | UBLongPeriodSearchPercentEnd | Static URL factor in browser logs for the maximum period. For the formula used to compute the factor, see the Wiki. |
1373 | UBLongPeriodSearchPercentMiddle30 | Static URL factor in browser logs for the maximum period. For the formula used to compute the factor, see the Wiki. |
1374 | UBLongPeriodVisit120Prob | Static URL factor in browser logs for the maximum period. The probability that the user will spend more than 120 seconds on the page. |
1375 | UBLongPeriodLeavesCnt | Static URL factor in browser logs for the maximum period. The number of leaves in the URL's subtree. Here a leaf is a page from which there were no transitions. |
1376 | UBLongPeriodDtUrlHChildrenCut600 | Static URL factor in browser logs for the maximum period. The average time spent on the page and on all descendants of the page (URLs that users navigated to) on the host. Cut off if the total DT is more than 10 minutes. |
1377 | UBLongPeriodMinTimeWhenPageShow | Static URL factor in browser logs for the maximum period. The minimum Unix time when the page appeared in the logs for the first time. |
1378 | UBLongPeriodDeltaAvgMinTimeWhenPageShow | Static URL factor in browser logs for the maximum period. The difference between the mean and minimum Unix time when the page appeared in the logs. |
1379 | UBLongPeriodLatitude | Static URL factor in browser logs for the maximum period. Current latitude from which the page was viewed. |
1380 | UBLongPeriodLongitude | Static URL factor in browser logs for the maximum period. Current longitude from which the page was viewed. |
1381 | UBLongPeriodDownloadsProb | Static URL factor in browser logs for the maximum period. The probability of a download from the page. |
1382 | UBLongPeriodDownloadsImageProb | Static URL factor in browser logs for the maximum period. The probability of an image download from the page. |
1383 | UBLongPeriodDownloadsTorrentProb | Static URL factor in browser logs for the maximum period. The probability of a torrent file download from the page. |
1384 | UBLongPeriodSearchPercentEndReg | Static URL factor in browser logs for the maximum period. For the formula used to compute the factor, see the Wiki. Localized to the country level. |
1385 | UBLongPeriodLeavesCntReg | Static URL factor in browser logs for the maximum period. The number of leaves in the URL's subtree. Here a leaf is a page from which there were no transitions. Localized to the country level. |
1386 | UBLongPeriodDtUrlHChildrenCut600Reg | Static URL factor in browser logs for the maximum period. The average time spent on the page and on all descendants of the page (URLs that users navigated to) on the host. Cut off if the total DT is more than 10 minutes. Localized to the country level. |
1387 | MisspellLmNgrYandexDirectOriginal | Total score of the request words under the 3GRAMS-YANDEX-DIRECT language model. |
1388 | MisspellLmRtlNgrWebMtOriginal | Total score of the request words under the Web-Mt language model. |
1389 | UBLongPeriodRank | Static URL factor in browser logs for the maximum period. Rank, based on only UBLP meters, which allows you to find many SBR losses |
1390 | QfufTopSumWFSumWFieldSet3BclmWeightedFLogW0K0001 | Linguistic boosting factor. Extension type: QFUF. Factor: BclmWeightedFLogW0_K0.001 over FieldSet3. The weighted average value of the factor over the top 10 extensions. |
1391 | QueryToTextAllSumWFSumWBodyMinWindowSize | Linguistic boosting factor. Extension type: QueryToText. Factor: MinWindowSize over the document body. The weighted average value of the factor over the extensions. |
1392 | QueryToTextAllAvgW | Linguistic boosting factor. The average weight of extensions of type QueryToText. |
1393 | QfufAllSumWFSumWQueryDwellTimeMixMatchWeightedValue | Linguistic boosting factor. Extension type: QFUF. Factor: MixMatchWeightedValue over the QueryDwellTime stream. The weighted average value of the factor over the extensions. |
1394 | QueryToTextTopMinWFBodyMinWindowSize | Linguistic boosting factor. Extension type: QueryToText. Factor: MinWindowSize over the document body. The weighted average value of the factor over the top 10 extensions. |
1395 | QfufAllMaxFFieldSetUTBm15FLogW0K00001 | Linguistic boosting factor. Extension type: QFUF. Factor: Bm15FLogW0_K0.0001 over the URL and title. The maximum value of the factor over the extensions. |
1396 | QfufAllSumWFSumWFieldSet3BclmWeightedFLogW0K0001 | Linguistic boosting factor. Extension type: QFUF. Factor: BclmWeightedFLogW0_K0.001 over FieldSet3. The weighted average value of the factor over the extensions. |
1397 | QfufAllAvgW | Linguistic boosting factor. The average weight of extensions of type QFUF. |
1398 | QueryToTextAllSumFCountBodyPairMinProximity | Linguistic boosting factor. Extension type: QueryToText. Factor: PairMinProximity over the document body. The average value of the factor over the extensions. |
1399 | QfufAllTotalW | Linguistic boosting factor. Extension type: QFUF. The total weight of the extensions. |
1400 | QueryToTextAllSumFCountTextBocm11Norm256 | Linguistic boosting factor. Extension type: QueryToText. Factor: Bocm11_Norm256 over the document text. The average value of the factor over the extensions. |
1401 | QfufAllMaxFTextCosineMatchMaxPrediction | Linguistic boosting factor. Extension type: QFUF. Factor: CosineMatchMaxPrediction over the document text. The maximum value of the factor over the extensions. |
1402 | QfufTopSumW2FSumWFieldSet1Bm15FLogK0001 | Linguistic boosting factor. Extension type: QFUF. Factor: Bm15FLog_K0.001 over FieldSet1. The average value of the factor, weighted with squared weights, over the top 10 extensions by factor value. |
1403 | QfufAllMaxFTextBocm11Norm256 | Linguistic boosting factor. Extension type: QFUF. Factor: Bocm11_Norm256 over the document text. The maximum value of the factor over the extensions. |
1404 | QfufTopSumWFSumWFieldSetUTBm15FLogW0K00001 | Linguistic boosting factor. Extension type: QFUF. Factor: Bm15FLogW0_K0.0001 over the URL and title. The weighted average value of the factor over the extensions. |
1405 | DssmOneClickProbability | DSSM model trained on clicks, target=OneClicks/Clicks. Takes bigrams into account. |
1406 | DssmQueryDwellTime | DSSM model trained on clicks, target=QueryDwellTime stream value. Takes bigrams into account. |
1407 | AllMatchedWordWeightsSum | The normalized sum of the weights of the request words that occur in the text of the document or in links to it. |
1408 | StringMatchedWordWeightsSum | The normalized sum of the weights of the request words that are Equal_by_String in the text of the document or in links to it. |
1409 | AllMatchedWordWeightsSumText | The normalized sum of the weights of the request words that occur in the text of the document. |
1410 | AllMatchedWordWeightsSumLink | The normalized sum of the weights of the request words that occur in links to the document. |
1411 | StringMatchedWordWeightsSumLink | The normalized sum of the weights of the request words that are Equal_by_String in links to the document. |
1412 | AllMatchedWordFiltrationModelWeightsSum | The normalized sum of the FiltrationModel weights of the request words that occur in the text of the document or in links to it. |
1413 | StringMatchedWordFiltrationModelWeightsSum | The normalized sum of the FiltrationModel weights of the request words that are Equal_by_String in the text of the document or in links to it. |
1414 | LemmaMatchedWordFiltrationModelWeightsSum | The normalized sum of the FiltrationModel weights of the request words that are Equal_by_lemma in the text of the document or in links to it. |
1415 | AllMatchedWordFiltrationModelWeightsSumLink | The normalized sum of the FiltrationModel weights of the request words that occur in links to the document. |
1416 | StringMatchedWordFiltrationModelWeightsSumLink | The normalized sum of the FiltrationModel weights of the request words that are Equal_by_String in links to the document. |
1417 | QfufAllMaxFLinkAnnIndicatorAnnotationMaxValueWeighted | Linguistic boosting factor. Extension type: QFUF. Aggregation over all extensions. The maximum value of the factor. Computed over the LinkAnnIndicator stream from the link index. AnnotationMaxValueWeighted algorithm: the maximum weight (by MainWeights word weights) of annotation coverage, weighted by the annotation weight. |
1418 | QfufAllMaxWFLinkAnnIndicatorFullMatchValue | Linguistic boosting factor. Extension type: QFUF. Aggregation over all extensions. The maximum value of the factor. Computed over the LinkAnnIndicator stream from the link index. AnnotationMaxValueWeighted algorithm: the maximum weight (by MainWeights word weights) of annotation coverage, weighted by the annotation weight. |
1419 | XfDtShowAllMaxWFMaxWLinkAnnIndicatorPerWordCMMaxMatchMin | Linguistic boosting factor. Extension type: XFDTSHOW. Aggregation over all extensions. The maximum weighted value of the factor, normalized by the maximum extension weight. Computed over the LinkAnnIndicator stream from the link index. PerWordCMMaxMatchMin algorithm: the minimum by CMMaxMatch weight. |
1420 | XfDtShowKnnAllMaxWFMetaPolyGen8BclmMixPlainKE5 | Linguistic boosting factor. Extension type: XFDTSHOWKNN (nearest neighbors under a DSSM model trained to predict XFDTSHOW extensions). Aggregation over all extensions. The maximum weighted value of the factor. A mixture of many streams; the weight is computed by a fixed polynomial from the weights on this annotation. The word-weight aggregation algorithm is BclmMixPlain: a linear mixture of the annotation BCLM weights and the weighted positionless weights of the word; the resulting values are then aggregated through BM15. Normalization coefficient 10^(-5). |
1421 | XfDtShowKnnAllMaxWFMaxWCorrectedCtrLongPeriodWordCoverageForm | Linguistic boosting factor. Extension type: XFDTSHOWKNN (nearest neighbors under a DSSM model trained to predict XFDTSHOW extensions). Aggregation over all extensions. The maximum weighted value of the factor, normalized by the maximum extension weight. Stream: CorrectedCtrLongPeriod. The degree of coverage of the request words, matched up to word form (without synonyms). |
1422 | QfufAllMaxWFMaxWVpcgCorrectedClicksSLPExactQueryMatchAvgValue | Linguistic boosting factor. Extension type: QFUF. Aggregation over all extensions. The maximum weighted value of the factor, normalized by the maximum extension weight. The result of the VPCG over a very long period, data: CorrectedClicks. The average weight of the annotations among those in which the request was an exact match. |
1423 | XfDtShowKnnAllMaxWFMaxWCorrectedCtrLongPeriodBclmPlaneProximity1Bm15W0Size1K0001 | Linguistic boosting factor. Extension type: XFDTSHOWKNN (nearest neighbors under a DSSM model trained to predict XFDTSHOW extensions). Aggregation over all extensions. The maximum weighted value of the factor, normalized by the maximum extension weight. Stream: CorrectedCtrLongPeriod. The BclmPlaneProximity1Bm15W0Size1 algorithm: uses BCLM with free weight if the request has several words; if it has a single word, the sum of hits is used as the match. Normalization coefficient 0.001. |
1424 | XfDtShowKnnAllAvgW | Linguistic boosting factor. Extension type: XFDTSHOWKNN (nearest neighbors under a DSSM model trained to predict XFDTSHOW extensions). Aggregation over all extensions. The average weight of the extensions. |
1425 | DssmLanguageClassifierRusL2 | Document DSSM model Language Classifier Rus. |
1426 | DssmLanguageClassifierEngL2 | Document DSSM model Language Classifier Eng. |
1427 | DssmLanguageClassifierOthL2 | Document DSSM model Language Classifier Other. |
1428 | Removed_1428 | |
1429 | Removed_1429 | |
1430 | alice_aramusic_dssm | |
1431 | AliceMusicRelevanceDssm | DSSM Prediction to determine Alice’s irrelevant answers |
1432 | RandomLogQueryAvgNews | The average value of News over the year. Calculated offline. |
1433 | RandomLogQueryAvgAddTime | The average value of AddTime over the year. Calculated offline. |
1434 | RandomLogQueryAvgTxtHiRelSy | The average value of TxtHiRelSy over the year. Calculated offline. |
1435 | RandomLogQueryAvgTextLike | The average value of TextLike over the year. Calculated offline. |
1436 | RandomLogQueryAvgHasNoAllWordsTRSy | The average value of HasNoAllWordsTRSy over the year. Calculated offline. |
1437 | RandomLogQueryAvgIsForum | The average value of IsForum over the year. Calculated offline. |
1438 | RandomLogQueryAvgHasPayments | The average value of HasPayments over the year. Calculated offline. |
1439 | RandomLogQueryAvgYabarHostAvgTime2 | The average value of YabarHostAvgTime2 over the year. Calculated offline. |
1440 | RandomLogQueryAvgYabarUrlVisitors | The average value of YabarUrlVisitors over the year. Calculated offline. |
1441 | RandomLogQueryAvgQueryDOwnerOnlyClickRate | The average value of QueryDOwnerOnlyClickRate over the year. Calculated offline. |
1442 | RandomLogQueryAvgDaterAge | The average value of DaterAge over the year. Calculated offline. |
1443 | RandomLogQueryAvgLongestText | The average value of LongestText over the year. Calculated offline. |
1444 | RandomLogQueryAvgDifferentInternalLinks | The average value of DifferentInternalLinks over the year. Calculated offline. |
1445 | RandomLogQueryAvgQueryDOwnerOnlyClickRate_Reg | The average value of QueryDOwnerOnlyClickRate_Reg over the year. Calculated offline. |
1446 | RandomLogQueryAvgIsHub | The average value of IsHub over the year. Calculated offline. |
1447 | Removed_1447 | |
1448 | RandomLogQueryAvgBM25_0 | The average value of BM25_0 for the query over the year. Calculated offline. |
1449 | RandomLogQueryAvgBocm | The average value of Bocm over the year. Calculated offline. |
1450 | RandomLogQueryAvgIsIndexPage | The average value of IsIndexPage over the year. Calculated offline. |
1451 | RandomLogQueryAvgQueriesAvgCM2 | The average value of QueriesAvgCM2 over the year. Calculated offline. |
1452 | RandomLogQueryAvgBrowserHostDownloadProbability | The average value of BrowserHostDownloadProbability over the year. Calculated offline. |
1453 | RandomLogQueryAvgRegBrowserUserHub | The average value of RegBrowserUserHub over the year. Calculated offline. |
1454 | RandomLogQueryAvgAuxTitleBM25 | The average value of AuxTitleBM25 over the year. Calculated offline. |
1455 | RandomLogQueryAvgQueryUrlCorrectedCtrXfactor | The average value of QueryUrlCorrectedCtrXfactor over the year. Calculated offline. |
1456 | RandomLogQueryAvgQueryToDocAllSumFCountTextBm11Norm16384 | The average value of QueryToDocAllSumFCountTextBm11Norm16384 over the year. Calculated offline. |
1457 | RandomLogQueryAvgXfDtShowAllSumWFSumWBodyMinWindowSize | The average value of XfDtShowAllSumWFSumWBodyMinWindowSize over the year. Calculated offline. |
1458 | RandomLogQueryClicksWeightedAvgIsMainPage | The click-weighted average value of IsMainPage over the year. Calculated offline. |
1459 | RandomLogQueryClicksWeightedAvgYabarUrlAvgTime | The click-weighted average value of YabarUrlAvgTime over the year. Calculated offline. |
1460 | RandomLogQueryClicksWeightedAvgDifferentInternalLinks | The click-weighted average value of DifferentInternalLinks over the year. Calculated offline. |
1461 | RandomLogQueryDwelltimeWeightedAvgUrlDomainFraction | The dwell-time-weighted average value of UrlDomainFraction over the year. Calculated offline. |
1462 | BM25FdPRFixedNoLinks | BM25FdPR normalized by the average document length, which depends on the document's language. Only texts are used. |
1463 | DomainHasMetrika | Whether the owner has Metrika. |
1464 | HasSideblock | The document has a turbo page for Mobile platform. |
1465 | HistoricalAnnotationCount | Document annotations count in the whole history of the Search (DSSM AnnReg models helper) |
1466 | HistoricalAnnWordCount | Document annotation words count in the whole history of the Search (DSSM AnnReg models helper) |
1467 | HistoricalAnnRegionCount | Document annotation regions count in the whole history of the Search (DSSM AnnReg models helper) |
1468 | Removed_1468 | |
1469 | Removed_1469 | |
1470 | NoApproxSmallWindowAttenuation | |
1471 | NoApproxSmallWindowAttenuationQ | |
1472 | DssmMainContentKeywords | Query-MainContentKeywords similarity, target: logDwellTime |
1473 | YellownessMax | Maximum value of domain yellowness (based on Toloka) |
1474 | YellownessMean | Mean value of domain yellowness (based on Toloka) |
1475 | YellownessMedian | Median of domain yellowness (based on Toloka) |
1476 | YellownessMin | Minimum value of domain yellowness (based on Toloka) |
1477 | DssmBoostingXfWeightQuerySelfSimilarity | Dssm Boosting query self similarity for XfWeight model. |
1478 | DssmBoostingXfWeightKMeans5AvgTop02Score | Dssm Boosting AvgTop02Score aggregation for XfWeight model over 5-means centroids. |
1479 | DssmBoostingXfWeightKMeans5AvgTop04Score | Dssm Boosting AvgTop04Score aggregation for XfWeight model over 5-means centroids. |
1480 | DssmBoostingXfWeightKMeans5AvgTop02ScoreAvgClusterTop3Weighted | Dssm Boosting AvgTop02ScoreAvgClusterTop3Weighted aggregation for XfWeight model over 5-means centroids. |
1481 | DssmBoostingXfWeightKMeans5AvgTop02ScoreQE | Dssm Boosting AvgTop02Score aggregation for XfWeight model over 5-means centroids (query as expansion). |
1482 | DssmBoostingXfWeightKMeans5AvgTop02ScoreAvgClusterTop3WeightedQE | Dssm Boosting AvgTop02ScoreAvgClusterTop3Weighted aggregation for XfWeight model over 5-means centroids (query as expansion). |
1483 | DssmBoostingXfOneQuerySelfSimilarity | Dssm Boosting query self similarity for XfOne model. |
1484 | DssmBoostingXfOneKMeans1Score | Dssm Boosting Score aggregation for XfOne model over 1-means centroids. |
1485 | DssmBoostingXfOneKMeans1ScaledSumWeight | Dssm Boosting ScaledSumWeight aggregation for XfOne model over 1-means centroids. |
1486 | DssmBoostingXfOneKMeans1ScoreQE | Dssm Boosting Score aggregation for XfOne model over 1-means centroids (query as expansion). |
1487 | DssmBoostingXfOneKMeans1ScoreAvgNearest1WeightedQE | Dssm Boosting ScoreAvgNearest1Weighted aggregation for XfOne model over 1-means centroids (query as expansion). |
1488 | DssmBoostingXfOneKMeans1ScoreAvgNearest5WeightedQE | Dssm Boosting ScoreAvgNearest5Weighted aggregation for XfOne model over 1-means centroids (query as expansion). |
1489 | DssmBoostingXfOneSeKMeans1Score | Dssm Boosting Score aggregation for XfOneSe model over 1-means centroids. |
1490 | DssmBoostingXfOneSeKMeans1ScoreScaledSumWeighted | Dssm Boosting ScoreScaledSumWeighted aggregation for XfOneSe model over 1-means centroids. |
1491 | DssmBoostingXfOneSeKMeans1ScoreAvgNearest5Weighted | Dssm Boosting ScoreAvgNearest5Weighted aggregation for XfOneSe model over 1-means centroids. |
1492 | DssmBoostingCtrQuerySelfSimilarity | Dssm Boosting query self similarity for Ctr model. |
1493 | DssmBoostingCtrKMeans1Score | Dssm Boosting Score aggregation for Ctr model over 1-means centroids. |
1494 | DssmBoostingCtrKMeans1ScoreQE | Dssm Boosting Score aggregation for Ctr model over 1-means centroids (query as expansion). |
1495 | DssmBoostingCtrKMeans1ScoreScaledSumWeightedQE | Dssm Boosting ScoreScaledSumWeighted aggregation for Ctr model over 1-means centroids (query as expansion). |
1496 | DssmBoostingCtrKMeans1ScoreAvgNearest1WeightedQE | Dssm Boosting ScoreAvgNearest1Weighted aggregation for Ctr model over 1-means centroids (query as expansion). |
1497 | YellownessDispersion | Yellowness distribution dispersion of domain (based on Toloka) |
1498 | VpcgCorrectedClicksSLPPerWordCMMaxPredictionMin | The result of the VPCG over a very long period, data: CorrectedClicks. Factor: FullMatchPrediction. |
1499 | VpcgCorrectedClicksSLPAllWcmMatch95AvgValue | The result of the VPCG over a very long period, data: CorrectedClicks. Factor: AllWcmMatch95AvgValue. |
1500 | VpcgCorrectedClicksSLPCMMatchTop5AvgValue | The result of the VPCG over a very long period, data: CorrectedClicks. Factor: CMMatchTop5AvgValue. |
1501 | VpcgCorrectedClicksSLPAnnotationMaxValueWeighted | The result of the VPCG over a very long period, data: CorrectedClicks. Factor: AnnotationMaxValueWeighted. |
1502 | VpcgCorrectedClicksSLPMixMatchWeightedValue | The result of the VPCG over a very long period, data: CorrectedClicks. Factor: MixMatchWeightedValue. |
1503 | VpcgCorrectedClicksSLPCMMatchTop5AvgPrediction | The result of the VPCG over a very long period, data: CorrectedClicks. Factor: CMMatchTop5AvgPrediction. |
1504 | DssmCtrNoMiner | DSSM model trained on CTRs without miner. |
1505 | DssmPageQualityRTHub | DSSM prediction (URL + Title), trained on the page_quality signal and implemented in RTHub, first slot. |
1506 | DssmPageQualityRTHubSlot2 | DSSM prediction (URL + Title), trained on the page_quality signal and implemented in RTHub, second slot. |
1507 | DssmQueryEmbeddingCtrNoMinerPca0 | Principal component of the query embedding from the DssmCtrNoMiner model. |
1508 | DssmQueryEmbeddingCtrNoMinerPca1 | Principal component of the query embedding from the DssmCtrNoMiner model. |
1509 | DssmQueryEmbeddingCtrNoMinerPca2 | Principal component of the query embedding from the DssmCtrNoMiner model. |
1510 | DssmQueryEmbeddingCtrNoMinerPca3 | Principal component of the query embedding from the DssmCtrNoMiner model. |
1511 | DssmQueryEmbeddingCtrNoMinerPca4 | Principal component of the query embedding from the DssmCtrNoMiner model. |
1512 | DssmQueryEmbeddingCtrNoMinerPca5 | Principal component of the query embedding from the DssmCtrNoMiner model. |
1513 | DssmQueryUrlTitleRegChainClicksOdd | DSSM model trained on click odd pool |
1514 | DssmQueryUrlTitleRegChainClicksPers | DSSM model trained on click personalization pool |
1515 | DssmQueryUrlTitleRegChainClicksTrFull | DSSM model trained on click triangle pool |
1516 | RequestWithRegionNameLinkAnnFloatMultiplicityCMMatchTop5AvgMatchValue | Linguistic boosting factor. Expansion type: RequestWithRegionName. Factor: CMMatchTop5AvgMatchValue on the FloatMultiplicity stream of the LinkAnn index. |
1517 | Removed_1517 | |
1518 | LinkAnnLinkAnnFloatMultiplicityPerWordAMMaxValueMin | Linguistic boosting factor. Factor: PerWordAMMaxValueMin on the FloatMultiplicity stream of the LinkAnn index. |
1519 | LinkAnnFloatMultiplicityAttenV1Bm15K001 | Linguistic boosting factor. Factor: AttenV1Bm15K001 on the FloatMultiplicity stream of the LinkAnn index. |
1520 | LinkAnnLinkExternalBocm11Norm256 | Linguistic boosting factor. Factor: Bocm11Norm256 on the IsExternal stream of the LinkAnn index. |
1521 | Removed_1521 | |
1522 | RequestWithRegionNameLinkAnnFloatMultiplicityAnnotationMaxValue | Linguistic boosting factor. Expansion type: RequestWithRegionName. Factor: AnnotationMaxValue on the FloatMultiplicity stream of the LinkAnn index. |
1523 | DssmLogDtBigramsAMHardQueriesNoClicks | DSSM model trained on clicks without miner (with no-clicks and AM-hard negatives). Takes bigrams into account. |
1524 | RandomLogHostHasPaymentsAvg | AVG aggregation of HasPayments web factor using random log |
1525 | RandomLogHostIsVideoQueryAvg | AVG aggregation of VideoQuery web factor using random log |
1526 | RandomLogHostSyntQualityAvg | AVG aggregation of SyntQuality web factor using random log |
1527 | RandomLogHostGeoRegionalityVNewPerc90 | PERCENTILE_90 aggregation of GeoRegionalityVNew web factor using random log |
1528 | RandomLogHostQClassDownloadAvg | AVG aggregation of QClassDownload web factor using random log |
1529 | RandomLogHostIsMusicAvg | AVG aggregation of IsMusic web factor using random log |
1530 | RandomLogHostQueryThEncyclopedicPerc25 | PERCENTILE_25 aggregation of QueryThEncyclopedic web factor using random log |
1531 | RandomLogHostCommercialOwnerRankRegAvg | AVG aggregation of CommercialOwnerRank_Reg web factor using random log |
1532 | RandomLogHostYabarWordDNGIPerc25 | PERCENTILE_25 aggregation of YabarWordDepthNodesGradientMin web factor using random log |
1533 | RandomLogHostPopularSEFRCBrowserAvg | AVG aggregation of PopularSEFRCBrowser web factor using random log |
1534 | RandomLogHostURLClicksMaxGeoRegionFRCRatioAvg | AVG aggregation of URLClicksMaxGeoRegionFRCRatio web factor using random log |
1535 | RandomLogHostUBLongPeriodDirectHChildren90CntPerc90 | PERCENTILE_90 aggregation of UBLongPeriodDirectHChildren90CntFromExtHost web factor using random log |
1536 | RandomLogHostUBLongPeriodDtUrlHChildrenPerc90 | PERCENTILE_90 aggregation of UBLongPeriodDtUrlHChildrenCut600Reg web factor using random log |
1537 | RandomLogHostIsPictureAvg | AVG aggregation of IsPicture web factor using random log |
1538 | RandomLogHostErratumLogQueryProbabilityAvg | AVG aggregation of ErratumLogQueryProbability web factor using random log |
1539 | Removed_1539 | |
1540 | Removed_1540 | |
1541 | Removed_1541 | |
1542 | DssmQueryCountryToUrlEstimatedDistance | The length of a click from this country, predicted from the query and the country using a DSSM model. |
1543 | DssmRandomLogQueryAvgNews | The average value of News over the year, predicted using a neural network. |
1544 | DssmRandomLogQueryAvgAddTime | The average value of AddTime over the year, predicted using a neural network. |
1545 | DssmRandomLogQueryAvgTxtHiRelSy | The average value of TxtHiRelSy over the year, predicted using a neural network. |
1546 | DssmRandomLogQueryAvgTextLike | The average value of TextLike over the year, predicted using a neural network. |
1547 | DssmRandomLogQueryAvgHasNoAllWordsTRSy | The average value of HasNoAllWordsTRSy over the year, predicted using a neural network. |
1548 | DssmRandomLogQueryAvgIsForum | The average value of IsForum over the year, predicted using a neural network. |
1549 | DssmRandomLogQueryAvgHasPayments | The average value of HasPayments over the year, predicted using a neural network. |
1550 | DssmRandomLogQueryAvgYabarHostAvgTime2 | The average value of YabarHostAvgTime2 over the year, predicted using a neural network. |
1551 | DssmRandomLogQueryAvgYabarUrlVisitors | The average value of YabarUrlVisitors over the year, predicted using a neural network. |
1552 | DssmRandomLogQueryAvgQueryDOwnerOnlyClickRate | The average value of QueryDOwnerOnlyClickRate over the year, predicted using a neural network. |
1553 | DssmRandomLogQueryAvgDaterAge | The average value of DaterAge over the year, predicted using a neural network. |
1554 | DssmRandomLogQueryAvgLongestText | The average value of LongestText over the year, predicted using a neural network. |
1555 | DssmRandomLogQueryAvgDifferentInternalLinks | The average value of DifferentInternalLinks over the year, predicted using a neural network. |
1556 | DssmRandomLogQueryAvgQueryDOwnerOnlyClickRate_Reg | The average value of QueryDOwnerOnlyClickRate_Reg over the year, predicted using a neural network. |
1557 | Removed_1557 | |
1558 | Removed_1558 | |
1559 | AliceMusicUrlTypeIsTrack | The type of the canonicalized Yandex Music URL is track. |
1560 | DssmRandomLogQueryAvgBocm | The average value of Bocm over the year, predicted using a neural network. |
1561 | DssmRandomLogQueryAvgIsIndexPage | The average value of IsIndexPage over the year, predicted using a neural network. |
1562 | DssmRandomLogQueryAvgQueriesAvgCM2 | The average value of QueriesAvgCM2 over the year, predicted using a neural network. |
1563 | DssmRandomLogQueryAvgBrowserHostDownloadProbability | The average value of BrowserHostDownloadProbability over the year, predicted using a neural network. |
1564 | DssmRandomLogQueryAvgRegBrowserUserHub | The average value of RegBrowserUserHub over the year, predicted using a neural network. |
1565 | DssmRandomLogQueryAvgAuxTitleBM25 | The average value of AuxTitleBM25 over the year, predicted using a neural network. |
1566 | DssmRandomLogQueryAvgQueryUrlCorrectedCtrXfactor | The average value of QueryUrlCorrectedCtrXfactor over the year, predicted using a neural network. |
1567 | DssmRandomLogQueryAvgQueryToDocAllSumFCountTextBm11Norm16384 | The average value of QueryToDocAllSumFCountTextBm11Norm16384 over the year, predicted using a neural network. |
1568 | DssmRandomLogQueryAvgXfDtShowAllSumWFSumWBodyMinWindowSize | The average value of XfDtShowAllSumWFSumWBodyMinWindowSize over the year, predicted using a neural network. |
1569 | DssmRandomLogQueryClicksWeightedAvgIsMainPage | The click-weighted average value of IsMainPage over the year, predicted using a neural network. |
1570 | DssmRandomLogQueryClicksWeightedAvgYabarUrlAvgTime | The click-weighted average value of YabarUrlAvgTime over the year, predicted using a neural network. |
1571 | DssmRandomLogQueryClicksWeightedAvgDifferentInternalLinks | The click-weighted average value of DifferentInternalLinks over the year, predicted using a neural network. |
1572 | DssmRandomLogQueryDwelltimeWeightedAvgUrlDomainFraction | The dwell-time-weighted average value of UrlDomainFraction over the year, predicted using a neural network. |
1573 | XfDtShowKnnAllMaxWFFieldSet3BclmWeightedFLogW0K0001 | Linguistic boosting factor. Expansion type: XfDtShowKnn. Factor: BclmWeightedFLogW0 on stream group 3. Maximum weighted value of the factor. |
1574 | XfDtShowKnnAllMaxWFFieldSet2Bm15FLogK0001 | Linguistic boosting factor. Expansion type: XfDtShowKnn. Factor: Bm15FLog on stream group 2. Maximum weighted value of the factor. |
1575 | XfDtShowKnnBagOfWordsFieldSetBagOfWordsOriginalRequestFraction | Linguistic boosting factor. Expansion type: XfDtShowKnn. Factor: OriginalRequestFraction on the FieldSetBagOfWords stream. |
1576 | XfDtShowKnnAllMaxWFSumWQueryDwellTimeMixMatchWeightedValue | Linguistic boosting factor. Expansion type: XfDtShowKnn. Factor: MixMatchWeightedValue on the QueryDwellTime stream. Maximum weighted value of the factor, normalized by the total weight. |
1577 | XfDtShowKnnAllSumW2FSumWTitleBm15K01 | Linguistic boosting factor. Expansion type: XfDtShowKnn. Factor: Bm15 on the Title stream. Sum of the weighted factor values multiplied by weight, normalized by the total weight (\frac{\sum w_i \cdot (w_i \cdot f_i)}{\sum w_i}). |
1578 | XfDtShowKnnTopMinFFieldSet3BclmWeightedFLogW0K0001 | Linguistic boosting factor. Expansion type: XfDtShowKnn. Factor: BclmWeightedFLogW0 on stream group 3. Minimum value of the factor over the top expansions. |
1579 | XfDtShowKnnAllSumW2FSumWFieldSet3BclmWeightedFLogW0K0001 | Linguistic boosting factor. Expansion type: XfDtShowKnn. Factor: BclmWeightedFLogW0 on stream group 3. Sum of the weighted factor values multiplied by weight, normalized by the total weight (\frac{\sum w_i \cdot (w_i \cdot f_i)}{\sum w_i}). |
1580 | XfDtShowKnnAllMaxWFFieldSet1Bm15FLogK0001 | Linguistic boosting factor. Expansion type: XfDtShowKnn. Factor: Bm15FLog on stream group 1. Maximum weighted value of the factor. |
1581 | XfDtShowKnnAllSumWFSumWFieldSet1Bm15FLogK0001 | Linguistic boosting factor. Expansion type: XfDtShowKnn. Factor: Bm15FLog on stream group 1. Sum of the weighted factor values, normalized by the total weight. |
1582 | XfDtShowKnnBagOfWordsLongClickSPAnnotationMatchAvgValue | Linguistic boosting factor. Expansion type: XfDtShowKnn. Factor: bag-of-words AnnotationMatchAvgValue on the LongClickSP stream. |
1583 | XfDtShowKnnTopSumW2FSumWFieldSet1Bm15FLogK0001 | Linguistic boosting factor. Expansion type: XfDtShowKnn. Factor: Bm15FLog on stream group 1. Sum of the weighted factor values multiplied by weight over the top expansions, normalized by the total weight of the top expansions (\frac{\sum w_i \cdot (w_i \cdot f_i)}{\sum w_i}). |
1584 | XfDtShowKnnTopMinWFMaxWFieldSet1Bm15FLogK0001 | Linguistic boosting factor. Expansion type: XfDtShowKnn. Factor: Bm15FLog on stream group 1. Minimum weighted value of the factor over the top expansions, normalized by the maximum weight among the top expansions. |
1585 | XfDtShowKnnAllMaxWFSumWBodyPairMinProximity | Linguistic boosting factor. Expansion type: XfDtShowKnn. Factor: PairMinProximity on the Body stream. Maximum weighted value of the factor, normalized by the total weight. |
1586 | XfDtShowKnnAllSumW2FSumWFieldSet1Bm15FLogK0001 | Linguistic boosting factor. Expansion type: XfDtShowKnn. Factor: Bm15FLog on stream group 1. Sum of the weighted factor values multiplied by weight, normalized by the total weight (\frac{\sum w_i \cdot (w_i \cdot f_i)}{\sum w_i}). |
1587 | XfDtShowKnnBagOfWordsSimpleClickAnnotationMatchAvgValue | Linguistic boosting factor. Expansion type: XfDtShowKnn. Factor: bag-of-words AnnotationMatchAvgValue on the SimpleClick stream. |
1588 | XfDtShowKnnBagOfWordsTitleCosineMaxMatch | Linguistic boosting factor. Expansion type: XfDtShowKnn. Factor: bag-of-words CosineMaxMatch on the Title stream. |
1589 | Regionality5LocalizationProbability | The predicted probability that the query is localized according to the regionality5 rule. |
1590 | Removed_1590 | |
1591 | Removed_1591 | |
1592 | Removed_1592 | |
1593 | Removed_1593 | |
1594 | DocumentHasFioFromOriginalRequest | Document has Fio from original request |
1595 | PageQualityExperiment1 | Page Quality 1 experiments factor |
1596 | DssmLogDtBigramsAMHardQueriesNoClicksMixed | DSSM model trained on clicks without miner (with no-clicks and am_hard negatives 50/50 and then on am_hard negatives only). Takes bigrams into account. |
1597 | DssmBoostingXfOneSeAmSsHardKMeans1Score | Dssm Boosting Score aggregation for XfOneSeAmSsHard model over 1-means centroids. |
1598 | DssmBoostingXfOneSeAmSsHardKMeans1ScoreAvgClusterTop3Weighted | Dssm Boosting ScoreAvgClusterTop3Weighted aggregation for XfOneSeAmSsHard model over 1-means centroids. |
1599 | PageQualityExperiment2 | Page Quality 2 experiments factor |
1600 | YellownessImgMax | Average by url maximum yellowness of teaser image |
1601 | YellownessImgAvg | Average by url average yellowness of teaser image |
1602 | YellowImgShare | Ratio of yellow images in teasers on host |
1603 | YellowImgCount | Average yellow images count on host |
1604 | TeasersCount | Average teasers count on host |
1605 | TeasersArea | Average teasers area on host |
1606 | YellownessTxtMin | Average by url minimum yellowness of teaser text |
1607 | YellownessTxtAvg | Average by url average yellowness of teaser text |
1608 | HasAdvClickableBG | Background is clickable advertisement |
1609 | AdvNetsArea | Average ratio of adverts on screen |
1610 | AdvNetsAreaFirstPage | Ratio of adverts on screen on main page |
1611 | AdvNetsCount | Average count of adverts on screen |
1612 | AdvTraffOutShareDesktop | Ratio of outgoing advertisement traffic to all traffic (desktop) |
1613 | RTBTraffOutShareDesktop | Ratio of outgoing real-time bidding traffic to all traffic (desktop) |
1614 | NewsAgencyRating | Rating of news agency from agencies.json (Yandex.News resource) |
1615 | QueryToTextByXfDtShowKnnAllSumW2FSumWTextBocm11Norm256 | Linguistic boosting factor. Expansion type: QueryToTextByXfDtShowKnn. Factor: Bocm11Norm256 on the Text stream. Sum of the weighted factor values multiplied by weight (\frac{\sum w_i \cdot (w_i \cdot f_i)}{\sum w_i}). |
1616 | QueryToTextByXfDtShowKnnTopSumW2FSumWBodyMinWindowSize | Linguistic boosting factor. Expansion type: QueryToTextByXfDtShowKnn. Factor: MinWindowSize on the Body stream. Sum of the weighted factor values multiplied by weight over the top expansions, normalized by the total weight of the top expansions (\frac{\sum w_i \cdot (w_i \cdot f_i)}{\sum w_i}). |
1617 | QueryToTextByXfDtShowKnnAllSumW2FSumWBodyMinWindowSize | Linguistic boosting factor. Expansion type: QueryToTextByXfDtShowKnn. Factor: MinWindowSize on the Body stream. Sum of the weighted factor values multiplied by weight, normalized by the total weight (\frac{\sum w_i \cdot (w_i \cdot f_i)}{\sum w_i}). |
1618 | QueryToTextByXfDtShowKnnTopSumW2FSumWTextBocm11Norm256 | Linguistic boosting factor. Expansion type: QueryToTextByXfDtShowKnn. Factor: Bocm11Norm256 on the Text stream. Sum of the weighted factor values multiplied by weight over the top expansions (\frac{\sum w_i \cdot (w_i \cdot f_i)}{\sum w_i}). |
1619 | QueryToTextByXfDtShowKnnAllMinW | Linguistic boosting factor. Expansion type: QueryToTextByXfDtShowKnn. Minimum expansion weight. |
1620 | QueryToTextByXfDtShowKnnAllAvgW | Linguistic boosting factor. Expansion type: QueryToTextByXfDtShowKnn. Arithmetic mean of the expansion weights. |
1621 | QueryToTextByXfDtShowKnnAllTotalW | Linguistic boosting factor. Expansion type: QueryToTextByXfDtShowKnn. Total weight of the expansions. |
1622 | QueryToTextByXfDtShowKnnBagOfWordsFieldSetBagOfWordsOriginalRequestFraction | Linguistic boosting factor. Expansion type: QueryToTextByXfDtShowKnn. Factor: OriginalRequestFraction on the FieldSetBagOfWords stream. |
1623 | PageQualityExperiment3 | Page Quality 3 experiments factor |
1624 | DssmBoostingXfOneSeAmSsHardQueryMutationAddFixedYearWordRenormedDistance | Characterizes the query by how much it changes when a fixed word (a year number) is added; the DssmBoostingXfOneSeAmSsHard DSSM model is used. |
1625 | DssmBoostingXfOneSeAmSsHardQueryMutationAddOnlineWordRenormedDistance | Characterizes the query by how much it changes when a fixed word (‘online’ in Cyrillic) is added; the DssmBoostingXfOneSeAmSsHard DSSM model is used. |
1626 | DssmBoostingXfOneSeAmSsHardQueryMutationDelSiteWordRenormedDistance | Characterizes the query by how much it changes when a fixed word (‘site’ in Cyrillic) is removed; the DssmBoostingXfOneSeAmSsHard DSSM model is used. |
1627 | DocSourceFresh | A document from the hearts with fresh |
1628 | RandomLogWordMaxHasNoTr | For each word, the average HasNoTr value over 3 months is computed offline. The maximum of this value over all query words is then taken. |
1629 | RandomLogWordMaxIsLJ | For each word, the average IsLJ value over 3 months is computed offline. The maximum of this value over all query words is then taken. |
1630 | Removed_1630 | |
1631 | RandomLogWordMinBclmLite | For each word, the average BclmLite value over 3 months is computed offline. The minimum of this value over all query words is then taken. |
1632 | RandomLogWordSkipStopWordsMaxDBM40 | For each word, the average DBM40 value over 3 months is computed offline. The maximum of this value over all non-stop-word query words is then taken. |
1633 | RandomLogWordSkipStopWordsMaxIsDesktopRequest | For each word, the average IsDesktopRequest value over queries in 3 months is computed offline. The maximum of this value over all non-stop-word query words is then taken. |
1634 | RandomLogWordMaxRLQAvgHasNoAllWordsTrSyn | For each word, the average RLQAvgHasNoAllWordsTrSyn value over queries in 3 months is computed offline. The maximum of this value over all query words is then taken. |
1635 | RandomLogWordMaxDssmAggregatedAnnReg | For each word, the average DssmAggregatedAnnReg value over queries in 3 months is computed offline. The maximum of this value over all query words is then taken. |
1636 | RandomLogWordMaxMetaNumUrlsPerHostFixed | For each word, the average MetaNumUrlsPerHostFixed value over queries in 3 months is computed offline. The maximum of this value over all query words is then taken. |
1637 | RandomLogWordSkipStopWordsMaxSDIsNavMxQueryMax | For each word, the average SDIsNavMxQueryMax value over queries in 3 months is computed offline. The maximum of this value over all non-stop-word query words is then taken. |
1638 | RandomLogHostVisitsFromWikiAvg | AVG aggregation of VisitsFromWiki web factor using random log |
1639 | PageQualityExperiment4 | Page Quality 4 experiments factor |
1640 | RandomLogHostNavLinearPerc25 | PERCENTILE_25 aggregation of NavLinear web factor using random log |
1641 | RandomLogHostFoundPerc90 | PERCENTILE_90 aggregation of Found web factor using random log |
1642 | RandomLogHostSubqueryThMatchAvg | AVG aggregation of SubqueryThMatch web factor using random log |
1643 | PageQualityExperiment5 | Page Quality 5 experiments factor |
1644 | RandomLogHostSegmentWordPortionFromMainContentAvg | AVG aggregation of SegmentWordPortionFromMainContent web factor using random log |
1645 | RandomLogHostXfDtShowAllMaxFFieldSet2Bm15FLogK0001Avg | AVG aggregation of XfDtShowAllMaxFFieldSet2Bm15FLogK0001 web factor using random log |
1646 | RandomLogHostQueryRegionSizeAvg | AVG aggregation of QueryRegionSize web factor using random log |
1647 | DocFromWebTier1 | The document came from Webtier1 |
1648 | RandomLogHostIsRelevLocaleUAAvg | AVG aggregation of IsRelevLocaleUA web factor using random log |
1649 | RandomLogHostQfufAllSumWFSumWFieldSet3BclmWeightedFLogW0K0001Perc90 | PERCENTILE_90 aggregation of QfufAllSumWFSumWFieldSet3BclmWeightedFLogW0K0001 web factor using random log |
1650 | RandomLogHostDssmBoostingCtrQuerySelfSimilarityPerc90 | PERCENTILE_90 aggregation of DssmBoostingCtrQuerySelfSimilarity web factor using random log |
1651 | RandomLogHostQueryToDocAllSumFCountTextBocm11Norm256Avg | AVG aggregation of QueryToDocAllSumFCountTextBocm11Norm256 web factor using random log. NOTE: QueryToDocAllSumFCountTextBocm11Norm256 has been removed. |
1652 | RandomLogHostIsNavMxQueryPerc90 | PERCENTILE_90 aggregation of IsNavMxQuery web factor using random log |
1653 | DocFromPlatinum0 | The document came from Platinum0 |
1654 | RandomLogHostDBM15Wares2Avg | AVG aggregation of DBM15Wares2 web factor using random log |
1655 | RandomLogHostUrlNGramsModelPerc90 | PERCENTILE_90 aggregation of UrlNGramsModel web factor using random log |
1656 | UnexpectedTrashUrlQuality | Neural document model for finding unexpected trash |
1657 | MedicalHostQualityFresh | Medical host quality fresh. |
1658 | RandomLogHostDssmBoostingCtrKMeans1ScoreScaledSumWeightedQEPerc25 | PERCENTILE_25 aggregation of DssmBoostingCtrKMeans1ScoreScaledSumWeightedQE web factor using random log |
1659 | RandomLogHostLongClickMobileAllWcmWeightedValuePerc90 | PERCENTILE_90 aggregation of LongClickMobileAllWcmWeightedValue web factor using random log |
1660 | RandomLogHostDssmVkPopularityPerc25 | PERCENTILE_25 aggregation of DssmVkPopularity web factor using random log |
1661 | RandomLogHostUBLongPeriodVisitsSNProbAvg | AVG aggregation of UBLongPeriodVisitsSNProb web factor using random log |
1662 | RandomLogHostCountryQueryRegionalityPerc90 | PERCENTILE_90 aggregation of CountryQueryRegionality web factor using random log |
1663 | RandomLogHostTRhitwPerc90 | PERCENTILE_90 aggregation of TRhitw web factor using random log |
1664 | RandomLogHostUBLongPeriodAvgSearchDuration600Perc90 | PERCENTILE_90 aggregation of UBLongPeriodAvgSearchDuration600 web factor using random log |
1665 | RandomLogHostRequestIsFromIOSAvg | AVG aggregation of RequestIsFromIOS web factor using random log |
1666 | RandomLogHostDssmQueryEmbeddingCtrNoMinerPca4Perc90 | PERCENTALE_90 aggregation of DssmQueryEmbeddingCtrNoMinerPca4 web factor using random log |
1667 | RandomLogHostXfDtShowAllMaxFFieldSetUTBm15FLogW0Avg | AVG aggregation of XfDtShowAllMaxFFieldSetUTBm15FLogW0 web factor using random log |
1668 | RandomLogHostUrlTrigramsPerc25 | PERCENTALE_25 aggregation of UrlTrigrams web factor using random log |
1669 | RandomLogHostDssmQueryEmbeddingCtrNoMinerPca1Perc90 | PERCENTALE_90 aggregation of DssmQueryEmbeddingCtrNoMinerPca1 web factor using random log |
1670 | RandomLogHostIsRelevLocaleKZAvg | AVG aggregation of IsRelevLocaleKZ web factor using random log |
1671 | RandomLogHostTextFeaturesPerc90 | PERCENTALE_90 aggregation of TextFeatures web factor using random log |
1672 | HasJsFromMarketgidCom | 1 if the host includes JS from marketgid.com |
1673 | HasJsFromRfityCom | 1 if the host includes JS from rfity.com |
1674 | DssmGoogleSpecificity | DSSM prediction of google specificity for query |
1675 | OwnerWebsiteAttention | Site owner pays attention to site details (at least once in quarter) |
1676 | Removed1676 | ChatScore |
1677 | ChatScore | Chat info. positive / events or zero |
1678 | HostPlayerViewDepth | Host player info. Relation between view time and video duration |
1679 | HasJsFromGoogleAnalyticsCom | 1 if the host includes JS from google-analytics.com |
1680 | HasJsFromGoogleApisCom | 1 if the host includes JS from googleapis.com |
1681 | HasJsFromFacebookNet | 1 if the host includes JS from facebook.net |
1682 | HasJsFromMcYandexRu | 1 if the host includes JS from mc.yandex.ru |
1683 | KnnRandomLogQueryAvgAddTime | The average value of RandomLogQueryAvgAddTime over the nearest queries found by KNN. |
1684 | KnnRandomLogQueryAvgTxtHiRelSy | The average value of RandomLogQueryAvgTxtHiRelSy over the nearest queries found by KNN. |
1685 | KnnRandomLogQueryAvgTextLike | The average value of RandomLogQueryAvgTextLike over the nearest queries found by KNN. |
1686 | KnnRandomLogQueryAvgIsForum | The average value of RandomLogQueryAvgIsForum over the nearest queries found by KNN. |
1687 | KnnRandomLogQueryAvgHasPayments | The average value of RandomLogQueryAvgHasPayments over the nearest queries found by KNN. |
1688 | KnnRandomLogQueryAvgDifferentInternalLinks | The average value of RandomLogQueryAvgDifferentInternalLinks over the nearest queries found by KNN. |
1689 | KnnRandomLogQueryAvgIsTargetBussinessCard | The average value of RandomLogQueryAvgIsTargetBussinessCard over the nearest queries found by KNN. |
1690 | KnnRandomLogQueryAvgQueryToDocAllSumFCountTextBm11Norm16384 | The average value of RandomLogQueryAvgQueryToDocAllSumFCountTextBm11Norm16384 over the nearest queries found by KNN. |
1691 | KnnRandomLogQueryAvgXfDtShowAllSumWFSumWBodyMinWindowSize | The average value of RandomLogQueryAvgXfDtShowAllSumWFSumWBodyMinWindowSize over the nearest queries found by KNN. |
1692 | HostSpeedFromSpylog | Host speed estimation |
1693 | HostOfficial | Is site official |
1694 | Removed_1694 | HostCy100log |
1695 | HostCy100log | Estimate of quality links from good sites |
1696 | HostNevasca2ShareWeight | Weight sum of each non-unique nevasca shingle |
1697 | HostNevasca2FreshWeek | Nevasca shingle quantity in last week |
1698 | HostTrafgraphInGT_share_d | Greentraffic share (aka direct visits). Desktop |
1699 | HostTrafgraphInGT_share_m | Greentraffic share (aka direct visits). Mobile |
1700 | HostGreenTrafficDesktop_log | Greentraffic absolute (desktop) |
1701 | HostReturnRateMonth | Visits averaged by user |
1702 | HostBizKernel | |
1703 | HostBizKernelQuantile | |
1704 | HasVideo | 1 if video on page |
1705 | WebVideoPCtrNewAllWcmMaxMatch | Stream PCtrNew from yandex video |
1706 | WebVideoPCtrNewCMMatchTop5AvgMatch | Stream PCtrNew from yandex video |
1707 | WebVideoPCtrNewAnnotationMaxValueWeighted | Stream PCtrNew from yandex video |
1708 | WebVideoPCtrNewCMMatchTop5AvgPrediction | Stream PCtrNew from yandex video |
1709 | WebVideoPCtrNewFullMatchValue | Stream PCtrNew from yandex video |
1710 | WebVideoPCtrNewPerWordCMMaxMatchMin | Stream PCtrNew from yandex video |
1711 | HasTurbo | The document has a turbo page. Depends on the platform |
1712 | MedicalHostQualityMetric | Medical host quality for metric. |
1713 | RequestWithoutVerbsTitleBm15K01 | The original query with verbs removed. Computed over the document title. Word-weight aggregation algorithm: BM15. Normalization coefficient 0.1. |
1714 | RequestWithoutVerbsFieldSetUTBm15FLogW0K00001 | The original query with verbs removed. Computed over a composite stream consisting of the tokenized URL and the document title. Word-weight aggregation algorithm: BM15FLOGW0. Normalization coefficient 0.0001. |
1715 | RequestWithoutVerbsSumWBodyMinWindowSize | The original query with verbs removed. Computed over the document body. The minimum window size that contains all query words, normalized by the number of words in the query. |
1716 | RequestWithoutVerbsAllMaxFUrlBm15K01 | The original query with verbs removed. Computed over the tokenized URL. Word-weight aggregation algorithm: BM15. Normalization coefficient 0.1. |
1717 | RandomLogHostLongRmse | RMSE aggregation of Long web factor using random log |
1718 | RandomLogHostIsOrgRmse | RMSE aggregation of IsOrg web factor using random log |
1719 | RandomLogHostGskUrlModelRmse | RMSE aggregation of GskUrlModel web factor using random log |
1720 | RandomLogHostDaterStatsAverageSourceSegmentRmse | RMSE aggregation of DaterStatsAverageSourceSegment web factor using random log |
1721 | RandomLogHostVisitsFromWikiRmse | RMSE aggregation of VisitsFromWiki web factor using random log |
1722 | RandomLogHostXfDtShowBagOfWordsTitleCosineMaxMatchRmse | RMSE aggregation of XfDtShowBagOfWordsTitleCosineMaxMatch web factor using random log |
1723 | RandomLogHostUBLongPeriodDownloadsProbRmse | RMSE aggregation of UBLongPeriodDownloadsProb web factor using random log |
1724 | RandomLogHostMetaAvgIsNotCgiRmse | RMSE aggregation of MetaAvgIsNotCgi meta factor using random log |
1725 | RandomLogHostMetaRmsSynPercentBadWordPairsRmse | RMSE aggregation of MetaRmsSynPercentBadWordPairs meta factor using random log |
1726 | RandomLogHostMetaPosTrigramsProbRmse | RMSE aggregation of MetaPosTrigramsProb meta factor using random log |
1727 | RandomLogHostBocmPerc90 | PERCENTALE_90 aggregation of Bocm web factor using random log |
1728 | RandomLogHostSegmentWordPortionFromMainContentPerc90 | PERCENTALE_90 aggregation of SegmentWordPortionFromMainContent web factor using random log |
1729 | RandomLogHostIsMobileBeautyPerc90 | PERCENTALE_90 aggregation of IsMobileBeauty web factor using random log |
1730 | RandomLogHostUSLongPeriodUrlWinsProbPerc90 | PERCENTALE_90 aggregation of USLongPeriodUrlWinsProb web factor using random log |
1731 | RandomLogHostDssmBoostingXfWeightKMeans5AvgTop02ScoreQEPerc90 | PERCENTALE_90 aggregation of DssmBoostingXfWeightKMeans5AvgTop02ScoreQE web factor using random log |
1732 | RandomLogHostDssmBoostingCtrKMeans1ScorePerc90 | PERCENTALE_90 aggregation of DssmBoostingCtrKMeans1Score web factor using random log |
1733 | RandomLogHostSDIsNavMxQueryMaxPerc90 | PERCENTALE_90 aggregation of SDIsNavMxQueryMax meta factor using random log |
1734 | RandomLogHostMetaWeb764Web1076ProductInvAvgPerc90 | PERCENTALE_90 aggregation of MetaWeb764Web1076ProductInvAvg meta factor using random log |
1735 | RandomLogHostMetaWeb1099Web1219ProductInvPosPerc90 | PERCENTALE_90 aggregation of MetaWeb1099Web1219ProductInvPos meta factor using random log |
1736 | RandomLogHostMetaMaxDssmMiddleVsShortLongHardNoClicksPerc90 | PERCENTALE_90 aggregation of MetaMaxDssmMiddleVsShortLongHardNoClicks meta factor using random log |
1737 | RandomLogHostNumLinksFromMPMax | MAX aggregation of NumLinksFromMP web factor using random log |
1738 | RandomLogHostNavLinearMax | MAX aggregation of NavLinear web factor using random log |
1739 | RandomLogHostDaterStatsAverageSourceSegmentMax | MAX aggregation of DaterStatsAverageSourceSegment web factor using random log |
1740 | RandomLogHostWeightedSumIsIndexPageIsNavMxQueryMax | MAX aggregation of WeightedSumIsIndexPageIsNavMxQuery web factor using random log |
1741 | RandomLogHostQueryToDocAllSumFCountTextBocm11Norm256Max | MAX aggregation of QueryToDocAllSumFCountTextBocm11Norm256 web factor using random log. NOTE: QueryToDocAllSumFCountTextBocm11Norm256 has been removed. |
1742 | RandomLogHostDssmBigramsQueryDerivativeMaxMax | MAX aggregation of DssmBigramsQueryDerivativeMax web factor using random log |
1743 | RandomLogHostDssmQueryCountryToUrlEstimatedDistanceMax | MAX aggregation of DssmQueryCountryToUrlEstimatedDistance web factor using random log |
1744 | RandomLogHostMetaWeb764Web1076ProductInvAvgMax | MAX aggregation of MetaWeb764Web1076ProductInvAvg meta factor using random log |
1745 | RandomLogHostTextFeaturesLogAvg | LOGAVG aggregation of TextFeatures web factor using random log |
1746 | RandomLogHostDocLenLogAvg | LOGAVG aggregation of DocLen web factor using random log |
1747 | RandomLogHostIsHTMLLogAvg | LOGAVG aggregation of IsHTML web factor using random log |
1748 | RandomLogHostHasLevensht1QueryFragmentLogAvg | LOGAVG aggregation of HasLevensht1QueryFragment web factor using random log |
1749 | RandomLogHostHeadingIdfSumFixedLogAvg | LOGAVG aggregation of HeadingIdfSumFixed web factor using random log |
1750 | RandomLogHostAdvPronounsPortionLogAvg | LOGAVG aggregation of AdvPronounsPortion web factor using random log |
1751 | RandomLogHostLongestTextLogAvg | LOGAVG aggregation of LongestText web factor using random log |
1752 | RandomLogHostCountryHourLogAvg | LOGAVG aggregation of CountryHour web factor using random log |
1753 | RandomLogHostMetrikaUrlAvgTimeLogAvg | LOGAVG aggregation of MetrikaUrlAvgTime web factor using random log |
1754 | RandomLogHostWikiLinkCountLogAvg | LOGAVG aggregation of WikiLinkCount web factor using random log |
1755 | RandomLogHostBrowserUrlDwellTimeRegionFrcLogAvg | LOGAVG aggregation of BrowserUrlDwellTimeRegionFrc web factor using random log |
1756 | RandomLogHostWikiInfoboxLogAvg | LOGAVG aggregation of WikiInfobox web factor using random log |
1757 | RandomLogHostQueryDocTitleRangesMatchingScoreLogAvg | LOGAVG aggregation of QueryDocTitleRangesMatchingScore web factor using random log |
1758 | RandomLogHostIsMobileBeautyLogAvg | LOGAVG aggregation of IsMobileBeauty web factor using random log |
1759 | RandomLogHostQueryToTextAllSumWFSumWBodyMinWindowSizeLogAvg | LOGAVG aggregation of QueryToTextAllSumWFSumWBodyMinWindowSize web factor using random log |
1760 | RandomLogHostDssmRandomLogQueryAvgDifferentInternalLinksLogAvg | LOGAVG aggregation of DssmRandomLogQueryAvgDifferentInternalLinks web factor using random log |
1761 | RandomLogHostMetaUrlDirectChildrenCntLogAvg | LOGAVG aggregation of MetaUrlDirectChildrenCnt meta factor using random log |
1762 | RandomLogHostMetaWeb1241Web1299ProductInvPosLogAvg | LOGAVG aggregation of MetaWeb1241Web1299ProductInvPos meta factor using random log |
1763 | RandomLogHostMetaEpsHashShareNationalLanguageLogAvg | LOGAVG aggregation of MetaEpsHashShareNationalLanguage meta factor using random log |
1764 | IsHttps | The document has a HTTPS protocol |
1765 | QueryUrlEditDistNormalizedByMaxlen | Levenshtein distance between the query and a URL of the form youtubecom/watch, normalized by the maximum of the query length and the URL length |
1766 | QueryUrlLCSNormalizedByQuery | The length of the longest common subsequence (LCS) between the URL and the query, normalized by the query length |
1767 | TolokaBasedPornQueryClassificationSigmoid | Sigmoid-normalized value of the text porn classifier based on Toloka assessments |
1768 | TolokaBasedPornQueryClassificationBinary | Binarized value of the text porn classifier based on Toloka assessments |
1769 | WebClassificationBasedPornQueryClassification | Value of the text porn classifier based on the web classifier and additional dictionaries |
1770 | WebClassificationBasedPornQueryClassificationBinary | Value of the text porn classifier, binarized with the use of networks, based on the web classifier and additional dictionaries |
1771 | DirtyLanguageInQuery | The presence of obscene vocabulary in the query. 0 – absent, 0.5 – mild, 1 – hard |
1772 | PornMarkersInQuery | The presence of porn markers in the query (0 – present, 1/3 – absent, 1 – ‘gray’ query) |
1773 | DssmPantherTerms | |
1774 | AdultnessProd | Document-level porn classifier; features based on the document text |
1775 | AdultnessUrl | Document-level porn classifier; features based on the document URL |
1776 | NastyImageValue | Document-level porn classifier; features based on the document's images (information is taken from the image index) |
1777 | NastyVideo | Document-level porn classifier; features based on the document's video (information is taken from the video index) |
1778 | NastyHost | Host-level porn classifier; features based on how pornographic the queries are for which the host was shown and clicked. |
1779 | OfficialInQuery | The presence of the word "official" in the lemmatized query |
1780 | WikiInQuery | The presence of the word "Wikipedia" in the lemmatized query |
1781 | NotInQuery | The presence in the lemmatized query of the word "not" or words similar in meaning |
1782 | PriceInQuery | The presence in the lemmatized query of the words "buy", "price", or words similar in meaning |
1783 | DwellTimeSumFractionPercentale25Aggr | Host return factor. Percentile aggregation with coefficient 0.25 of the DwellTimeSumFraction feature |
1784 | DocFromQuickMed | The document came from Quickmed |
1785 | AverageReturnTimePercentale99Aggr | Host return factor. Percentile aggregation with coefficient 0.99 of the AverageReturnTime feature |
1786 | AverageReturnTimePercentale97Aggr | Host return factor. Percentile aggregation with coefficient 0.97 of the AverageReturnTime feature |
1787 | AverageReturnTimeGreaterFraction99Aggr | Host return factor. GreaterFraction aggregation with coefficient 0.99 of the AverageReturnTime feature |
1788 | AverageLogReturnTimePercentale99Aggr | Host return factor. Percentile aggregation with coefficient 0.99 of the AverageLogReturnTime feature |
1789 | AverageLogReturnTimeGreaterFraction90Aggr | Host return factor. GreaterFraction aggregation with coefficient 0.9 of the AverageLogReturnTime feature |
1790 | FirstClickDwellTimeLessFraction5Aggr | Host return factor. LessFraction aggregation with coefficient 0.05 of the FirstClickDwellTime feature |
1791 | AverageVisitsPer3HoursWeightedAverageAggr | Host factor. WeightedAverage aggregation of the AverageVisitsPer3Hours feature |
1792 | MedicalHostQuality | Medical host quality. |
1793 | HasTurboApp | The document has a turbo page for Desktop platforms. Updates on top of the base are delivered through SaAS. |
1794 | AverageDwellTimePerHourWeightedAverageAggr | Host factor. WeightedAverage aggregation of the AverageDwellTimePerHour feature |
1795 | AverageDwellTimePer3HoursLessFraction10Aggr | Host return factor. LessFraction aggregation with coefficient 0.1 of the AverageDwellTimePer3Hours feature |
1796 | AverageDwellTimePerWeekMaxAggr | Host factor. Max aggregation of the AverageDwellTimePerWeek feature |
1797 | QueryDoppMedianDwelltime | Median dwell time for the query in history. Dwell time is capped at 6000. The query is normalized by doppelgangers |
1798 | QueryDoppMultipleClicksShows | The number of impressions of the query with more than one click in history. The query is normalized by doppelgangers |
1799 | QueryDoppMultipleClicksProbability | The share of impressions with more than one click among all impressions in history. The query is normalized by doppelgangers |
1800 | RandomLogOwnerRandomLogWordMaxMetaNumUrlsPerHostFixedPerc90 | Owner aggregation of RandomLogWordMaxMetaNumUrlsPerHostFixed web factor using random log, aggregation type is PERCENTALE_90 |
1801 | RandomLogOwnerMetaWeb1099Web1219ProductInvPosLogAvg | Owner aggregation of MetaWeb1099Web1219ProductInvPos meta factor using random log, aggregation type is LOGAVG |
1802 | RandomLogOwnerDssmDwelltimeRegChainTrainedEmbeddingPerc90 | Owner aggregation of DssmDwelltimeRegChainTrainedEmbedding meta factor using random log, aggregation type is PERCENTALE_90 |
1803 | RandomLogOwnerDssmRandomLogQueryAvgHasPaymentsLogAvg | Owner aggregation of DssmRandomLogQueryAvgHasPayments web factor using random log, aggregation type is LOGAVG |
1804 | RandomLogOwnerUBLongPeriodBrowseFrcPerc90 | Owner aggregation of UBLongPeriodBrowseFrc web factor using random log, aggregation type is PERCENTALE_90 |
1805 | RandomLogOwnerMetaUrlChildrenCntLogAvg | Owner aggregation of MetaUrlChildrenCnt meta factor using random log, aggregation type is LOGAVG |
1806 | RandomLogOwnerMetaRmsDifferentInternalLinksPerc25 | Owner aggregation of MetaRmsDifferentInternalLinks meta factor using random log, aggregation type is PERCENTALE_25 |
1807 | RandomLogOwnerRandomLogWordMaxHasNoTrPerc90 | Owner aggregation of RandomLogWordMaxHasNoTr web factor using random log, aggregation type is PERCENTALE_90 |
1808 | RandomLogOwnerMetaResidUSLongPeriodUrlWinsProbRmse | Owner aggregation of MetaResidUSLongPeriodUrlWinsProb meta factor using random log, aggregation type is RMSE |
1809 | RandomLogOwnerPornoQueryLogAvg | Owner aggregation of PornoQuery web factor using random log, aggregation type is LOGAVG |
1810 | RandomLogOwnerNationalLanguageLogAvg | Owner aggregation of NationalLanguage web factor using random log, aggregation type is LOGAVG |
1811 | RandomLogOwnerPercentVisibleContentPerc90 | Owner aggregation of PercentVisibleContent web factor using random log, aggregation type is PERCENTALE_90 |
1812 | RandomLogOwnerMetaWeb1241Web1299ProductInvPosPerc25 | Owner aggregation of MetaWeb1241Web1299ProductInvPos meta factor using random log, aggregation type is PERCENTALE_25 |
1813 | RandomLogOwnerLinkAnnFloatMultiplicityAttenV1Bm15K001LogAvg | Owner aggregation of LinkAnnFloatMultiplicityAttenV1Bm15K001 web factor using random log, aggregation type is LOGAVG |
1814 | RandomLogOwnerUBLongPeriodLeavesCntRmse | Owner aggregation of UBLongPeriodLeavesCnt web factor using random log, aggregation type is RMSE |
1815 | RandomLogOwnerNumLinksFromMPLogAvg | Owner aggregation of NumLinksFromMP web factor using random log, aggregation type is LOGAVG |
1816 | RandomLogOwnerDssmRandomLogQueryAvgDifferentInternalLinksPerc25 | Owner aggregation of DssmRandomLogQueryAvgDifferentInternalLinks web factor using random log, aggregation type is PERCENTALE_25 |
1817 | RandomLogOwnerIsOrgRmse | Owner aggregation of IsOrg web factor using random log, aggregation type is RMSE |
1818 | RandomLogOwnerQSegmentsBM25Max | Owner aggregation of QSegmentsBM25 web factor using random log, aggregation type is MAX |
1819 | RandomLogOwnerSegmentAuxAlphasInTextRmse | Owner aggregation of SegmentAuxAlphasInText web factor using random log, aggregation type is RMSE |
1820 | RandomLogOwnerRandomLogQueryDwelltimeWeightedAvgUrlDomainFractionLogAvg | Owner aggregation of RandomLogQueryDwelltimeWeightedAvgUrlDomainFraction web factor using random log, aggregation type is LOGAVG |
1821 | RandomLogOwnerRandomLogWordSkipStopWordsMaxIsDesktopRequestLogAvg | Owner aggregation of RandomLogWordSkipStopWordsMaxIsDesktopRequest web factor using random log, aggregation type is LOGAVG |
1822 | RandomLogOwnerVisitsFromWikiRmse | Owner aggregation of VisitsFromWiki web factor using random log, aggregation type is RMSE |
1823 | RandomLogOwnerIsTextRmse | Owner aggregation of IsText web factor using random log, aggregation type is RMSE |
1824 | RandomLogOwnerDBMSubstantiveMax | Owner aggregation of DBMSubstantive web factor using random log, aggregation type is MAX |
1825 | RandomLogOwnerDaterStatsAverageSourceSegmentRmse | Owner aggregation of DaterStatsAverageSourceSegment web factor using random log, aggregation type is RMSE |
1826 | RandomLogOwnerIsMobileBeautyLogAvg | Owner aggregation of IsMobileBeauty web factor using random log, aggregation type is LOGAVG |
1827 | RandomLogOwnerLongClickSPMixMatchWeightedValuePerc90 | Owner aggregation of LongClickSPMixMatchWeightedValue web factor using random log, aggregation type is PERCENTALE_90 |
1828 | RandomLogOwnerFemAndMasNounsPortionLogAvg | Owner aggregation of FemAndMasNounsPortion web factor using random log, aggregation type is LOGAVG |
1829 | RandomLogOwnerTrigramsProbPerc90 | Owner aggregation of TrigramsProb web factor using random log, aggregation type is PERCENTALE_90 |
1830 | RandomLogOwnerDaterStatsYearNormLikelihoodPerc25 | Owner aggregation of DaterStatsYearNormLikelihood web factor using random log, aggregation type is PERCENTALE_25 |
1831 | RandomLogOwnerUrlPathAndParamsFractionMax | Owner aggregation of UrlPathAndParamsFraction web factor using random log, aggregation type is MAX |
1832 | QueryToTextAllAvg | The average value of the query factor over QueryToText linguistic boosting data; computed in the LingBoostQueryFeatures rule |
1833 | QueryToTextKnnAllAvg | The average value of the query factor over QueryToTextByXfDtShowKnn linguistic boosting data; computed in the LingBoostQueryFeatures rule |
1834 | XfDtShowAllTotalW | SUM / (SUM + 10) for the query factor over XfDtShow linguistic boosting data; computed in the LingBoostQueryFeatures rule |
1835 | XfDtShowQuantile01 | Quantile 0.1 of the query factor over XfDtShow linguistic boosting data; computed in the LingBoostQueryFeatures rule |
1836 | XfDtShowKnnQuantile10 | Quantile 0.1 of the query factor over XfDtShowKnn linguistic boosting data; computed in the LingBoostQueryFeatures rule |
1837 | XfDtShowKnnQuantile09 | Quantile 0.9 of the query factor over XfDtShowKnn linguistic boosting data; computed in the LingBoostQueryFeatures rule |
1838 | QfufAllTotalWeight | SUM / (SUM + 10) for the query factor over QFUF linguistic boosting data; computed in the LingBoostQueryFeatures rule |
1839 | QfufAllAvg | The average value of the query factor over QFUF linguistic boosting data; computed in the LingBoostQueryFeatures rule |
1840 | IsTasIx | The site is located on the TAS-IX network (relevant for Uzbekistan) |
1841 | DssmBoostingSerpSimilarityHardKMeans1Score | Dssm Boosting Score for SerpSimilarityHard model over 1-means centroids. |
1842 | PageQualityHost | Page quality aggregated by host (avg). |
1843 | IsRelevLocaleUZ | Relev_locale == UZ |
1844 | QueryDoppTimeFromPreviousPercentile25 | 25% time quantile from the previous request to the current one. The request is normalized by doppelgangers |
1845 | NeuroTextModelLongClickPredictorByWordAndBigramCountersWithSSHards | The output of a neural model trained to distinguish long clicks from other events. The model input is word and bigram counters computed over text streams (Title, Body, URL). |
1846 | IsMobileBeautyHost | Is this host adapted to mobile devices |
1847 | QfufFilteredByXfOneSeAllMaxFFieldSet2Bm15FLogK0001 | Linguistic boosting factor. Expansion type: QfufFilteredByXfOneSe (QFUF filtered by the XfOneSe DSSM model). Aggregation over all expansions. The greatest value of the factor. Computed over the union of the streams Url, Title, Body, CorrectedCtr, LongClick, OneClick, BrowserPageRank, SplitDwellTime, SamplePeriodDayFrc, SimpleClick, YabarVisits, YabarTime. Word-weight aggregation algorithm: BM15FLOG (BM15 Aggregation of Logarithm of Construction of Words). Normalization coefficient 0.001. |
1848 | QfufFilteredByXfOneSeAllMaxFFieldSet3BclmWeightedFLogW0K0001 | Linguistic boosting factor. Expansion type: QfufFilteredByXfOneSe (QFUF filtered by the XfOneSe DSSM model). Aggregation over all expansions. The greatest value of the factor. Computed over the union of the streams Title, Body, LongClick, LongClickSP, OneClick. Word-weight aggregation algorithm: BCLMWEIGHTEDFLOGW0. Normalization coefficient 0.001. |
1849 | QfufFilteredByXfOneSeAllMaxFFieldSetUTBm15FLogW0K00001 | Linguistic boosting factor. Expansion type: QfufFilteredByXfOneSe (QFUF filtered by the XfOneSe DSSM model). Aggregation over all expansions. The greatest value of the factor. Computed over a composite stream consisting of the tokenized URL and the document title. Word-weight aggregation algorithm: BM15FLOGW0. Normalization coefficient 0.0001. |
1850 | QfufFilteredByXfOneSeAllMaxFTitleBm15K01 | Linguistic boosting factor. Expansion type: QfufFilteredByXfOneSe (QFUF filtered by the XfOneSe DSSM model). Aggregation over all expansions. The greatest value of the factor. Computed over the document title. Word-weight aggregation algorithm: BM15. Normalization coefficient 0.1. |
1851 | QfufFilteredByXfOneSeTopSumWFSumWFieldSet2Bm15FLogK0001 | Linguistic boosting factor. Expansion type: QfufFilteredByXfOneSe (QFUF filtered by the XfOneSe DSSM model). Aggregation over the top 10 expansions (by factor value). Weighted sum of the factor weights, normalized by the total weight of the expansions. Computed over the union of the streams Url, Title, Body, CorrectedCtr, LongClick, OneClick, BrowserPageRank, SplitDwellTime, SamplePeriodDayFrc, SimpleClick, YabarVisits, YabarTime. Word-weight aggregation algorithm: BM15FLOG (BM15 Aggregation of Logarithm of Construction of Words). Normalization coefficient 0.001. |
1852 | QfufFilteredByXfOneSeTopSumWFSumWBodyMinWindowSize | Linguistic boosting factor. Expansion type: QfufFilteredByXfOneSe (QFUF filtered by the XfOneSe DSSM model). Aggregation over the top 10 expansions (by factor value). Weighted sum of the factor weights, normalized by the total weight of the expansions. Computed over the document body. The minimum window size that contains all query words, normalized by the number of words in the query. |
1853 | OriginalRequestWordsFilteredByDssmSSHardFieldSet1Bm15FLogK0001 | Factor for the filtered original query: the DSSM distance between the query without a given word and the original query is computed, and words are then cut off by a threshold. Computed over the union of the streams Url, Title, Body, Links, CorrectedCtr, LongClick, OneClick, BrowserPageRank, SplitDwellTime, SamplePeriodDayFrc, SimpleClick, YabarVisits, YabarTime. Word-weight aggregation algorithm: BM15FLOG (BM15 Aggregation of Logarithm of Construction of Words). Normalization coefficient 0.001. |
1854 | OriginalRequestWordsFilteredByDssmSSHardFieldSetUTBm15FLogW0K00001 | Factor for the filtered original query: the DSSM distance between the query without a given word and the original query is computed, and words are then cut off by a threshold. Computed over a composite stream consisting of the tokenized URL and the document title. Word-weight aggregation algorithm: BM15FLOGW0. Normalization coefficient 0.0001. |
1855 | DssmCtrEngSsHard | DSSM model trained on cross language CTRs using serp similarity hard miner. |
1856 | Removed_1856 | FractionOfPresentedInTitleWordsWithWeightsByDssmSSHardModel |
1857 | FractionOfPresentedInTitleWordsWithWeightsByDssmSSHardModel | For each query word, a weight is computed by the query-mutation method (the distance between the query with the word and the query without it). The sum of the weights of the words found in the title is divided by the sum of the weights of all words. |
1858 | MaxWeightOfAbsentInTitleWordsWithWeightsByDssmSSHardModel | For each query word, a weight is computed by the query-mutation method (the distance between the query with the word and the query without it). The maximum weight among the words absent from the document title is taken. |
1859 | NeuroTextModelLongClickPredictorByWordAndBigramCountersWithoutTitleWithSSHards | The output of a neural model trained to distinguish long clicks from other events. The model input is word and bigram counters computed over text streams (Body, URL). |
1860 | Removed_1860 | DaterAddTime80Hours |
1861 | DaterAddTime80Hours | Computed as (80 - x), where x is the age of the document in hours (continuous). Uses data from the RobotAddTime dater |
1862 | DaterAddTime10Days | Computed as (10 - x), where x is the age of the document in days (continuous). Uses data from the RobotAddTime dater |
1863 | DaterAge10Days | The difference between the current date and the document date as determined by RobotAddTime: 1 if the date equals the current date, 0 if the document is 10 days old or older or the date is undetermined |
1864 | XfOneSeKnnAllMaxWFMaxWFieldSet1Bm15FLogK0001 | Linguistic boosting factor. Expansion type: XfOneSeKnn (expansions nearest according to the DSSM model trained to predict XfDtShow). Aggregation over all expansions. The greatest weighted value of the factor, normalized by the maximum expansion weight. Computed over the union of the streams Url, Title, Body, Links, CorrectedCtr, LongClick, OneClick, BrowserPageRank, SplitDwellTime, SamplePeriodDayFrc, SimpleClick, YabarVisits, YabarTime. Word-weight aggregation algorithm: BM15FLOG (BM15 Aggregation of Logarithm of Construction of Words). Normalization coefficient 0.001. |
1865 | XfOneSeKnnAllMaxWFMaxWOneClickFullMatchValue | Linguistic boosting factor. Expansion type: XfOneSeKnn (expansions nearest according to the DSSM model trained to predict XfDtShow). Aggregation over all expansions. The greatest weighted value of the factor, normalized by the maximum expansion weight. TODO algorithm: the maximum weight of an annotation that fully matches the query. Computed over the OneClick stream. |
1866 | QueryToTextByXfOneSeKnnTopSumWFSumWBodyMinWindowSize | Linguistic boosting factor. Expansion type: QueryToTextByXfOneSeKnn (QueryToText expansions of XfOneSeKnn expansions). Aggregation over the top 10 expansions (by factor value). Weighted sum of the factor weights, normalized by the total weight of the expansions. Computed over the document body. The minimum window size that contains all query words, normalized by the number of words in the query. |
1867 | QueryToTextByXfOneSeKnnAllSumWFSumWFieldSet3BclmWeightedFLogW0K0001 | Linguistic boosting factor. Expansion type: QueryToTextByXfOneSeKnn (QueryToText expansions of XfOneSeKnn expansions). Aggregation over all expansions. Weighted sum of the factor weights, normalized by the total weight of the expansions. Computed over the union of the streams Title, Body, LongClick, LongClickSP, OneClick. Word-weight aggregation algorithm: BCLMWEIGHTEDFLOGW0. Normalization coefficient 0.001. |
1868 | IsInternationalDomain | Domain in the international zone |
1869 | IsMemorandumQuery | The query was recognized as having intent toward copyrighted works protected by the anti-piracy memorandum. |
1870 | HostVideoStevenson | The host contains pirated videos protected by the anti-piracy memorandum. |
1871 | HostVideoDistributor | The host contains a video protected by the anti-piracy memorandum. |
1872 | FreshHostAvg30DaysSurplus | The average freshness of the host over 30 days |
1873 | FreshHost30DaysPositiveSurplusRate | Share of documents with a positive freshness from the host for 30 days |
1874 | HostStevensonBinary | Stevenson |
1875 | HostStevensonUrlsPerClicks | Stevenson |
1876 | HostStevensonUrlsPerShows | Stevenson |
1877 | HostStevensonUrlsPerShows10 | Stevenson |
1878 | HostStevensonWeight | Stevenson |
1879 | VideoIntendancePredict | Transferred Ethos predicted classifier for marking on the relevance of video. |
1880 | PiracyPredict | Reduced ETHOS predictive classifier, trained on a synthetic sample of ‘query typical of a pirate site’ vs. ‘query typical of a site unrelated to piracy’ |
1881 | FREE_SLOT_1881 | In this slot, it was never a zero feature |
1882 | StevensonDssmPredictor | DSSM regression for separating memorandum and non-memorandum queries |
1883 | MemorandumPredict | Transferred Ethos predicted classifier trained to distinguish memorandum queries from random ones |
1884 | PiracyPredictDssm | DSSM regression on embeddings for separating piracy-specific and non-piracy queries |
1885 | ReformulationsLongestClickLogDt | DSSM model that predicts the logarithm of the longest click on the SERP. Negative examples are URLs from earlier queries by the same user, with no more than 7 minutes between queries (super-cords for reformulations) |
1886 | DocFromQuick | The document came from Quick but not from QuickRt |
1887 | DocFromQuickRt | The document came from QuickRt |
1888 | DocFromCallisto | The document came from Callisto |
1889 | LegalPlayers | Feature LegalPlayers from VideoIndex |
1890 | SocialNetworksPlayers | Feature SocialNetworksPlayers from VideoIndex |
1891 | StevensonPlayers | Feature StevensonPlayers from VideoIndex |
1892 | ReformulationsLongestClickLogDtEarlyBindingDssm | DSSM model with early binding, trained on reformulations, which predicts the logarithm of the longest click on the SERP. |
1893 | HasNewsAgencyRating | Rating of news agency from agencies.json > 0 (Yandex.News resource) |
1894 | WeekdayQueryProbability | Weekday query probability |
1895 | HostBizKernelPercentile | The quality indicator of the site from the point of view of factors about user behavior aggregated to the owners. |
1896 | HitContextsDssm | Neural network value for contexts of query hits in document text. Predicts relevance-all-8-years. Uses formula ussr-dump-20190719 prs-20190720 all-8-years [t > 0.25] CrossEntropy 20k 0.25 -S 0.8 -Z 1 predictions for learning. |
1897 | AntispamBan | Bans of Antispam from erf |
1898 | DssmReformulationsWithExtensions | DSSM model trained on a reformulation pool; in addition to the query itself, it receives the 4 XFDT extensions with the largest weights |
1899 | RandomLogHostTopClickedUrlsIsMobileRequestLogAvg | LogAvg statistic of the IsMobileRequest factor, aggregated over the host’s top-clicked URLs |
1900 | RandomLogHostTopClickedUrlsNanobtaniumQueryWordTitle5nDist2maxXMaxIsMobileRequestLogAvg | LogAvg statistic of the NanobtaniumQuerywordtitle5ndist2maxmax factor, aggregated over the host’s top-clicked URLs. |
1901 | AntispamBanGsm | Bans on gsm of Antispam from erf |
1902 | AntispamBanFresh | Bans on fresh of Antispam from erf |
1903 | RandomLogQueryAvgIsBlog | The average IsBlog value over the year. Calculated offline. |
1904 | HasTurboMobile | The document has a turbo page for the mobile platform. Updates on top of the base are delivered through SaaS. |
1905 | HasTurboDesktop | The document has a turbo page for the desktop platform. Updates on top of the base are delivered through SaaS. |
1906 | DssmFomula8YearsCe25Prediction | A model trained to predict an assessment of the USSR-DUMP-20190719 PRS-20190720 ALL-8-YEARS [T> 0.25] Crossentropy 20K 0.25 -s 0.8 -z 1. |
1907 | Removed_1907 | RandomCommercial |
1908 | RandomCommercial | ‘Random’ factor for commercial sites. |
1909 | UnexpectedTrashUrlQualityFresh | Neural document model for finding unexpected trash content (for ex -) |
1910 | RequestMultitokensAllMaxFUrlBclmMixPlainKE5 | Features calculated on url with request multitokens expansion |
1911 | RequestMultitokensAllSumW2FSumWUrlExactQueryMatchAvgValue | Features calculated on url with request multitokens expansion |
1912 | DssmFomula8YearsCe25PredictionRatings | A model trained to predict an assessment of the USSR-DUMP-20190719 PRS-20190720 ALL-8-YEARS [T> 0.25] Crossentropy 20K 0.25 -s 0.8 -z 1 and additionally trained on relevance assessments. |
1913 | QueriesRatioMorda2 | The share of queries for which the owner’s main page was shown, among all queries for which the owner was shown in the last week. |
1914 | SerpClicksByHopPart_0_30 | The share of visits from the SERP to documents located at 0 hops, over 30 days. |
1915 | QueriesAvgTop | The average position of the owner for queries over the last week. |
1916 | TrafgraphMobileDesktopSE_share | The ratio of mobile to desktop traffic from search engines. |
1917 | TrafgraphMobileDesktopOutAll_share | The ratio of mobile to desktop across all outgoing traffic. |
1918 | AvgIsOrg | The average value of the IsOrg query factor for queries with this owner in the last week. |
1919 | AvgQsFPunctBlanksRat | The average ratio of punctuation to all separators in the owner’s documents. |
1920 | FreshDetectorPredict | The value of the freshness detector calculated in the hippo. Always 0 with a detector value less than the threshold. |
1921 | HostVideoDistributorProd | The host contains a video protected by the anti-piracy memorandum. |
1922 | HostMemorandumWeight | Stevenson |
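Several factors in the table above (for example, row 1866, whose name ends in BodyMinWindowSize) are described as the minimum window size that includes all the words of the query, normalized by the number of words in the query. The sketch below illustrates that idea in Python with a standard sliding-window search. It is not Yandex's implementation: the function names, the whitespace tokenization, and the choice to divide the window length by the number of distinct query words are all assumptions made for illustration.

```python
from collections import Counter


def min_window_size(doc_words, query_words):
    """Length, in words, of the shortest span of doc_words that contains
    every distinct query word. Returns None if some word never appears."""
    needed = set(query_words)
    if not needed:
        return None
    counts = Counter()   # occurrences of needed words inside the window
    covered = 0          # how many distinct needed words the window holds
    best = None
    left = 0
    for right, word in enumerate(doc_words):
        if word in needed:
            counts[word] += 1
            if counts[word] == 1:
                covered += 1
        # Shrink from the left while the window still covers the query.
        while covered == len(needed):
            size = right - left + 1
            if best is None or size < best:
                best = size
            left_word = doc_words[left]
            if left_word in needed:
                counts[left_word] -= 1
                if counts[left_word] == 0:
                    covered -= 1
            left += 1
    return best


def normalized_min_window(doc_words, query_words):
    """Assumed normalization: window length divided by the number of
    distinct query words (1.0 means the words are adjacent)."""
    size = min_window_size(doc_words, query_words)
    if size is None:
        return None
    return size / len(set(query_words))


doc = "buy cheap red shoes online and buy shoes in red".split()
print(normalized_min_window(doc, ["buy", "red"]))  # 1.5
```

Under this reading, a value close to 1.0 means the query words sit next to each other in the text, while larger values mean they are spread further apart.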
👉Implications of the Yandex Data Breach on SEO
The Yandex data breach has disrupted the SEO landscape, offering fresh perspectives on how search engines prioritize content and rank web pages. It serves as a wake-up call for modern SEO practices and as a catalyst for professionals to reassess and refine their strategies armed with newfound intelligence.
👉Implications on Ranking Factor
The revelation of over 40 new quality-related ranking factors is poised to reshape search engine optimization strategies. SEO experts now have more profound insights into host reliability, page authority, and creating compelling, high-quality content that resonates with readers and search algorithms.
👉Implications on Over-Optimization
The Yandex leak sheds light on the adverse effects of over-optimizing websites. The leaked data underscores the risks of pushing SEO tactics too far, to the point where legitimate optimization becomes hard to distinguish from spam.
Websites crossing this threshold may be penalized, risking a significant drop in their search engine rankings. Marketers must now navigate a nuanced approach to SEO, striking a balance that maximizes visibility without triggering penalties. Failure to do so could result in violations and spam flags, negatively impacting a site’s online presence.
To adapt to this evolving landscape, marketers must reconsider traditional keyword-stuffing techniques and prioritize natural language and user-centric content instead. This shift is crucial to ensure content ranks well while aligning with the latest guidelines based on incidents like the Yandex leak.
👉Implications for Local Results
Moreover, the impact extends to local search results, prompting SEO professionals to reassess strategies for businesses targeting specific regions. Local queries now favor high-quality content and well-structured websites that effectively cater to local interests.
Businesses can enhance their visibility in local searches by optimizing for location-specific signals such as accurate metadata, clear address information, and locally relevant keywords. By prioritizing a seamless local user experience, businesses improve visibility and enhance their chances of success in community-based searches, especially as search engines increasingly prioritize user satisfaction.
👉Influence of On-Page Advertising
The incorporation of on-page advertising significantly impacts how Yandex evaluates a website. These adverts can sway user behavior signals, enhancing engagement and distinguishing your site from others. Strategically placing advertisements within your content can generate multiple traffic sources, aligning with the preferences of the Yandex algorithm.
To maximize the benefits of on-page ads, creating adverts catering to user search intent is crucial. This involves understanding the needs of your audience and providing ad content that adds value rather than disrupts their browsing experience. Given Yandex’s inclination towards websites with diverse traffic sources and robust user engagement, leveraging strategic on-page advertising can boost your site’s rankings.
👉Importance of URL Structure
The construction of URLs plays a subtle yet crucial role in SEO. Clear, concise URLs containing relevant keywords help search engines understand and index your content. A well-structured URL also improves the user experience by making links to your website easier to share and remember.
Crafting optimal URLs involves more than simply inserting keywords into the address. It requires establishing a logical hierarchy that search engine crawlers and users can navigate effortlessly. This entails using hyphens instead of underscores, maintaining brevity while ensuring descriptiveness, and avoiding unnecessary parameters that may confuse indexing or appear spammy.
Additionally, employing HTTPS to secure your site’s connection enhances trustworthiness, which is pivotal for users and augments the potential for better ranking in search results.
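The URL guidelines in this section (hyphens instead of underscores, short but descriptive paths, no unnecessary parameters) can be sketched in a few lines of Python. This is only an illustration: the helper names `slugify` and `build_url`, the six-word limit, and the `example.com` domain are assumptions, not rules taken from the leaked data.

```python
import re
import unicodedata


def slugify(title, max_words=6):
    """Turn a page title into a short, lowercase, hyphen-separated slug."""
    # Strip accents (e.g. "é" becomes "e"), then drop remaining non-ASCII.
    # Note: this removes Cyrillic entirely; Russian titles would need a
    # transliteration step first, which is outside this sketch.
    text = unicodedata.normalize("NFKD", title)
    text = text.encode("ascii", "ignore").decode("ascii").lower()
    text = text.replace("'", "")  # keep "beginner's" as one word
    # Underscores, spaces, and punctuation all act as word separators.
    words = re.findall(r"[a-z0-9]+", text)
    return "-".join(words[:max_words])


def build_url(base, *segments):
    """Join slugified segments into a clean hierarchical HTTPS-style path."""
    slugs = [slugify(segment) for segment in segments]
    path = "/".join(slug for slug in slugs if slug)
    return f"{base.rstrip('/')}/{path}"


print(build_url("https://example.com/", "Blog", "Yandex SEO Guide_2023"))
# https://example.com/blog/yandex-seo-guide-2023
```

Capping the number of words keeps the slug brief, and building the path from named segments mirrors the logical hierarchy that both crawlers and users navigate.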
👉Significance of Page-Level Factors
Page-level factors greatly influence how Yandex evaluates content quality and user intent. Yandex prioritizes high-quality, authoritative content, particularly favoring pages reminiscent of Wikipedia. Consequently, crafting meticulously researched, factually accurate articles can significantly improve your website’s ranking prospects. SEO efforts should concentrate on delivering value through each page and aligning content with user search queries.
Applying tf*idf optimization to balance keyword relevance against overuse is central to page-level optimization. The measure multiplies term frequency (how often a word appears in a document) by inverse document frequency (how rare the word is across a corpus of documents), so it gauges the significance of a word within a document relative to that corpus. Webmasters can align more closely with Yandex’s ranking preferences by adopting this approach, securing high-value inbound links, and crafting meta descriptions that succinctly state the page’s purpose.
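To make the tf*idf measure concrete, here is a minimal Python sketch of the textbook formula: term frequency within a document multiplied by the logarithm of the inverse document frequency across the corpus. The function name `tf_idf`, the tiny three-page corpus, and the unsmoothed natural-log variant are assumptions for illustration; the weighting a search engine actually uses is likely to differ.

```python
import math
from collections import Counter


def tf_idf(docs):
    """docs is a list of token lists. Returns one {term: score} dict per
    document, using tf = count / doc length and idf = ln(N / doc freq)."""
    n_docs = len(docs)
    doc_freq = Counter()
    for doc in docs:
        doc_freq.update(set(doc))  # count each term once per document
    scores = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc)
        scores.append({
            term: (count / total) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return scores


# Hypothetical corpus of three short pages, already tokenized.
pages = [
    ["yandex", "seo", "guide"],
    ["google", "seo", "tips"],
    ["seo", "ranking", "factors"],
]
for page_scores in tf_idf(pages):
    print(page_scores)
```

In this corpus the word "seo" appears on every page, so its score is zero everywhere, while a word such as "yandex" that appears on only one page scores highest there. That is the intuition behind avoiding keyword repetition: a term that is everywhere carries little distinguishing weight.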
👉User Behavior and Engagement
The behavior and engagement of users now hold considerable sway in how Yandex assesses the relevance and importance of web pages. Increased clicks, prolonged engagement with content, and active interaction indicate a higher likelihood of ranking better in search results.
This emphasis underscores website owners’ need to prioritize creating top-notch, captivating content that aligns with user interests, thereby fostering repeat visits.
Engaged users indicate a website’s value and pertinence to Yandex. Consequently, SEO strategies should focus on attracting visitors and delivering an exceptional user experience that encourages interaction with various page elements.
Metrics such as click-through rates from search results and time spent on pages directly impact rankings, underscoring the significance of aligning expertise with audience expectations.
👉Backlink Quality
The quality of backlinks has emerged as a paramount concern in SEO strategies, particularly following revelations from the Yandex leak. Backlinks from reputable websites enhance a site’s credibility and increase its prospects of securing higher rankings in search results.
Both Google and Yandex value the authenticity and relevance of backlinks, favoring websites that maintain a natural and diverse link profile.
Establishing a network of backlinks necessitates more than sheer quantity; these links must originate from diverse sources that resonate with the content. This approach discourages unethical SEO practices like link spamming. It promotes meaningful connections through guest blogging, active engagement on social media platforms like Facebook or LinkedIn, and compelling content that others find share-worthy.
Monitoring these efforts is vital—leverage tools like Google Analytics to evaluate the efficacy of backlinks in driving unique visitors to your official website.
👉Key Insights from the Yandex Data Breach for SEO Professionals
The recent data breach at Yandex offers SEO practitioners a valuable opportunity to gain insights into the underlying mechanics of search engine algorithms. This breach unveils crucial factors that influence rankings, prompting a reevaluation of SEO strategies for enhanced organic performance.
👉Understanding Google Ranking Factors
While Google’s algorithm has long been a mystery, the leaked information suggests that Yandex and Google rely on similar ranking criteria. Both search engines appear to prioritize high-quality content and user engagement. SEO professionals should therefore emphasize optimizing page-level elements, enhancing backlink quality, and improving user interaction metrics to improve SERP rankings.
👉Reevaluating SEO Practices
The revelation of over 40 quality-related factors underscores the need to reassess current SEO methodologies. It also emphasizes the importance of accurately optimizing content and prioritizing user experience to achieve better visibility in Google Search results.
👉Implications for Digital Marketing Ecosystems
These insights offer a unique glimpse into potential criteria utilized by search engine algorithms across various platforms, including Google Ads, YouTube, Twitter, TikTok, and Quora. This suggests that aligning SEO strategies with these factors could improve performance within the broader digital marketing landscape.
👉The Significance of User Experience in SEO
The cornerstone of effective SEO strategies lies in prioritizing user experience. A website that captivates visitors with intuitive navigation and valuable content tends to prolong their stay, decreasing bounce rates and increasing the likelihood of repeat visits. Such positive interactions send strong signals to search engines like Yandex and Google, indicating the website’s reliability as a source of information.
Creating an online environment that resonates with audiences requires a deep understanding of their needs and preferences. An efficient site architecture lets users locate the content they want without effort, which improves satisfaction and fosters engagement.
This emphasis on user behavior closely aligns with SEO principles, as search engines favor websites that adeptly address user queries, underscoring the connection between thoughtful user experience design and improved search result rankings.
👉The Role of Backlinks and Direct Traffic
Shifting the focus to backlinks reveals their pivotal role in SEO strategies. High-quality links from reputable sources can elevate a website’s authority and signal to search engines the value of its content. This notion is corroborated by insights from the Yandex data leak, which underscores the significance of backlink quality in determining page rank. Trustworthy backlinks validate credibility, propelling a website higher in search results.
Additionally, direct traffic holds sway over search rankings. Websites with a substantial volume of direct visits convey to search engines like Yandex their status as trusted authorities within their respective niches or industries. This trustworthiness can translate into enhanced organic performance, as evidenced by findings from the data leak.
However, it’s imperative to recognize that Yandex scrutinizes activities such as click manipulation, emphasizing the importance of genuine efforts in building direct traffic for sustainable SEO success.
👉Should the Yandex Data Breach Impact Your SEO Strategy?
The recent data breach at Yandex may appear distant if your focus lies solely within the realm of Google’s search algorithms. However, it underscores the importance of user behavior and machine learning in shaping search engine results.
Acknowledging these elements should prompt SEO professionals to reassess their optimization strategies, especially concerning click-through rates, dwell time, and creating high-quality content that genuinely engages users.
Adapting your approach could prove advantageous, as techniques effective for Yandex may also align with Google’s algorithmic updates. This realization emphasizes the significance of adhering to legitimate link-building methods and avoiding manipulative tactics like Private Blog Networks (PBNs).
Revising your SEO strategy should enhance the user experience by improving website navigation and loading speed, and by prioritizing compelling content that encourages organic clicks and social media shares.
A Deeper Dive into Yandex’s Algorithms: Functionality and Unique Features
Yandex is one of the most influential tech companies in Russia, and its algorithms are central to how it operates its search engine and related services. While the company has pioneered a number of innovative technologies, its algorithmic approach is both similar to and distinct from other search engines, like Google. Yandex’s algorithms have evolved to tackle the specific needs of the Russian internet ecosystem, considering factors like local language processing, regional trends, and different user behavior.
Yandex’s Approach to Semantic Search and Natural Language Processing (NLP)
One of the distinguishing features of Yandex’s search algorithms is their ability to understand the semantic meaning behind user queries. While Google has long been recognized for its focus on providing results based on keywords, Yandex’s search algorithms also heavily rely on semantic search, which attempts to interpret the meaning behind a user’s search rather than simply matching keywords to indexed content.
This is particularly important in Russian, as the language is highly inflected, meaning that words can change forms depending on their grammatical context. To accommodate these nuances, Yandex developed its “semantica” technology, which powers the company’s NLP (Natural Language Processing) efforts. This allows Yandex’s algorithms to better understand the context of searches and deliver more relevant results. In practice, this means that users can type queries in a more conversational style, and Yandex will be able to interpret the intent behind the words and provide more accurate, meaningful results.
Yandex and Artificial Intelligence (AI)
Artificial Intelligence (AI) plays a crucial role in how Yandex’s algorithms deliver results. The company’s MatrixNet machine learning algorithm, for example, is trained using vast amounts of data to improve search rankings over time. The AI at the core of MatrixNet uses various factors, including user interaction data, click-through rates, and the authority of the websites, to adjust rankings and improve accuracy. As the system gathers more data, it becomes better at predicting what results are likely to be the most relevant for users, thus providing a more personalized experience.
Additionally, Yandex has implemented AI in its advertising algorithms (Yandex Direct). By leveraging machine learning models, Yandex Direct can optimize campaigns based on real-time user behavior and automatically adjust ad placements to target the most relevant audience segments. This has revolutionized the way advertisers interact with consumers on the Yandex platform, contributing to higher ad performance and a more tailored advertising experience.
Yandex and User Experience: From Search to Social Media
Another key aspect of Yandex’s algorithms is their integration across the company’s full range of services. Unlike Google, which focuses primarily on search, Yandex’s ecosystem spans from search to e-commerce, social media, and even self-driving vehicles. Yandex Zen, for example, is an AI-powered content recommendation system that provides users with a personalized feed of articles, blogs, and news based on their browsing habits, search queries, and social media activity. By analyzing vast amounts of data, Yandex Zen can fine-tune content suggestions, keeping users engaged and improving the overall user experience.
Yandex has also integrated AI into its Yandex Maps, which provide not only standard maps and directions but also real-time data on traffic conditions, accidents, and route optimization. The algorithm that powers Yandex Maps uses machine learning to predict traffic patterns, helping users avoid congested roads and arrive at their destinations faster. This level of optimization is a result of Yandex’s ongoing work to improve the accuracy and utility of its algorithms, especially in the transportation and logistics space.
The Yandex Data Breach: What Went Wrong?
The Yandex data breach of 2024 represents one of the most significant security lapses in the company’s history. The breach exposed personal data from millions of users across Yandex’s various platforms, including Yandex Search, Yandex Zen, Yandex Direct, and more. As a company that handles vast amounts of personal and sensitive data, including browsing history, search queries, and financial information, the breach raised serious questions about how the company safeguards user data.
How Did the Yandex Data Breach Happen?
While Yandex has not provided full details about the breach, cybersecurity experts have pointed to several potential vulnerabilities that may have been exploited. One theory is that phishing attacks targeting Yandex employees were the starting point. Attackers may have gained access to internal systems by tricking staff into clicking on malicious links or downloading harmful attachments. Once the attackers gained access to Yandex’s internal network, they were able to infiltrate sensitive databases that housed user data.
Another possibility is that weaknesses in the company’s encryption protocols allowed hackers to bypass security measures and access encrypted data. While Yandex has reported using standard encryption methods to protect user information, advanced persistent threats (APTs) could have used sophisticated techniques to break these safeguards and exfiltrate sensitive data.
The Long-Term Consequences of the Data Breach
Reputation Damage
For any tech company, especially one as influential as Yandex, the reputation damage caused by a data breach can be severe and long-lasting. Users are now more cautious about entrusting their personal information to any company. Yandex’s breach has likely caused a ripple effect throughout the tech industry, reminding users of the inherent risks of living in a data-driven world. The breach undermines trust in the Yandex brand and its services, especially in the advertising sector, where user data is integral to creating targeted campaigns.
Legal and Regulatory Fallout
Given the nature of the breach and the type of data involved, Yandex could be subjected to regulatory scrutiny, especially under the General Data Protection Regulation (GDPR) in the European Union. Even though Yandex is based in Russia, it operates in many countries around the world, including within the EU, and is subject to these laws if it processes data from EU citizens. GDPR mandates strict measures for data protection, and if Yandex is found to have violated these provisions, it could face heavy fines, potentially amounting to billions of euros.
Financial Impact
Beyond fines and legal penalties, Yandex could also face significant financial losses as a result of the breach. The company’s stock value may drop, especially if users begin to pull back from its services or if advertisers lose confidence in the company’s ability to deliver secure services. Additionally, the cost of remediation efforts, including cybersecurity upgrades, public relations efforts, and legal expenses, will add up over time.
Impact on Yandex’s Algorithms
Following the breach, Yandex will likely re-evaluate how its algorithms access and process personal data. The company may adopt more stringent security protocols in an effort to rebuild user trust. This could involve stronger encryption techniques, greater transparency about data collection methods, and more control for users over their personal information.
Potential for Identity Theft and Fraud
The breach could also open the door to identity theft and other forms of cybercrime. Hackers with access to financial details, email addresses, and other personal information could target Yandex users with phishing schemes, credit card fraud, or blackmail. The company is now in the process of offering credit monitoring and identity protection services to those affected by the breach, but the damage to individuals’ personal security may extend beyond what the company can prevent.
How Yandex Is Responding to the Breach
To address the breach, Yandex has taken several steps. It has already notified affected users and offered identity theft protection services. The company has also begun collaborating with cybersecurity firms and law enforcement to track down the perpetrators and assess the full scale of the damage.
In terms of technical response, Yandex has initiated a security overhaul, which includes tightening its encryption protocols, improving its data access policies, and implementing stronger authentication measures for both users and employees. The company has also promised to audit its systems more frequently to identify and fix vulnerabilities before they can be exploited by attackers.
Lessons Learned: What Companies Can Take Away From the Yandex Breach
Yandex’s breach serves as a cautionary tale for all tech companies that collect and store user data. The incident underscores the importance of implementing strong cybersecurity measures and ensuring that all company employees are educated about potential phishing attacks and other forms of social engineering.
Proactive Cybersecurity Measures
Companies must adopt a proactive approach to cybersecurity, continuously monitoring for potential threats and investing in technologies that can detect breaches in real time. Tools like intrusion detection systems (IDS), multi-factor authentication (MFA), and regular penetration testing should be standard practice.
Transparency with Users
Trust is paramount when dealing with user data. Yandex’s response to the breach—offering transparency and identity theft protection—can serve as a model for other companies. Being upfront with users about what happened, how it affects them, and what steps are being taken to prevent future breaches is key to maintaining long-term customer loyalty.
Comprehensive Risk Management Plans
Companies must have comprehensive risk management strategies in place. This includes everything from incident response plans to crisis communication strategies, ensuring that the company can respond effectively to a breach without creating further panic or confusion.
Education and Awareness
Employees should be regularly trained on the latest cybersecurity threats and best practices. Many breaches start with an employee inadvertently compromising security, and providing regular training can mitigate these risks.
👉Conclusion
In summary, the Yandex data breach equips SEO professionals with deeper insights. Strategies previously veiled in mystery now call for a fresh perspective and more astute implementation.
This development underscores the importance of prioritizing user experience and content quality. Link-building practices and on-page optimizations must undergo scrutiny and adaptation. The SEO landscape has been permanently altered and is moving towards an era characterized by transparency and refined tactics.
Thatware | Founder & CEO
Tuhin is recognized across the globe for his vision to revolutionize the digital transformation industry with the help of cutting-edge technology. He won bronze for India at the Stevie Awards USA and has also won the India Business Awards and the India Technology Award. He has been named among the Top 100 influential tech leaders by Analytics Insights, a Clutch Global Front runner in digital marketing, and founder of the fastest growing company in Asia by The CEO Magazine, and he is a TEDx speaker.