Tag: TG_L2
(3674 ranking factors)
Factors |
---|
PR
web_production: 0
Weight: 0.182867833093047 Page Rank. The factor will be remarked.
|
News
web_production: 11
This is the news (determined by the characteristic ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayaformula/tekushichiekomponenty/klassificacionnye?v=tkd#h45859-3 Patterns in URL URL)))).
|
Cat
web_production: 13
This is a catalog (determined by the characteristic ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayafformula/tekushhiekomponenty/klassificacionnye? .
|
YaBar
web_production: 14
Weight: 0.027302374355601 Attendance from the bar - ((http://wiki.yandex-team.ru/andrejjkostjagin/yabarlog/hoststat data description)). The factor will be remarked.
|
Long
web_production: 15
Weight: -0.084798680877042 Long document (the longer the document, the greater the value of the factor).
|
LongQuery
web_production: 17
Weight: 0.030334786608805 The amount of IDF words of the request. The name does not reflect the essence: for example, for the request of 'Gadyach' this factor will be more than for the request of 'Moscow Peter Yekaterinburg Samara'.
|
PureText
web_production: 18
Long text without links.
|
Root
web_production: 19
This is a muzzle.
|
SubqueryThMatch
web_production: 23
Coincidence of thematic spectra of request and document. Request themes-the result of work ((http://wiki.yandex-team.ru/evgenijjkroxalev/subquery Rules of the sorcerer Subquerysearch)) The subject of the document is taken from Yandex-Catalog
|
FreshNewsDetectorPredict
web_production: 30
The value of the news detector calculated in the Hippo. Always 0 with a detector value less than the threshold.
|
LinkQuality
web_production: 35
Weight: -0.001564275785704 The quality of incoming links (the classifier of the bream) is broken, cm [405]
|
NumLinks
web_production: 37
The number of incoming links. Remembrance.
|
PopularQ
web_production: 38
The popularity of the request
|
RusLang
web_production: 40
The language of the document is Russian.
|
AddTime
web_production: 41
Weight: 0.006691168756865 The time of adding a page, more - a more old document; The root is placed from time displayed at the interval [0.1] so that 3+ years gives 1.
|
IsMainPage
web_production: 42
If the main page of the owner (most often a second -level domain, for example xxxx.ru), then the factor is 1. For bums, hosting, personal blogs, etc. (for example, Lifejornal, People.ru, etc.) - domains of the third level (such as xxxxx.narod.ru) will also have an equal factor 1.
|
AddTimeMP
web_production: 43
The time for adding the main page of the owner (host?) Will be remaped like Addtime.
|
QueryURLClicksPCTR
web_production: 45
How often they click in this URL for this request - CTR blasting for the correction factor
|
TextBM25
web_production: 46
Simple BM25 in text.
|
LinkBM25
web_production: 47
Simple BM25 for links, the weights of the braces are not taken into account.
|
TLBM25
web_production: 48
Weight: 0.031399776481102 Simple BM25 in text and links at the same time.
|
TLp1
web_production: 49
All the words of the request are in the text + links.
|
TxtPair
web_production: 53
Weight: -0.020921642736537 Simple BM25 in pairs of words - we take all pairs of words of the request and consider the number of their entry into the text of the document. In the quality of the weight of the pair we use the sum of the scales of words. It does not work if there is a stop-word in the request
|
LnkPair
web_production: 54
The same as txtpair, but for links; Link weights are not taken into account.
|
TxtBreak
web_production: 55
BM25 from the number of sentences in the document in which it occurs.
|
TxtHead
web_production: 56
Weight: -0.037878046829073 BM25 according to only in the heading.
|
TxtHiRel
web_production: 57
BM25 according to only with High Rel-bots ('significant', with the allocation (<b> ITP)).
|
WordCount
web_production: 59
Min (number of words of request/10, 1.f)
|
InvWordCount
web_production: 60
1 / quantity_lov_v_
|
HasNoQueryURLShows
web_production: 63
For this Urla, for this request, there is no information about clickness 1 - request or request -URLA in the click database, 0 - query URL in the clicks database
|
HasNoQueryShows
web_production: 64
Weight: 0.205699196177282 For this request, there is no information about the clickness of 1 - there is no request in the click database, 0 - the request is in the click database.
|
Hops
web_production: 65
The number of hops of Url inpans (such as less - closer to the muzzle, the lower the value (0 - the muzzle, 1 - from the muzzle cannot be reached, 0 <can get from the muzzle <1). Normal value for the root of the nosta 0.0039).
|
TxtPairEx
web_production: 67
Weight: -0.00667940021707 the presence of pairs of words in the exact form
|
TxtBreakEx
web_production: 68
Weight: 0.024006117828321 the number of sentences in which there are many words in the exact form
|
TxtHeadEx
web_production: 69
Weight: -0.03957553241619 the presence of words in the header in the exact form
|
TxtHiRelEx
web_production: 70
BM25 in the exact form
|
TxtBm25Ex
web_production: 71
Simple BM25 in the exact form.
|
TxtPairSy
web_production: 72
Weight: -0.022152880819573 the presence of pairs of words taking into account synonyms (> = txtpair)
|
TxtBreakSy
web_production: 73
Weight: -0.116819481337211 the number of sentences in which there are many words taking into account synonyms
|
TxtHeadSy
web_production: 74
Weight: -0.012919083353605 the presence of words in the header, taking into account synonyms
|
TxtHiRelSy
web_production: 75
Weight: -0.039215257302626 BM25 taking into account synonyms
|
TxtBm25Sy
web_production: 76
Simple BM25 taking into account synonyms.
|
QueryDOwnerClicksPCTR
web_production: 77
Weight: 0.219595036178226 How often they click in the URLs of this Domainid for this request - Ctr Domainid blasting for the correction factor
|
HasNoQueryDOwnerShows
web_production: 78
Weight: 0.160379344658431 For this Domainid, for this request there is no information about clickability 1 - request or request -owner is not in the clicks database, 0 - the request for clicks is in the database of clicks
|
OwnerClicksPCTR
web_production: 79
Weight: 0.231000481757815 The owner's clickness regardless of the request
|
Ukrainian
web_production: 95
It is equal to one if the site has a Ukrainian geoist (i.e. 1 - Ukrainian site)
|
IsBlog
web_production: 96
Page from the blogochosting
|
IsLivejournal
web_production: 97
Page with Livejournal.com
|
TextFeatures
web_production: 100
Weight: -0.016033504310566 The quality of the text. It is considered a rather complex formula
|
TextLike
web_production: 101
Weight: -0.094096848692163 Text quality (classifier Alekseeva)
|
YaBarCoreOwner
web_production: 104
The core of the audience of owners according to Yandex.Mrazusing
|
YaBarCoreHost
web_production: 105
The core of the audience of the hosts according to Yandex.Mrazusing
|
HasYaBarCore
web_production: 106
Does the host have a host
|
MusicQ
web_production: 108
The musicality of the request. The results of the sorcerer Anton Konygin.
|
DocLen
web_production: 110
Weight: -0.065128132003719 Document length in sentences
|
UrlLen
web_production: 111
Weight: -0.001158034315755 The length of the URL, divided by 5
|
QueryNonCommerciality
web_production: 112
The commercial request for the dictionary of phrases from Direct: 0 - maximum commercial, 1 - minimal.
|
HostSize
web_production: 113
Weight: -0.032004809610482 The size of the host named after Raskovalov in the documents without taking into account the takes (each double is taken into account in the factor by an independent document)
|
IsHTML
web_production: 114
Document type - HTML
|
GeoCityProxim
web_production: 127
Weight: 0.051465613603836 Means the coincidence of the region mentioned in the request and found sites at the level of areas. Binar factor: 1-rush, 0-no. It is based on ((http://wiki.yandex-team.ru/ Yandexposisk/ Classification of Sytraitniki/ Geographic/Sospolzanievpoysk Geoklassification of sites)))))))
|
PornoQuery
web_production: 130
Are there any words from Yweb/Pornofilter/Porno.query.
|
IsPorno
web_production: 131
Document from porn kitski
|
IsComm
web_production: 132
Weight: -0.066463228806236 A document from a commercial clay. Not used (depreded)
|
IsFake
web_production: 133
Fast document
|
IsSEO
web_production: 134
The page title contains commercial vocabulary. Not used (depreded)
|
IsWiki
web_production: 135
page from ru.wikipedia.org
|
IsEShop
web_production: 136
Commercial page (Classifier Savina)
|
GeoRegionProxim
web_production: 137
Weight: 0.082967074248567 |
HasNoAllWordsTRSy
web_production: 138
The document does not have all the words of the request (with an accuracy to a synonym)
|
NumWordsTRSy
web_production: 139
The percentage of the words of the request in the document (with an accuracy to a synonym)
|
HasAllWordsTRSy
web_production: 140
The document has all the words of the request (with an accuracy to a synonym)
|
NumWordsLR
web_production: 141
The percentage of the words of the request in the links (with an accuracy to a synonym)
|
HasAllWordsLR
web_production: 142
There are all the words of the request in the links (with an accuracy to a synonym)
|
PayDetectorPredict
web_production: 143
The value of the commerce detector calculated in the Hippo.
|
TxtInvPair
web_production: 144
Tr by pairs of words in the reverse order
|
LnkInvPair
web_production: 145
Lr by pairs of words of the request in the reverse order
|
TxtSkipPair
web_production: 146
Weight: -0.077504878926916 TR by pairs of words of the request through one word in texts
|
LnkSkipPair
web_production: 147
Lr by pairs of words of the request through one word in texts
|
NumWordsTRFm
web_production: 148
The percentage of all the words of the request in the text (with an accuracy to the form)
|
HasAllWordsTRFm
web_production: 149
The document has all the words of the request (with an accuracy to the form)
|
QDiversity
web_production: 150
Weight: 0.046783126435468 The degree of centralization of the points from which the request is set
|
QBlog
web_production: 151
Whether the request of blog vocabulary contains
|
NonCommercialQuery
web_production: 154
Binar non -profit request: Querynoncommerciality> 0.965.
|
TLen
web_production: 164
The length of the page text in the words tlen = map (number of words, 1/400), where map (x, y) = x*y / (1 + x*y)
|
IsUnreachable
web_production: 165
The page is unattainable by the links from the muzzle.
|
QueryURLClicksFRC
web_production: 168
the ratio of the number of clicks on this Urlu to all clicks on request
|
QueryDOwnerClicksFRC
web_production: 169
Weight: 0.214713693660762 the ratio of the number of clicks on this Domainid to all clicks on request
|
QueryURLClicksPCTR_copy
web_production: 170
[Bug: A copy of factor 45] How often they click in this URL for this request - CTR blasting for a correction factor
|
DoppQueryUrlSessionClicksFRCCity
web_production: 171
What part (on average by the session) from the user Urlov’s user, this URL user, who has been completed to it, is this URL. It is considered to be user sessions.
|
QueryURLClicksPCTR_Reg
web_production: 172
How often do they click in this URL for this request - CTR blasting for the correction factor, by small regions from Relev_regions.web.txt
|
QueryDOwnerClicksPCTR_Reg
web_production: 173
Weight: 0.047914113074106 How often they click in the URLs of this Domainid for this request - Ctr Domainid to the correction factor, by small regions from Relev_regions.web.txt
|
QueryURLClicksFRC_Reg
web_production: 174
Weight: 0.023610887210981 The ratio of the number of clicks on this Urlu to all clicks on request, by small regions from Relev_regions.web.txt
|
QueryDOwnerClicksFRC_Reg
web_production: 175
Weight: 0.118638180985299 The ratio of the number of clicks on this Domainid to all clicks on request, by small regions from Relev_regions.web.txt
|
QueryURLClicksCombo_Reg
web_production: 176
Query URL Clicks Combo, in small regions from Relev_regions.web.txt
|
QueryDOwnerClicksCombo_Reg
web_production: 177
Weight: 0.160420713540373 Query Download Clicks Combo, in small regions from Relev_regions.web.txt
|
TLp1All
web_production: 187
Weight: 0.055767877134775 Options for relevant factors taking into account the feet of words
|
TxtBM25AttenSyn
web_production: 191
Weight: 0.075434934641649 Tr with discount for suggestions
|
IsForum
web_production: 196
URL satisfies forum_detector regularly
|
IsObsolete
web_production: 198
The URL has an ancient date. Ancient news are recognized. Factor 1 if there is a year in Url <= 2007.
|
HasPayments
web_production: 201
The page has a about 'payment SMS'.
|
EshopValue
web_production: 203
Weight: -0.123814718900663 Stage of the page
|
PornoValue
web_production: 204
Pornography of the page
|
CountersSearchTraffic1
web_production: 209
Weight: 0.024263431712643 Search traffic - transitions from search engines to the site (2nd formula)
|
CountersSearchTraffic2
web_production: 210
Weight: -0.057014032623374 Search traffic - transitions from search engines to the site (2nd formula)
|
QueryUrlLCS
web_production: 213
The largest total tuning of Urla and request, normalized by the length of Urla
|
OnlyUrl
web_production: 214
All coincidences are only in the URL, there are no coincidences in the text
|
GeoRelevRegionCity
web_production: 215
|
GeoRelevRegionRegion
web_production: 216
|
GeoRelevRegionCountry
web_production: 217
Weight: 0.084012276385059 Three levels of coincidence of the geography of the user and page
|
XLRGeoRelevRegionCity
web_production: 218
|
XLRGeoRelevRegionRegion
web_production: 219
|
GeoCountryProxim
web_production: 221
Weight: 0.01317157982937 Geographical proximity
|
IsNavQuery
web_production: 222
Is the request for navigation, on the clicking of the answers
|
QueryDOwnerYabarVisits
web_production: 226
Weight: 0.147136648195774 |
QueryDOwnerYabarVisitors
web_production: 227
Weight: 0.119512833156651 |
QueryDOwnerYabarAvgTime
web_production: 228
Weight: 0.122090633457258 The average for users Active continuous time of the user is (in second) on the host pages after the transition on request from the search engine (the factor depends on the pair (request, Domattr)).
|
QueryDOwnerYabarAvgTime2
web_production: 229
The average for users Active continuous time of the user is (in second) on the host pages after the transition on request from the search engine (the factor depends on the pair (request, Domattr)). In the inside of the Yandex. Bara/elements/browser counter
|
QueryDOwnerYabarAvgActions
web_production: 230
The average for users is the number of active actions (clicks, clicks) with the continuous finding of the user on the host pages after the transition on request from the search engine (the factor depends on the pair (request, Domattr)). . In the inside of the Yandex. Bara/elements/browser counter
|
QueryUrlYabarVisits
web_production: 231
|
QueryUrlYabarVisitors
web_production: 232
The number of unique visitors from search engines for a specific request
|
QueryUrlYabarAvgTime
web_production: 233
The average for users Active continuous time of the user (in second) on the page after the transition on request from the search engine (the factor depends on the pair (request, URL)).
|
QueryUrlYabarAvgTime2
web_production: 234
The average for users Active continuous time of the user (in second) on the page after the transition on request from the search engine (the factor depends on the pair (request, URL)). In the inside of the Yandex. Bara/elements/browser counter
|
QueryUrlYabarAvgActions
web_production: 235
The average for users is the number of active actions (clicks, keystrokes) on the page after the transition on request from the search engine (the factor depends on the pair (request, URL))
|
IsForeignQuery
web_production: 241
Request is not in Russian
|
PageRegionSizeIn
web_production: 243
Weight: 0.056552232052119 The size of the page of the page
|
PageRegionInvSizeIn
web_production: 244
Weight: -0.006950709230428 The factor is inversely proportional to the size of the page region
|
QueryRegionSize
web_production: 245
The size of the region of the request
|
QueryRegionInvSize
web_production: 246
The factor is inversely proportional to the size of the regional region
|
GeoGeometryProxim
web_production: 247
Weight: -0.000843495929565 The geographical proximity of the user and the site
|
YabarHostVisitors
web_production: 249
Weight: 0.085929172196314 The number of unique visitors, remarks exponentially
|
YabarHostSearchTraffic
web_production: 250
Weight: 0.00667848123376 The share of traffic from search engines
|
YabarHostInternalTraffic
web_production: 251
Weight: 0.071417326810502 The share of suits to the site is not by links (set with hands or from bookmarks)
|
YabarHostAvgTime
web_production: 252
Weight: -0.007634608393132 average for users Active continuous time for user finding (in sec) on the host pages
|
YabarHostAvgTime2
web_production: 253
Weight: 0.074172193125966 The average for users Active continuous time of the user (in second) on the pages of the host. In the inside of the Yandex. Bara/elements/browser counter
|
YabarHostAvgActions
web_production: 254
Weight: 0.127979729953137 The average for users is the number of active actions (clicks, clicks) with the continuous finding of the user (in second) on the pages of the host.
|
YabarHostBrowseRank
web_production: 255
Implementation of the algorithm described in the article ((http://wiki.yandex-team.ru//h.yandex.net/?http%3A%2F%2FreseRosoft.microsoft.com%2fen-US%2FPEOPLIULIUUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUUP032-LIUUUU .pdf http://research.microsoft.com/en-us/people/tyliu/fp032-liu.pdf))
|
YabarUrlVisits
web_production: 256
Weight: 0.067151098341326 Varla's attendance according to I-Bara
|
YabarUrlVisitors
web_production: 257
Weight: 0.051057813309267 The number of unique visitors to Urla
|
YabarUrlAvgTime
web_production: 258
Weight: 0.003890338237824 The average for users time is the user on the page. It is read as the difference between neighboring transitions.
|
OwnerSatisfied4Rate
web_production: 259
Weight: 0.102548297661617 This is the SEA factor = s4_r/ (k_r+10) where S4_R is the number of clicks> 180 sec, k_r - the total number of clicks. It is considered taking into account reformalization.
|
OwnerSatisfied4Rate_Reg
web_production: 260
This is the SEA factor = s4_r/ (k_r+10) where S4_R is the number of clicks> 180 sec, k_r - the total number of clicks. It is considered taking into account reformalization. Localized version
|
UrlQueryVariety
web_production: 261
The degree of variety of requests for which this Urla click
|
DocIdfSum_broken
web_production: 263
IDF for various parts of the document, broken, are not used
|
TitleIdfSum_broken
web_production: 264
Weight: 0.070074395872424 IDF for various parts of the document, broken, are not used
|
HeadingIdfSum_broken
web_production: 265
Weight: 0.061031422056552 IDF for various parts of the document, broken, are not used
|
NormalTextIdfSum_broken
web_production: 266
IDF for various parts of the document, broken, are not used
|
AuxTextBM25
web_production: 268
BM25 for the user region for localized queries, for the unflapped in Cuba, is a country. The texts of the queries sent for the regions can be viewed in Relev_regions.txt in the sorcerer
|
AuxLinkBM25
web_production: 269
The same for lingonic relevance
|
CommLinksSEOHosts
web_production: 270
Weight: -0.180963639077109 The share of incoming corrupt links. The algorithm for recognition of commercial links is implemented. The factor will be remarked to [0.1] if the share of such links is 50%, otherwise 0. ((http://wiki.yandex-team.ru/svetlanashorina/topseolinks selection of wound sites))))))
|
CommLinksSEOHostsPornoQuery
web_production: 271
Previous factor multiplied by Pornoquery
|
CommLinksSEOHostsNonComm
web_production: 272
Weight: 0.0033634994869 ComMlinksseohosts factor multiplied by Noncommercialquery
|
TovarCategoryQuery
web_production: 273
The request mentions the product category. Not used (depreded)
|
TovarCategoryVendor
web_production: 274
The request mentions a vendor. Not used (depreded)
|
Diversity2
web_production: 275
Weight: 0.001181036676865 Geographical distribution of the request
|
NightQuery
web_production: 276
The request is set mainly at night
|
MorningQuery
web_production: 277
Weight: -0.013510450334814 The request is set mainly in the morning
|
DayQuery
web_production: 278
The request is given mainly in the afternoon
|
EveningQuery
web_production: 279
The request is set mainly in the evening
|
HourDiversity
web_production: 280
The severity of the querial tasks at different times of the day
|
SubqueryThMatchA
web_production: 282
Weight: 0.178646516342524 Coincidence of thematic spectra of request and document. Request themes - the result of work ((http://wiki.yandex-team.ru/evgenijjkroxalev/subquery Rules of the sorcerer Subquerysearch)) The subject of the document is determined by the automatic classifier
|
OwnerSDiffClickEntropy
web_production: 286
Weight: -0.017928063556114 Entropy - distribution of clicks
|
OwnerSDiffShowEntropy
web_production: 287
Weight: 0.032525279432611 Entropy - distribution of shows
|
OwnerSDiffCSRatioEntropy
web_production: 288
Weight: -0.01129676986565 Entropy - Distribution of clique/shows.
|
XPornoQuery
web_production: 291
Classifier of Porn Causions, another dictionary than Pornoquery
|
GeoCountryCountryProxim
web_production: 293
The geographical proximity of the country of the site and the country of request
|
UrlDomainFraction
web_production: 294
Weight: 0.564095297143887 Coating domain three -bouqu and request. (Chelyabinsk lottery - Chelloto. We translate a request to translite, find the three -book that are covered (Che, Hel, Lot, Olo), we look at what share of all three -bouquets are covered)
|
UrlPathAndParamsFraction
web_production: 295
Weight: -0.162220616846705 The same as the previous factor, but about the entire Url except the domain
|
SpecificalQuery
web_production: 296
The request is local-specific. The request is often reformulated with the obvious task of the region. ((https://ml.yandex-team.ru/archive/thread1433892/#Message1433892 more))
|
LnkBreak
web_production: 302
Weight: 0.078872214489662 Analogs of the corresponding text factors for links. BM25 from the number of links in which a coincidence occurred.
|
LnkBm25Ex
web_production: 303
Simple BM25 in the exact form in link texts
|
LnkPairSy
web_production: 304
Weight: 0.046891090311905 The presence of pairs in the links of the words, taking into account synonyms
|
LnkBrkSy
web_production: 305
Weight: 0.035447186193336 The number of links passed the threshold
|
LnkBm25Sy
web_production: 306
Simple BM25 by links taking into account synonyms
|
VideoQuery
web_production: 307
Request about the video
|
OwnerClicksPCTR_Reg
web_production: 308
Weight: 0.166327421401765 The owner's clickness regardless of the request, separately in the regions
|
OwnerSDiffClickEntropy_Reg
web_production: 309
Weight: -0.160285061981584 Entropy is the distribution of clicks. Regionalized
|
OwnerSDiffShowEntropy_Reg
web_production: 310
Weight: 0.004768007631846 Entropy is the distribution of shows. Regionalized
|
OwnerSDiffCSRatioEntropy_Reg
web_production: 311
Weight: -0.023916010788926 Entropy - distribution of clique/shows. Regionalized
|
Adultness
web_production: 312
equals 2 * NastyContent
|
HostAdultness
web_production: 313
equals 2 * NastyContent
|
IsCom
web_production: 315
Weight: 0.276250497243267 Domna in Zone .com
|
IsUa
web_production: 316
Domain in the .ua zone
|
IsNotRu
web_production: 317
Weight: 0.081289466115302 Domain is not in the .ru zone
|
Poetry
web_production: 319
The poetry of the document
|
PoetryQuad
web_production: 320
The maximum poetry of the quatrain
|
EngLang
web_production: 321
Document language - English
|
CyrLang
web_production: 327
The language of the document is Cyrillic
|
GeoRegionalityU
web_production: 328
Requestful factors - the result of work ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayaformula/tekushhiekomponenty/georegionality classifier of geolocalization of the request)))))))))))))
|
GeoRegionalityR
web_production: 329
R- Georelevan - regional results in the issuance could be useful, but nothing more
|
GeoRegionalityV
web_production: 330
V- geovital - regional issuance is of fundamental importance
|
UrlHasNoDigits
web_production: 331
There are no numbers in Urla
|
SynS1
web_production: 334
Show how much the text is unnatural from the point of view of the Russian language. Assessment of how much the text of the document can be considered as a generated synonymizer or automatic. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAfermula/tekushhiekomponenty/antispam?v=1il#h58953-2 more))
|
SynFLremap1
web_production: 335
Weight: 0.002431406823392 Show how much the text is unnatural from the point of view of the Russian language. Assessment of how much the text of the document can be considered as a generated synonymizer or automatic. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAfermula/tekushhiekomponenty/antispam?v=1il#h58953-2 more))
|
SynFLremap2
web_production: 336
Weight: 0.08033186404617 Show how much the text is unnatural from the point of view of the Russian language. Assessment of how much the text of the document can be considered as a generated synonymizer or automatic. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAfermula/tekushhiekomponenty/antispam?v=1il#h58953-2 more))
|
OwnerSessNormDuration
web_production: 337
Weight: 0.126700168643196 ND/K normalized time to click
|
UrlSessNormDurRate
web_production: 338
Weight: 0.025806639721603 nd/i
|
QueryDOwnerSessNormDuration
web_production: 339
CONTRY / K
|
QueryDOwnerWeightClick
web_production: 340
Weight: 0.202186193546053 w/k
|
QueryDOwnerOnlyClickRate
web_production: 341
Weight: 0.185032224423923 o/i
|
QueryDOwnerClickSummary
web_production: 342
Weight: 0.077454131996933 Selected formula
|
QueryDOwnerSatisfied4Rate
web_production: 343
Weight: 0.148292222594522 r_s4b/(r_k + 10)
|
SyntQuality
web_production: 344
Weight: 0.010872234578071 Does the request have a complete syntactic analysis
|
PageDate
web_production: 345
Weight: -0.034716206980983 The date of the document that is registered on the page is remarkable
|
HasTextPos
web_production: 350
The document has textual relevance
|
SynPercentBadWordPairs
web_production: 353
An indicator of the unnaturalness of the text from the point of view of the Russian language. The number of bad pairs of words in the text, transferred to the segment [0.1] according to the Z/(Z+10) formula
|
SynNumBadWordPairs
web_production: 354
The proportion of bad steam among all found in the table: Z/(X+1), where Z is the number of bad couples in the text, and X is (http://wiki.yandex-team.ru/evgenijgrechnikov/testsynonimizers of 2000-navigable )) steam
|
NumLatinLetters
web_production: 355
Weight: -0.086731079136512 The number of Latin letters in the text (not counting the markings), driven into [0.1] formula n/(n+100)
|
DocIdfSumFixed
web_production: 357
Previous factors - fixed
|
TitleIdfSumFixed
web_production: 358
Weight: 0.047164043400143 Previous factors - fixed
|
HeadingIdfSumFixed
web_production: 359
Weight: -0.068235863277027 Previous factors - fixed
|
NormalTextIdfSumFixed
web_production: 360
Previous factors - fixed
|
QueryURLClicksCombo
web_production: 361
factor cunningly combined from FRC and Pseudo-CTR
|
QueryDOwnerClicksCombo
web_production: 362
Weight: 0.369078039338024 factor cunningly combined from FRC and Pseudo-CTR
|
RusWordsInText
web_production: 364
The number of words in the text (the word is what the lemmeter selected) is displayed in [0.1] according to the formula x/(x+a)
|
RusWordsInTitle
web_production: 365
Weight: 0.03118624384934 The number of words of the Russian language in the title
|
MeanWordLength
web_production: 366
Weight: 0.019580616053835 The average length of the word
|
PercentWordsInLinks
web_production: 367
Weight: 0.057053549836014 The percentage of the number of words inside the tag <a> .. </a> from the number of all words
|
PercentVisibleContent
web_production: 368
Weight: -0.032828345615772 The percentage of the number of words outside the tags (outside the brackets <>) from the number of all words
|
PercentFreqWords
web_production: 369
Weight: -0.020210221137273 The percentage of the number of words, which are 200 the most frequent words of the language, from the number of all words of the text
|
PercentUsedFreqWords
web_production: 370
Weight: -0.063976585802142 The number used in the text 500 of the most popular words of the language, divided by 500
|
TrigramsProb
web_production: 371
Weight: -0.002170850269151 Logarithm of average geometric probabilities of trigrams in the text. (the probability of a trigram - the number of its meetings in the text, divided by the number of all trigrams) is displayed in [0.1] according to the formula -x (x+a)
|
TrigramsCondProb
web_production: 372
Weight: 0.026650508120317 Logarithm of the average geometric conditional probabilities of trigrams. The conditional probability of a trigram is its probability, divided by the probability of a bigram from the first two words
|
DoppDOwnerPCTR
web_production: 373
The analogue of the QueryDownerClickSpCTR factor differs from it in that the requests are normalized by doppelgage (details of such normalization -((http://staff.yandex-team.ru/finder Andrei Plakhov)), code/yandex/doppelganges)
|
DoppDOwnerPCTR_Reg
web_production: 374
The analogue of the QueryDownerClickspCTR factor differs from it in that the requests are normalized according to doppelgage (details of such normalization -((http://staff.yandex-team.ru/finder Andrei Plakhov)), code/yandex/Doppelganges). Localized to Relev_regions.web.txt
|
DoppUrlPCTR
web_production: 375
The analogue of the QueryurlClickSpCTR factor differs from it in that the requests are normalized by doppelgagers (details of such normalization - ((http://staff.yandex-team.ru/finder Andrei Plakhov)), code - Yandex/Doppelganges)
|
DoppUrlPCTR_Reg
web_production: 376
The analogue of the QueryurlClickSpCTR factor differs from it in that the requests are normalized by doppelgage (details of such normalization - ((http://staff.yandex-team.ru/finder Andrei Plakhov)), code - Yandex/Doppelganges). Localized to Relev_regions.web.txt
|
UrlBM25
web_production: 377
Weight: 0.066890922161289 BM25 on URL'U
|
HasBigPicture
web_production: 378
The page has a big picture
|
DaterAge
web_production: 380
Weight: -0.207437366708906 The difference between the current date and the date of the document defined by the dates, 1 - the date of the document is equal to the current, 0 - the document of 10 years or more, if the date is not defined, equal to 0. Attention! ((1 - dateraage)*60)^2 = age of the page In days.
|
NumNonRussianLinks
web_production: 384
The number of incoming links without Russian letters. Remembrance.
|
TextMaxForms
web_production: 385
Weight: -0.015212586791057 The maximum number of forms in all words of the request is max in all words of the request request_form_dl_lov/64
|
TextWeightedForms
web_production: 386
Weight: 0.022803839020796 The sum of the number of forms balanced by the scales of words - the amount in all words of the request of the number_form_dly_lov/64*weight_lov; REMAP species x/(1 + x).
|
TextForms
web_production: 387
Weight: -0.008656938143421 The unwarmed amount of the number of forms is the amount in all words of the request of the number_form_dl_lov/64/number_lov_
|
LinkMaxForms
web_production: 388
The maximum number of forms in all words of the request
|
LinkWeightedForms
web_production: 389
Weight: 0.096811143316269 Summer of the number of forms balanced by scales
|
LinkForms
web_production: 390
Undested amount of the number of forms
|
TextBM25_Fm_W1
web_production: 393
Analogues of the factors of the same name, the weight of the word = 1
|
TextBM25_Sy_W1
web_production: 394
Analogues of the factors of the same name, the weight of the word = 1
|
LinkBM25_W1
web_production: 395
Analogues of the factors of the same name, the weight of the word = 1
|
TLBM25_W1
web_production: 396
Analogues of the factors of the same name, the weight of the word = 1
|
NumeralsPortion
web_production: 399
The share of different parts of speech in the text. The share of numerals (among all words that managed to recognize part of the speech)
|
ParticlesPortion
web_production: 400
Weight: -0.012429221647235 The share of particles
|
AdjPronounsPortion
web_production: 401
Weight: -0.005976754416269 The share of pronoun adjectives
|
AdvPronounsPortion
web_production: 402
Weight: -0.001250755074786 The proportion of pronoun nouns
|
VerbsPortion
web_production: 403
The share of verbs
|
FemAndMasNounsPortion
web_production: 404
Weight: 0.011650367441796 The share of words that can be both masculine nouns and nouns of the feminine, but not of the middle kind, among all nouns (examples: 'hummingbirds' are an example of an indefinite kind that can be determined in two ways, 'Alexander' is homonymy).
|
LinkQualityFixed
web_production: 405
Weight: 0.013112575551553 Quality of incoming links (hauser classifier) corrected
|
HasLinkQualityFixed
web_production: 406
Considered LinkQuality for this page or not (did not think, if there are few links) corrected
|
NewLinkQualityFixed
web_production: 407
Weight: 0.021178675054476 Quality classifier of incoming links 2 corrected
|
IsOrg
web_production: 408
Weight: -0.018278527670779 The request is the name of the organization (example: Gazprom, Gazprom) ((http://wiki.yandex-team.ru/arsengadzhikurbanov/warees Description))
|
LongestText
web_production: 410
Weight: 0.069696682544392 The size of the largest text segment (from the factor [18] puretext)
|
SmartUkrainian
web_production: 411
|
SmartBelorussian
web_production: 412
|
DifferentInternalLinks
web_production: 414
Weight: 0.096447224363928 The number of different internal links to the page
|
HasDeterminedCities
web_production: 415
Weight: 0.165031403865939 The city is defined for the site
|
GeoRegionalityUNew
web_production: 416
Requestful factors - the result of the work ((http://wiki.yandex-team.ru/poiskovajaplatforma/lingvistika/zaprosnyjefefactory/localizovannyjezaprosya classifier of the request of the request)) - a new version of factors [328] - [328] - [328]: u - u - u - u - u - u - uceleless sites the request is meaningless;
|
GeoRegionalityRNew
web_production: 417
Запросные факторы - результат работы ((http://wiki.yandex-team.ru/PoiskovajaPlatforma/Lingvistika/ZaprosnyjeFactory/LocalizovannyjeZaprosy классификатора геолокализованности запроса)) - новая версия факторов [328]-[330]: R - георелевантные - региональные результаты в issuing could be useful, but nothing more;
|
GeoRegionalityVNew
web_production: 418
Requestful factors - the result of work ((http://wiki.yandex-team.ru/poiskovajaplatforma/lingvistika/zaprosnyjefefactory/localizovannyjezaprosya classifier of the request of the request)) - a new version of factors [328]: Vegetable fundamental importance.
|
UkrainPageRank
web_production: 420
Weight: 0.087122791007993 Ukrainian Page Rank
|
QClassDownload
web_production: 421
= 1 - v. Download formula. Class requests: download/watch online/play/photo/listen
|
QClassBrandnames
web_production: 422
The result of the classifier of the request - in the request there are words from the corresponding dictionary. brand
|
QClassDisease
web_production: 423
Medication Dictionary
|
QClassKak
web_production: 424
question
|
QClassMoscow
web_production: 425
Specific request for Moscow
|
QClassOAO
web_production: 426
Weight: -0.005085205304656 organization
|
QClassPorno
web_production: 427
porn
|
QClassTravel
web_production: 428
trips
|
PeriodicLinkDatesPercent
web_production: 430
Weight: 0.013900531929943 The frequency of links to the site
|
LinkAlmostPeriod
web_production: 431
The number of almost-periodic links
|
QDOwnerStatPower
web_production: 432
Weight: -0.025355498987515 The number of Owner shows on request, normalization x/(100 + x).
|
QUrlStatPower
web_production: 433
Weight: -0.194376876842978 The number of URL shows on request, normalization x/(100 + x).
|
HasLiRuCounter
web_production: 434
The presence of a LiveInternet meter
|
OwnerReqsPopularity
web_production: 435
Weight: 0.209508533629415 The popularity of Owner is in requests
|
PiracyDetectorPredict
web_production: 440
The value of the pirate detector calculated in the hippo.
|
FirstValidTs10Days
web_production: 442
It is considered as (10-x) where X is the return of the document in days (continuously) regarding the validity time of the document in Samovar
|
HostInQuery
web_production: 443
The host of the document is recognized in the request
|
VitalHostInQuery
web_production: 444
URL consists only of the host, which is recognized in the request
|
YandexNewsStoryUrl
web_production: 445
URL is the plot of Yandex News
|
RcSpylogUrlRationalSigmoidD1T240
web_production: 446
URL feature computed from rapid clicks spy_log counters with decay of 1 day
|
RcSpylogUrlRationalSigmoidD1T240Frozen
web_production: 447
URL feature computed from rapid clicks spy_log counters with decay of 1 day
|
RcSpylogUrlRationalSigmoidD0_5T30
web_production: 448
URL feature computed from rapid clicks spy_log counters with decay of 0.5 days
|
RcSpylogUrlRationalSigmoidD0_5T30Frozen
web_production: 449
URL feature computed from rapid clicks spy_log counters with decay of 0.5 day
|
TxtPair_W1
web_production: 454
Weight: -0.016932610010322 Simple BM25 in pairs of words - we take all pairs of words of the request and consider the number of their entry into the text of the document. Weight = 1. It does not work if there is a stop-word in the request
|
AuraDocLogShared
web_production: 455
Weight: -0.097686304848915 Logarithm of the number of shingles on which this document is not unique
|
AuraDocLogAuthor
web_production: 456
Weight: -0.097277529611975 Logarithm of the number of shingles on which this owner of the document is recognized as the author
|
AuraDocMeanSharedWeight
web_production: 457
Weight: -0.110593487056685 The average weight of non-ugly shingles of this document
|
RegHostRank
web_production: 467
Weight: 0.156712439907419 It reads in the same way as the Hostrank factor, but not on all the Owner graph, but on its subrack, consisting of Owner's in this region. Belonging to the region is determined by TLD, or by the presence of pages with this Owner in the index, about which the GEO or Geoa classifier says that they are from this region. Mapped in the same way as the Hostrank factor, from 0 to 1 with 256 gradations
|
RegIsWiki
web_production: 468
A document from the language section of Wikipedia corresponding to the user region
|
CountryPopularQ
web_production: 470
The popularity of the request within the country
|
CountryQDiversity
web_production: 471
Weight: 0.03718037385465 The degree of centralization of the points from which the request is set (inside the country)
|
CountryQDiversity2
web_production: 472
Weight: -0.00120970063307 Geographical distribution of the request within the country
|
CountryHour
web_production: 473
The hour at which this request is given the most
|
CountryHourDiversity
web_production: 474
The degree of severity of the querial tasks at different times of the day (inside the country)
|
NationalDomain
web_production: 476
The country of the document (domain) and the country of the user coincide ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayaFormula/tekushhiekomponijafaktorov#national
|
IsPornoAdvert
web_production: 477
On the Porn Advertising page
|
RcSpylogUrlRationalSigmoidD3T120
web_production: 478
URL feature computed from rapid clicks spy_log counters with decay of 3 days
|
CountryQueryRegionality
web_production: 479
Weight: 0.012081787040108 Country classifier of localization - how much the request implies the context of the country
|
NumSlashes
web_production: 480
Weight: 0.050576094170344 The number of slashes in Url
|
BM25FdPR_obsolete
web_production: 481
Weight: 0.054156294329288 BM25 with different parameters for different fields, including an incoming anchortekst. The weight of the text of the links included on the page is normalized depending on Delta Page Rank links
|
WatchVideo
web_production: 482
The presence of a built -in video player on the page
|
DownloadVideo
web_production: 483
Video for downloading
|
RcSpylogUrlRationalSigmoidD3T120Frozen
web_production: 484
URL feature computed from rapid clicks spy_log counters with decay of 3 days
|
RcSpylogUrlRationalSigmoidD14T300
web_production: 485
URL feature computed from rapid clicks spy_log counters with decay of 14 days
|
GskUrlModel
web_production: 487
Weight: 0.013412340418363 The factor is calculated from the text of Url using the classifier of sequences Quality/Seq/GSK
|
UrlTrigrams
web_production: 488
Weight: 0.064310714968383 Model with the training of each trigram on '+' and '-' Urlah. It does not depend on the request.
|
RcSpylogUrlRationalSigmoidD14T300Frozen
web_production: 489
URL feature computed from rapid clicks spy_log counters with decay of 14 days
|
RcSpylogAge
web_production: 490
Age of rapid clicks spy_log update, in seconds
|
RcSpylogFreshness
web_production: 491
Freshness of rapid clicks spy_log update
|
QueryCommercialityMx
web_production: 494
Weight: 0.103903118421863 The measure of 'commercial' request. It is a comprehensively calculated Matrixnet factor formula for the procurement vocabulary in direct + for user queries + add. Intensive dictionaries. Requests with intensity to buy a factor seeks to -> 1 commodity requests -> 0.6 with intensity cannot buy, reviews, etc. -> 0 ((http://wiki.yandex-team.ru/Faktorydljanovogokatorazaprosov Factors of the Classifier))) (HTTP : //wiki.yandex-team.ru/jandekspoisk/antispam/antiseo/klassifikATORCHESSKIXZAPROSOV STUNITURE OF HIM))
|
FieldLM
web_production: 495
Weight: 1.36522746e-7 Unigramal language model. Language is modeling according to the document, smoothed out by the general linguistic model. When building a model, the document uses information on which field of the document met the word request (Title, Head or Plain Text)
|
GeoCityUrlRegionCity
web_production: 496
The coincidence of geography, determined from the Url of the document and the city of the request (IP or LR)
|
GeoCityUrlRegionRegion
web_production: 497
The coincidence of geography, determined from the Url of the Document and the Request region (IP or LR)
|
GeoCityUrlRegionCountry
web_production: 498
Weight: -0.168645758020604 The coincidence of geography, determined from the Url of the document and the country of request (IP or LR). Actual for Russia and Ukraine.
|
GeoCityUrlGeoCityCity
web_production: 499
The coincidence of geography, determined from Ural Documents and the City in the request (GEOCITY rule)
|
PayAppDetectorPredict
web_production: 500
The value of the chopped commerce detector, calculated in the hippo.
|
OwnerNavQuota
web_production: 506
Weight: 0.189743110446303 The share of clicks for navigation requests
|
GeoRelevAlienCity
web_production: 507
Weight: 0.084699401575226 The result has a geography of the user at the city level ([415] == 1 && [215] == 0)
|
GeoVQueryInUserCity
web_production: 508
Request geovitality for results from the user region
|
GeoVQueryInAlienCity
web_production: 509
Request geovitality for the results is not from the user region
|
HostReliability
web_production: 510
Weight: -0.045942748393758 The share of the Urlov that respond without errors
|
DmozThemeMatchAll
web_production: 511
Coincidence of the thematic spectrum (according to DMOZ) request and document. The theme of the request is determined ((http://wiki.yandex-team.ru/jandekspoisk/zarubezhnyjjinternet/dmozqueryClassifier1 The rule of the sorcerer Dmoztheme))
|
DmozThemeMatchBest
web_production: 512
Coincidence of the thematic spectrum (according to DMOZ) request and document. The theme of the request is determined by the best result ((http://wiki.yandex-team.ru/jandekspoisk/zarubezhnyjjinternet/dmozqueryClassifier1 Rules for the sorcerer DmozTheme)) The subject of the document is determined by the automatic classifier
|
PageRegionCoverage
web_production: 516
Weight: -0.063761467432684 |
PageRegionSize
web_production: 517
Weight: -0.030877746812643 The size of the page of the page
|
PageRegionRelCoverage
web_production: 518
Weight: -0.000832706989751 |
RcSpylogFreshnessAtReq
web_production: 519
Freshness of rapid clicks spy_log update, calculated at the request time
|
IsGeo
web_production: 520
Weight: -0.027287688639737 It launches on the basic search under the name ISGEO the maximum weight of the meters of the gelator in the request. A geo-object is understood as an object of the category GEO, Geo1, Geoaddr, Geoaddr1, Landmark, Landmark1 (see ((http://wiki.yandex-team.ru/alekseysokirko/queryobjects kaovsky allocation))))))))))))))))))))))))))))))). wiki.yandex-team.ru/arsengadzhikurbanov/wares Read more))
|
IsMusic
web_production: 521
It launches for the basic search under the name ISMUSIC the maximum weight of the Music or Music1 category of the category of the Category in the request. (See ((http://wiki.yandex-team.ru/alekseysokirko/queryobjects SOM-OV)))). ((http://wiki.yandex-team.ru/arsengadzhikurbanov/warees more)))))))))))))))))))))
|
BclmLite
web_production: 522
Modification of the BCLM2 factor, lightweight for use in tulle. The main difference is that BCLMLite does not use absolute displacements of words relative to the beginning of the document. Instead, the factor works with the usual positions of the type <number of the_prising, position_v_production>. At the same time, the proximity between the words is taken into account only inside the sentence. (Http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayaFormula/tekushichiekomponenty/bclmlite bclmlite)))))))))))))
|
NearbyQuery
web_production: 523
When responding to a request, the results are important in close proximity ([pharmacies], [children's clinic])
|
CityQuery
web_production: 524
Weight: -0.091993052812036 When answering a request, the results within the city are important (the bulk of localized queries)
|
AdmQuery
web_production: 525
When responding to a request, the results from the region of the user ([airport], [dairy]) are important
|
NumLinksFromMP
web_production: 526
The number of incoming muzzle links
|
Soft404
web_production: 531
Page - '404' (share of tokens '404' in relation to the total number of tokens on the page)
|
RcSpylogUrlRationalSigmoidD1T240AtReq
web_production: 532
URL feature computed at the request time from rapid clicks spy_log counters with decay of 1 day
|
OwnerSessNormDuration_Reg
web_production: 535
ND/K normalized time to click
|
RcSpylogUrlRationalSigmoidD0_5T30AtReq
web_production: 536
URL feature computed at the request time from rapid clicks spy_log counters with decay of 0.5 days
|
QueryDOwnerSessNormDuration_Reg
web_production: 537
CONTRY / K
|
QueryDOwnerWeightClick_Reg
web_production: 538
Weight: 0.115262514353577 w/k
|
QueryDOwnerOnlyClickRate_Reg
web_production: 539
Weight: 0.179216994410993 o/i
|
QueryDOwnerClickSummary_Reg
web_production: 540
Weight: 0.054680076158058 Selected formula
|
QueryDOwnerSatisfied4Rate_Reg
web_production: 541
Weight: 0.07148176099275 r_s4b/(r_k + 10)
|
SegmentAuxAlphasInText
web_production: 542
Weight: 0.010581678208134 Number of letters in the AUX segment
|
SegmentAuxSpacesInText
web_production: 543
Weight: -0.011681967583253 The number of spaces in the AUX segment
|
SegmentContentCommasInText
web_production: 544
The number of commas in the Content segment
|
IsShop
web_production: 545
Weight: -0.133931985443449 Page is a store. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAformula/tekushhiekomponenty/opisanijafaktorov#SSHOP Description)). Not used (depreded)
|
UrlNGramsModel
web_production: 552
Weight: 0.055185094441888 Urlngramsmodel ranking factor in ERF
|
NationalLanguage
web_production: 553
The language of the document corresponds to the country's request
|
OwnerIsCommercial
web_production: 554
|
GeoCountryUrlRegionCountry
web_production: 555
|
GeoCountryUrlGeoCountry
web_production: 556
|
UrlQueryVariety_Reg
web_production: 559
Weight: -0.020628033510418 The degree of variety of requests for which this Urla click is read by regions
|
UrlSessNormDurRate_Reg
web_production: 560
Weight: 0.025328925792111 nd/i
|
LanguageGoodForTurkey
web_production: 562
The language of the document is one of the permissible for Turkey (Turkish, English, German, French, Arabic, Azerbaijani) or the document has zero length. In the search stage is calculated only for Isrealgeolocal requests.
|
GeoDispersion
web_production: 564
Document links dispersion
|
QueryDownerEnoughClicked
web_production: 565
Weight: -0.118870879105496 The number of clicks on the owner and the number of clicks on request more than 5
|
BM25FdPRFixed
web_production: 566
Weight: 0.058870258158539 BM25FDPR with standardization on the average length of the document, depending on the language of the document. ((http://wiki.yandex-team.ru/bm25frework test results.))
|
LanguagePopularity
web_production: 567
The popularity of the language of the document. Number from 0 to 1. (http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayaformula/tekushhiekomponenty/languaguaguagepopalarity)))))))
|
QueryDOwnerWeightedSumFRCAndBM25FdPRFixed
web_production: 568
Weight: 0.087850313290757 The amount of factors QueryDownerClicksFRC and BM25FDPRFIXED with scales 0.358449 and 0.184922, respectively. '565' in the name of the factor does not need to be perceived literally, it is Legashi or a typo.
|
RcSpylogUrlRationalSigmoidD3T120AtReq
web_production: 570
URL feature computed at the request time from rapid clicks spy_log counters with decay of 3 days
|
RcSpylogUrlRationalSigmoidD14T300AtReq
web_production: 571
URL feature computed at the request time from rapid clicks spy_log counters with decay of 14 days
|
LangDispersion
web_production: 574
Dispersion of languages in XMAP
|
HasMisspell
web_production: 575
There is a typo in the request
|
UrlLinkPercent
web_production: 578
Weight: 0.089404211238337 The ratio of the number of incoming links, the text of which is the URL, is one of the incoming links
|
DssmBertDistillL2
web_production: 579
A pool of logs is marked with BERT trained on Sinsig. DSSM model is trained on this pool using BaseregionChain
|
NumNonLettersInUrl
web_production: 580
Weight: -0.011207582653854 The number of 'Nebukv 'in Url
|
UrlLen2
web_production: 581
Weight: 0.007908808762912 The length of the URL with an accuracy to the symbol. Disconnected in production.
|
IsHub
web_production: 582
Weight: 0.097073501164592 Habi page
|
StaticTitleBM25Ex
web_production: 584
Weight: 0.016179974819787 BM25 page title by its text
|
StaticTitleLRBM25
web_production: 585
Weight: 0.038263040612831 BM25 page title by texts of links to it
|
SeoInPayLinks
web_production: 586
Weight: -0.028595315195293 The number of COO-Thrilling links between hosts
|
USLongPeriodUrlMobileDt180Avg
web_production: 587
Static URL factor for search sessions for 1600 days calculated on mobile sessions. Average dwelltime, and DwellTime is cut from the session if more than 180 seconds
|
USLongPeriodUrlMobileLongClickProb
web_production: 588
Static URL factor for search sessions for 1600 days calculated on mobile sessions. The probability that the URL click will be more than 120 seconds
|
USLongPeriodUrlMobileLossesProb
web_production: 589
Static URL factor for search sessions for 1600 days calculated on mobile sessions. The probability that URL is not clicks if they click at least one URL below.
|
USLongPeriodUrlMobileDt3600AvgReg
web_production: 590
Static URL factor for search sessions for 1600 days calculated on mobile sessions. Average dwelltime, and DwellTime is cut from the session if more than 3600 seconds. Localization to the level of countries.
|
USLongPeriodUrlMobileDt180AvgReg
web_production: 591
Static URL factor for search sessions for 1600 days calculated on mobile sessions. Average dwelltime, and DwellTime is cut from the session if more than 180 seconds. Localization to the level of countries.
|
HpDetectorPredict
web_production: 592
The value of the health detector calculated in the Hippo.
|
TitleInLinksTrigrams
web_production: 597
Weight: -0.076334972364641 The share of unique trigrams in the trigrams of links
|
LinksInTitleTrigrams
web_production: 598
Weight: 0.019301158836494 Share of unique trigrams of links in trigrams header
|
TrashAdv
web_production: 599
The greasy of the page
|
MetrikaUrlVisits
web_production: 600
Similar to Yabarurlvisits
|
RegNavQuery
web_production: 603
Regional and navigation request - in the user region there are one or more navigation results on it
|
YabarUrlLcAc
web_production: 604
Weight: -0.046030869083841 The number of sessions in which Url was the last, classified as the sessions in which Url appeared
|
SOMaxSumSourceRank
web_production: 605
Weight: 0.061675217167197 The sum of the maximum values of Sourcerank's for each incoming link, taking into account the uniqueness of the owner.
|
TRLRQuorumFm
web_production: 607
Weight: -0.062810308974889 The weight of the words of the request that is in the text in the exact form
|
TRLRQuorumLemma
web_production: 608
Weight: -0.003021983245146 The weight of the words of the request that is in the text with an accuracy to lemma
|
TRLRQuorumSyn
web_production: 609
The weight of the words of the request that is in the text
|
IsHum
web_production: 610
Weight: 0.003622338166697 It launches on the basic search under the name ISHUM the maximum weight of the enclosed object of the Hum or Hum1 category in the request. (See ((http://wiki.yandex-team.ru/alekseysokirko/queryobjects SOM-OV)))). ((http://wiki.yandex-team.ru/arsengadzhikurbanov/Wares#ishum more)))))
|
IsText
web_production: 611
It launches on the basic search under the name ISTEXT the maximum weight of the TEXT or Text1 category of the category of the category met in the request. (See ((http://wiki.yandex-team.ru/alekseysokirko/queryobjects SOM-OV)))). ((http://wiki.yandex-team.ru/arsengadzhikurbanov/Wares#istext more)))
|
IsPicture
web_production: 612
It launches on the basic search under the name Ispicture the maximum weight of the Picture or Picture1 category of the category of the category of the category in the request. (See ((http://wiki.yandex-team.ru/alekseysokirko/queryobjects SOM-OV)))). ((http://wiki.yandex-team.ru/arsengadzhikurbanov/Wares#ispicture))))))))))))))))))
|
MaxOne
web_production: 613
Weight: -0.059871381556405 Returns the maximum degree of household objects in the request under the name Wmaxone. (See ((http://wiki.yandex-team.ru/alekseysokirko/queryobjects SOM-OV)))). ((http://wiki.yandex-team.ru/arsengadzhikurbanov/Wares#maxone more)))))))
|
MinOne
web_production: 614
Weight: 0.113671587879567 Returns the maximum degree of household objects in the request under the name Wminone. (See ((http://wiki.yandex-team.ru/alekseysokirko/queryobjects SOM-OV)))). ((http://wiki.yandex-team.ru/arsengadzhikurbanov/Wares#minone more)))))
|
LinksAlive
web_production: 620
Allows you to evaluate whether the document is 'alive' is from the point of view of links to it coming.
|
MetrikaUrlVisitors
web_production: 622
Similar to Yabarurlvisitors
|
MetrikaUrlAvgTime
web_production: 623
Similar to Yabarurlavgtime
|
RegexMaxClickPercent
web_production: 625
The share of clicks on this Urlu among all clicks on similar requests
|
RegexCtr
web_production: 626
Corrected CTR of this Urla for all similar requests
|
RcSearchBaseUrlRationalSigmoidD1TM600AtReq
web_production: 635
URL feature computed at the request time from rapid clicks search counters with decay of 1 day
|
RcSearchBaseUrlContrastD30Odd0_9_X_D30T1AtReq
web_production: 638
URL feature computed at the request time from rapid clicks search counters with decay of 30 days
|
DmozQueryBestTheme
web_production: 640
Weight: -0.000807198317231 The most likely theme of the request determined ((http://wiki.yandex-team.ru/jandekspoisk/zarubezhnyjjinternet/dmozqueryClassifier1 The rule of the sorcerer DmozTheme)), only the most popular topics are taken into account (but there are more than in the DMOZQUREMES factor). The factor contains the likelihood of a correspondence of the request of the theme, but for each topic, its own interval is taken on the segment [0..1]
|
DmozQueryThemes
web_production: 641
The theme of the request determined ((http://wiki.yandex-team.ru/jandekspoisk/zarubezhnyjjinternet/dmozqueryClassifier1 The rule of the sorcerer Dmoztheme)), only a few of the most popular topics are taken into account.
|
DiversityCategNeedPhoto
web_production: 642
0 or 1, depending on the presence in the request of the clearly expressed intent Need_photo from the variety
|
DiversityCategNeedMap
web_production: 643
0 or 1, depending on the presence in the request of the clearly expressed intent Need_map from the variety
|
LongQuerySyn
web_production: 644
Weight: 0.058415162135787 The factor is an analogue of LongQuery (the sum of the IDF words of the request), but with the 'correct' accounting of synonyms. Specifically, a minimum of IDF (i.e. the most frequent) of synonyms and words is selected.
|
UrlHasShortCountryNameToken
web_production: 645
Url contains a token that coincides with the short name of the user country. The factor is considered only on the EU stream.
|
ExpectedFound
web_production: 647
Expected number of found on request
|
FooterInLinksTrigrams
web_production: 648
The share of unique trigrams of a footer fragment in trigrams of links
|
LinksInFooterTrigrams
web_production: 649
The share of unique trigrams of links among a fragment of trigrams of a footer
|
ErratumLogQueryProbability
web_production: 650
Double logarithm of the probability of a request for a language model of the Erratum typo service
|
QueryUrlCorrectedCtr
web_production: 657
'Fixed' clicks counted using REQUESTAGGRETELIB
|
QueryUrlCorrectedCtr_Reg
web_production: 658
'Fixed' clicks calculated using Requestaggregatelib. Regional version
|
YabarUrlVisits_Reg
web_production: 659
Regional attendance of Urla according to the I-Bara
|
MetrikaUrlHostVisitTime
web_production: 660
The average time of the user stay on the host with an external (from another non-search site) entry from a specific URL
|
MetrikaUrlHostVisitDepth
web_production: 661
The average 'depth' (the number of transitions within the framework of the host) of the user stay on the host with an external (from another non-playing site) entry from a specific URL
|
AvgSessionLen
web_production: 665
The average length of the logical session in which there was a request
|
YabarUrlDownloads
web_production: 667
Assessment of the probability of leaps from the document
|
HostUserLeakage
web_production: 669
User outflow coefficient from the search after a visit to the site
|
IsIndexPage
web_production: 671
This is Index. (HTML/PHP/ASPX?/...), without CGI parameters. It is considered to be for all takes.
|
IsIndexPageSoft
web_production: 672
This is Index. (HTML/PHP/ASPX?/...), possibly with CGI parameters. It is considered to be for all takes.
|
IsOwner
web_production: 673
Whether the host is the owner, conditionally host == Owner (Host).
|
MinPathLen
web_production: 674
The minimum length of Pathandquery for all half -shoes.
|
RankComGoodness
web_production: 681
Classifier for estimates of commercial sites
|
HasDownloadLinkOnFile
web_production: 682
The document has a direct link to the file
|
HasDownloadLinkOnFileHosting
web_production: 683
The document has a link to filehosting
|
DiversityCategDownload
web_production: 684
0 or 1 - whether the request is matured by the tickt
|
DiversityCategReview
web_production: 685
0 or 1 - whether the request is matured by the tickt
|
DiversityCategWatch
web_production: 686
0 or 1 - whether the request is matured by the tickt
|
QrTur
web_production: 687
The prediction of the share of “good” (at least two different cities and frequency> = 10) references to the request with geography in Turkey
|
QueryThEncyclopedic
web_production: 688
The result of the work of the lexical classifier of requests predicting the likelihood of click on the theme of 3561
|
QueryThVideohosting
web_production: 689
The result of the work of the lexical classifier of requests predicting the likelihood of click on the page 3973 page
|
IsNavMxQuery
web_production: 690
Rank 'navigation'
|
QueryUrlYabarVisits_Reg
web_production: 691
Regional attendance from search engines for a specific request
|
ClickedWithAnotherSEClicks
web_production: 692
Clicks on the urlahs shown in the issuance for requests, by which they went to look for other search engines
|
ShowsWithAnotherSEClicks
web_production: 693
Urlov shows in the issuance for requests, by which they went to look for other search engines
|
CommercialOwnerRank_Reg
web_production: 694
Classifier of the commerciality of the site
|
HasUserReviews
web_production: 698
The document contains user review/comment
|
RegexMaxClickPercentReg
web_production: 699
The share of clicks on this Urlu among all clicks according to similar requests, the country version, see ((http://wiki.yandex-team.ru/development/poisk/Arcadia/indexregex Indexregex)))))))))
|
RegexCtrReg
web_production: 700
Corrected CTR of this Urla for all similar requests, country version, see (http://wiki.yandex-team.ru/development/poisk/Arcadia/indexregex Indexregex))))))
|
Found
web_production: 701
The average number of found on request
|
YabarWordDepthNodesGradientMin
web_production: 702
The angle in the Depth Nodes space, counted only by words (min for all)
|
RankComGoodnessBar
web_production: 704
Classifier that approximate the quality of commercial sites based on user behavior data
|
DocCreateMonth
web_production: 705
The time of creating a document with an accuracy of 1.0 is the current month, 0- 10 years ago and older. Temporarily disconnected
|
DocUpdateMonth
web_production: 706
The time for updating the document with an accuracy of 1.0 is the current month, 0- 10 years ago and older. Temporarily disconnected
|
DaterStatsYearNormLikelihood
web_production: 709
The function of the credibility of the distribution of years in the document. Temporarily disconnected
|
DaterStatsAverageSourceSegment
web_production: 712
The arithmetic mean position of dates in the document. Temporarily disconnected
|
BeastNqUrlMeanPos
web_production: 715
The average position of Urla for a normalized request
|
BeastNqOwnerMeanPos
web_production: 716
The average position of Domattr for a normalized request
|
BeastUrlMeanPos
web_production: 717
The average position of Urla for all requests
|
BeastHostMeanPos
web_production: 718
The average position of the host for all requests
|
BeastUrlNumQueries
web_production: 719
Number of requests for URL
|
BeastHostNumQueries
web_production: 720
Number of requests for host
|
YabarHostBrowseRank_Reg
web_production: 721
Implementation of the algorithm described in the article ((http://wiki.yandex-team.ru//h.yandex.net/?http%3A%2F%2FreseRosoft.microsoft.com%2Fen-US%2FPEOPLIULIUUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUUP032-LIUUUU .pdf http://research.microsoft.com/en-us/people/tyliu/fp032-liu.pdf)) by large regions (tube)
|
SegmentWordPortionFromMainContent
web_production: 723
The share of the words of the document from the segments with Score> 2.
|
UrlDomainSimilarityFixed
web_production: 724
|
TotalDups
web_production: 725
|
RankBoostGoodness
web_production: 726
The rank of site quality used for boosts of the Moscow commercial formula
|
QueryDOwnerClicksFRCRegGeo
web_production: 727
|
QueryURLClicksFRCRegGeo
web_production: 728
|
UrlShowsWithNextPageClicksP1
web_production: 730
|
UrlShowsWithNextPageClicksP10
web_production: 731
The factor is used in Selectionrank. TG_UNUSED: should not be included in the formulas to avoid feedback
|
RcSearchBaseUrlRationalSigmoidD3T120AtReq
web_production: 735
URL feature computed at the request time from rapid clicks search counters with decay of 3 days
|
OwnerCTRWithNextPageClicksP10
web_production: 736
|
CommRus
web_production: 737
The weight of the document on a monosyllabic dictionary of commercial vocabulary
|
WikiLinkCount
web_production: 738
|
UrlInLinksTrigramsStatic
web_production: 739
|
LinksInUrlTrigramsStatic
web_production: 740
|
QueriesAvgCM2
web_production: 742
Average query commerciality
|
QiQueryCount
web_production: 743
The number of requests in the group of frequency requests similar to a given
|
QiUrlFreqWeightedFRC
web_production: 744
FRC groups of frequency requests similar to a given, with averaging through the sum of clicks and shows
|
QiUrlFreqWeightedFRCReg
web_production: 745
FRC groups of frequency requests similar to a given, with averaging through the sum of clicks and shows, according to regional statistics
|
RcSearchBaseUrlRationalSigmoidD1TM600Frozen
web_production: 746
URL feature computed from rapid clicks search frozen counters with decay of 1 day
|
RegexMaxClickPercentYabarReg
web_production: 750
The share of clicks on this Urlu among all clicks on similar requests, counted according to Popular Search Engine
|
YabarHostSurfTrDpNdLeafLn
web_production: 751
The length of the Depth Nodes petal counted for hosts
|
YabarHostSurfTrNdTmGrDsp
web_production: 752
Dispersion of the angle in the space of Nodes Time, calculated for hosts
|
YabarHostSurfTrNdTmLeafLn90
web_production: 753
0.9-quarter of the length of the petal in the space of Nodes Time, calculated for hosts
|
NastyContent
web_production: 755
Content ugliness factor.
|
SynnormURLPCTR
web_production: 756
CTR according to click data, the request is normalized according to Sinsets
|
SynnormURLPCTRReg
web_production: 757
Regional CTR according to click data, the request is normalized according to Sinsets
|
UrlQueryTrigramsStatic
web_production: 758
Static trigrams intercection of url and queries by which users visited the url.
|
AdvAspam
web_production: 759
|
HasPornoQuery
web_production: 760
The result of the work of Adult Rules for the Sorcerer.
|
QUBm15Weighted
web_production: 761
Weighed BM15 for a request for an index document - a list of requests for which they switched to it.
|
BrowserHostDownloadProbability
web_production: 764
The likelihood of a racing from a host after click (on the logs of the bar).
|
NHopChainsCountFrc
web_production: 765
The number of chains on request / (the number of chains in which URL + the number of chains on request participated).
|
NHopIsFinal
web_production: 766
The number of chains in which Url was the last normalized for the total number of chains in which this URL was.
|
VisitsFromWiki
web_production: 767
Number of transitions to URL from Wikipedia
|
RcSearchBaseUrlContrastD30Odd0_9_X_D30T1Frozen
web_production: 768
URL feature computed from rapid clicks search frozen counters with decay of 30 days
|
RegBrowserUserHub
web_production: 769
The page indicator is like a hub (how many pages are the bar users pass from it).
|
AuxTitleBM25
web_production: 770
TEXTBM25 is considered in the title by the text of the name of the user region - similar to the factor 268.
|
NoProductsProbability
web_production: 772
DSSM Prediction of the probability of URL + Title that there is no product on the page.
|
PopularSEFRCBrowser
web_production: 773
FRC Popular Search System for Browser Logs
|
LogCtrMean
web_production: 774
Weighted mean of log(query_clicks)/log(query_shows) for given host. Weights are proportional to log(query_shows) + 0.2.
|
QueryUrlNhopTotalFrc
web_production: 775
The number of transitions on the request for URL, found in the Hopes chain, normalized to the general garlic of the transitions on request.
|
QueryUrlNhopIsFinal
web_production: 776
The probability of Urla to be the last upon request in the chain of Hopes.
|
OneProductProbability
web_production: 777
DSSM Prediction of the probability of URL + Title, which is on the page one product.
|
ManyProductsProbability
web_production: 778
DSSM Prediction of the probability of URL + Title, that there are a lot of goods on the page.
|
RcSearchBaseUrlRationalSigmoidD3T120Frozen
web_production: 779
URL feature computed from rapid clicks search frozen counters with decay of 3 days
|
GeoCityUrlHasCity
web_production: 780
For Urla, a geo-approval of the city level is determined according to the rules of the BUKI-1125
|
GeoCityUrlHasCountry
web_production: 781
For Urla, a geo-approval of the country's level is determined according to the BUKI-1125 rules
|
GeoRelevRegionCityGeoa
web_production: 782
Factor Gorelevregions of the 1th Attichut and Geoa
|
GeoRelevRegionRegionGeoa
web_production: 783
Factor GorelevregionRegionRegion Natthew GEOA
|
GeoGeometryProximGeoa
web_production: 784
Factor Geogeetryproxim ▪ Attributu GEOA
|
GeoRelevAlienCityGeoa
web_production: 785
Factor Gorelevaliencity n Att. Att. Attibtu Geoa
|
GeoVQueryInUserCityGeoa
web_production: 786
Factor Geovqueryinusercidence n Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Att. Attfruut and Geoa
|
GeoVQueryInAlienCityGeoa
web_production: 787
Geovquery Geovqueryinieniencity n Att. Att. Attib
|
PageRegionSizeGeo
web_production: 788
PageRegionsize Factor by GEO attribute
|
PageRegionCoverageGeo
web_production: 789
PageRegioncoverage Factor GEO attribute
|
PageRegionCoverageAdresa
web_production: 790
PageRegioncoverage Factor on Adresa attribute
|
GeoRelevRegionCityAdresa
web_production: 791
GeorelevregionCity Factor on Adresa attribute
|
DoppQueryUrlSessionClicksFRC
web_production: 792
What part (on average in the session) from the clinked in this query Urlov is this URL. It is considered to be user sessions.
|
OwnerIsActualShop
web_production: 793
Aries is a store
|
OwnerIsService
web_production: 794
Aries is a service
|
SameQueryReturnFRCBrowser
web_production: 796
FRC by transitions from requests that were set by the user several times
|
QueryURLISBMCTR
web_production: 797
The average weight of the shows on the first page; Click weighs 1, non -click - according to the SBM_GAMMAS table
|
QueryURLISBMCTRReg
web_production: 798
The average weight of the shows on the first page; Click weighs 1, non -click - according to the SBM_GAMMAS table. Regional version
|
RegexBeastPositionReg
web_production: 799
Half -Summaria assessment of the position of Url with a median position for all similar queries on bisters
|
RcSpylogHostRationalSigmoidD3T0AtReq
web_production: 800
Host feature computed at the request time from rapid clicks spy_log counters with decay of 3 days
|
RcSpylogHostRationalSigmoidD3DTM3600AtReq
web_production: 801
Host feature computed at the request time from rapid clicks spy_log counters with decay of 3 days
|
RcSpylogHostRationalSigmoidD14T0AtReq
web_production: 802
Host feature computed at the request time from rapid clicks spy_log counters with decay of 14 days
|
RcSpylogHostRationalSigmoidD14DTM3600AtReq
web_production: 803
Host feature computed at the request time from rapid clicks spy_log counters with decay of 14 days
|
RcSpylogHostRationalSigmoidedCTRD3DT0TM3600AtReq
web_production: 804
Host feature computed at the request time from rapid clicks spy_log counters with decay of 3 days
|
RcSpylogHostRationalSigmoidedCTRD14DT0TM3600AtReq
web_production: 805
Host feature computed at the request time from rapid clicks spy_log counters with decay of 14 days
|
RcSpylogHostRationalSigmoidD3T0Frozen
web_production: 806
Host feature computed from rapid clicks spy_log counters with decay of 3 days
|
RcSpylogHostRationalSigmoidD3DTM3600Frozen
web_production: 807
Host feature computed from rapid clicks spy_log counters with decay of 3 days
|
RcSpylogHostRationalSigmoidD14T0Frozen
web_production: 808
Host feature computed from rapid clicks spy_log counters with decay of 14 days
|
RcSpylogHostRationalSigmoidD14DTM3600Frozen
web_production: 809
Host feature computed from rapid clicks spy_log counters with decay of 14 days
|
RcSpylogHostRationalSigmoidedCTRD3DT0TM3600Frozen
web_production: 810
Host feature computed from rapid clicks spy_log counters with decay of 3 days
|
RcSpylogHostRationalSigmoidedCTRD14DT0TM3600Frozen
web_production: 811
Host feature computed from rapid clicks spy_log counters with decay of 14 days
|
OwnerIsPartner
web_production: 817
Aries is a partner
|
ShopInShopUrl
web_production: 818
The document is Shopinshop
|
QueryConversionDetectorPredict
web_production: 819
The value of the conversion of the request calculated in the Hippo.
|
ProductOfferAnyAvailable
web_production: 821
At least one offer from a sporled scheme has an accessibility status.
|
ProductOfferNoProducts
web_production: 822
There is not a single offer in the porous scheme.
|
YandexMarketProductUrl
web_production: 829
URL is a product on the market.
|
YandexMarketProductIncludeOfferidUrl
web_production: 830
URL is a product on the market and has Offerid.
|
ShopInShopCPAUrl
web_production: 831
URL is Shopinshopcpa.
|
ProductOfferNotAvailable
web_production: 832
At least one offer from a sporled scheme has an inaccessibility status.
|
NavParasites
web_production: 835
DSSM Prediction of the probability of URL + Title that the document is an overlap.
|
OfferAvailabilityIsSetUp
web_production: 836
In the offer from the new Parser, the PartnerOfferContent Available field is exhibited.
|
OfferAvailability
web_production: 837
In an offer from the new Parser, the PartnerOfferContent Available Field == True.
|
URLClicksMaxGeoCityFRCWeight
web_production: 838
Normalized corrected clicks count by query with user's city(gc=) mentioned
|
URLClicksMaxGeoCityFRCRatio
web_production: 839
Normalized corrected clicks maximum ratio by query with user's city(gc=) mentioned
|
URLClicksMaxGeoAlienCityFRCRatio
web_production: 840
Normalized corrected clicks maximum ratio by query with not user's city(gc=) mentioned
|
PurchaseTotalPredict
web_production: 842
The value of PURCHASETOTALPredict, calculated in the Hippo.
|
SerpSummarySurplusPredict
web_production: 843
The value of serpsummarysurpluspredict, calculated in the hippo.
|
YabarUrlRevisits
web_production: 844
User return on URL
|
RequestWith120D3ClickPartPredict
web_production: 845
Requestwith120d3ClickpartPredict value, calculated in the Hippo.
|
QueryNavParasitesDetectorPredict
web_production: 846
The value of the requester detector of the parasites calculated in the hippo.
|
BrowserHostCntDwellTimeLog
web_production: 847
Middle Logarithm by the user's location on the host with localization in the country; It is considered according to Yabar logs
|
BrowserHostDwellTimeRegionFrc
web_production: 848
The attitude of Dwell Time on a host in this region to Dwell Time on a host in all regions
|
BrowserUrlDwellTimeRegionFrc
web_production: 849
The attitude of Dwell Time on the page in this region to Dwell Time on a page in all regions
|
BrowserBookmarksUrl
web_production: 850
The more users add to bookmarks a url, the more factor value it has
|
SosDssm
web_production: 851
Predict SOS.DSSM models by URL + Title.
|
MedDssm
web_production: 852
Med.DSSM Predictions URL + Title models.
|
FinLawDssm
web_production: 853
FIN_LAW.DSSM Predictions URL + TITLE.
|
WikiInfobox
web_production: 854
On danny url is a link from inFobox-ov to Wikipedia.
|
CrueltyDssm
web_production: 855
Predict Cruelty.dssm URL + TITLE models.
|
HalfEcomPredict
web_production: 856
The value of Halfecompredict, calculated in the Hippo.
|
PrefixSuffixMaxClickPercentReg
web_production: 857
A factor similar to RegexmaxclickPercentreg, but calculated by Preffix-Suffix Generalization.
|
PrefixSuffixMaxClickPercentYabarReg
web_production: 858
A factor similar to REGEXMAXCLICKPERCENTYABARREG, but calculated according to PREFFIX-SUFFIX Generalization.
|
DssmNavigationL2
web_production: 859
Request and documentary navigation model.
|
YabarHostSurfTrNdHgGr
web_production: 860
The average sung of inclination in the plane of the peak
|
QueryUrlCorrectedCtrXfactor
web_production: 861
Request-murl factor. Value is the result of the collaborative filtration of data for the QueryurlCorrectedCTR factor
|
QueryDocTitleRangesMatchingScore
web_production: 866
The factor on the text of the request and heading (Title) of the document, assessment of the compliance of numerical ranges in words-markers
|
IsTranslatedDocument
web_production: 870
A sign that the document was received by machine translation
|
MedDssmWithTrash
web_production: 871
Prediction of Med_with_Trash.DSSM (Medic. Document model with Tresh Valley in Lern) Models for URL + Title.
|
FinLawDssmWithTrash
web_production: 872
Prediction FIN_LAW_WITH_TRASH.DSSM (Fin-Jur. Document model with a tresh valve in Lern) Models for URL + Title.
|
SamplePeriodClickFrcSyn
web_production: 877
The share of Urla in the total number of Urlov closed for the session on request (Synnorm).
|
SamplePeriodDayFrcSyn
web_production: 878
The average share of clicks for this UrLU for this request among all clicks for this request (Synnorm) during the day.
|
SamplePeriodDayFrc
web_production: 879
The average share of clicks for this UrLU for this request among all clicks for this request (QNORM) during the day.
|
QiQueryUrlCorrectedCtrXfactor
web_production: 880
QI version of factor 861. MaxValue over the set of popular similar queries.
|
QiQueryURLISBMCTRReg
web_production: 881
QI version of factor 798. MaxValue over the set of popular similar queries.
|
SamplePeriodDayFrcXfactor
web_production: 886
Request-murl factor. Value is the result of the collaborative filtration of data for the SampleperiodDayFRC factor
|
QiSamplePeriodDayFrc
web_production: 889
QI version of factor 879.
|
ShortVideo
web_production: 892
A document is a short video (Tiktok, Reels, Shorts).
|
TelegramChannelWebFormat
web_production: 893
Document-telegram channel in web format.
|
TelegramPost
web_production: 894
Document - post in a telegram.
|
IsNotCgi
web_production: 899
Factor about the presence of a symbol '?' In Url. It is zero if the Url has CGI parameters (more precisely: all duplicate have a symbol '?' In Url).
|
FractionOfQueriesWithGeoPredicted
web_production: 942
Prediction of a share of requests with geography on a bag of words built for request
|
IsExactUrl
web_production: 943
The request is a Urle with an accuracy of the points and testing characters - the ISURL sorcerer's rule is used
|
WeightedUnMatchUrlPredictedAndUserRegion
web_production: 966
The likelihood that the Yweb/Robot/urlgeo_ml region is correct is correct, provided that the city is predicted
|
URLClicksMaxGeoRegionFRCRatio
web_production: 988
Normalized corrected clicks maximum ratio by query with user's city(gc=) mentioned equal by region
|
URLClicksMaxGeoRegionOnlyFRCRatio
web_production: 989
Normalized corrected clicks maximum ratio by query with user's city(gc=) mentioned equal to user's region
|
IsLocalProbability
web_production: 1057
The value of the classifier of localization for request
|
IsRelevLocaleRU
web_production: 1058
Relev_locale == ru
|
IsRelevLocaleUA
web_production: 1059
Relev_locale == ua
|
IsRelevLocaleBY
web_production: 1060
relev_locale == by
|
IsRelevLocaleKZ
web_production: 1061
relev_locale == kz
|
QClassPornoVw
web_production: 1064
Porn query classification result from Wizard (iad_vw flag, based on Vowpal Wabbit)
|
FullUrlFraction
web_production: 1065
URL coating with trigrams from the request. Analogue of Urldomainfraction, Urlpathandparamsfraction factors.
|
More90SecVisitsShare
web_production: 1076
The share of visits for which the time spent during the day on the host is more than 90 seconds
|
More160SecVisitsShare
web_production: 1077
The share of visits for which the time spent during the day on the host is more than 160 seconds
|
RankHackedNovaPhp
web_production: 1078
Rank of hacked sites
|
RankAgs4
web_production: 1079
Rank AGS4
|
MaxQsDocClassQsRankPthQuerySpam
web_production: 1080
Maximum QSRANK on the owner
|
AvgQsRankOnNotSubdomainDocs
web_production: 1081
Average QSRANK on the main domain
|
VisitorsReturnMonthShare
web_production: 1082
The share of users who returned within a month
|
VisitorsReturnMonthNumber
web_production: 1083
The number of users returning within a month
|
RankXitDoor
web_production: 1084
Rank Dorweev
|
AvgTitleCapitalLettersRatio
web_production: 1085
Share of the capital letters in Title
|
FromSearchShareNormalized
web_production: 1086
The share of incoming traffic from search engines among all incoming traffic
|
GreenTrafficShareNormalized
web_production: 1087
The share of direct visits among all incoming traffic
|
AvgQsFWnd500TOKEN
web_production: 1088
Middle QSRank in a sliding window
|
MinOwnerQsRank
web_production: 1089
Minimum QSRANK
|
AvgNumhops
web_production: 1090
Average HOPS
|
IsMobileBeauty
web_production: 1209
The binary factor about the mobile adaptability of the document. It is taken from ERF
|
ForeignDomain
web_production: 1210
In those cases when fi_national_domain is 0, and Herf.NationalDomainid is filled 1
|
EmbedVideoBroken
web_production: 1235
A broken built -in video on the page.
|
QueryDocRandom
web_production: 1267
Random float in [0,1] by user request and document
|
SumFlashArea
web_production: 1269
the ratio of the total area of all Flash blocks to the screen area
|
UrlHostFraction
web_production: 1271
Copy of Old Version No.294 Factor. Added for Use on L3 Stage Only. Coating domain three -bouqu and request. (Chelyabinsk lottery - Chelloto. We translate a request to translite, find the three -book that are covered (Che, Hel, Lot, Olo), we look at what share of all three -bouquets are covered)
|
UrlHitsCoverage
web_production: 1272
Fast version of FI_URL_DOMAIN_FRACTION
|
TiktokTag
web_production: 1275
Document - this is a selection of Tiktok /Tag
|
TiktokDiscovery
web_production: 1276
Document - this is a selection of Tictock /Discovery
|
TiktokMusic
web_production: 1277
Document - this is a selection of Tiktok /Music
|
DssmSinsigL2
web_production: 1278
Request-document model Sinsiga.
|
USLongPeriodUrlCtr
web_production: 1324
Static URL factor in search sessions in 1600 days. Ordinary CTR.
|
USLongPeriodUrlDt3600Avg
web_production: 1325
Static URL factor in search sessions in 1600 days. Average dwelltime, and DwellTime is cut from the session if more than 3600 seconds
|
USLongPeriodUrlDt180Avg
web_production: 1327
Static URL factor in search sessions in 1600 days. Average dwelltime, and DwellTime is cut from the session if more than 180 seconds
|
USLongPeriodUrlLongClickProb
web_production: 1328
Static URL factor in search sessions in 1600 days. The probability that the URL click will be more than 120 seconds
|
USLongPeriodUrlShows
web_production: 1329
Static URL factor in search sessions in 1600 days. Logarithm of the number of shows.
|
USLongPeriodUrlWinsProb
web_production: 1331
Static URL factor in search sessions in 1600 days. The probability that URL is clicking if they do not click on at least one URL higher.
|
USLongPeriodUrlLossesProb
web_production: 1332
Static URL factor in search sessions in 1600 days. The probability that URL is not clicks if they click at least one URL below.
|
USLongPeriodUrlCtrReg
web_production: 1333
Static URL factor in search sessions in 1600 days. Ordinary CTR. Localization to the level of countries.
|
USLongPeriodUrlDt3600AvgReg
web_production: 1334
Static URL factor in search sessions in 1600 days. Average dwelltime, and DwellTime is cut from the session if more than 3600 seconds. Localization to the level of countries.
|
USLongPeriodUrlLongClickProbReg
web_production: 1335
Static URL factor in search sessions in 1600 days. The probability that the URL click will be more than 120 seconds. Localization to the level of countries.
|
USLongPeriodUrlPositionAvgReg
web_production: 1336
Static URL factor in search sessions in 1600 days. The average position of the URL for all requests. Localization to the level of countries.
|
USLongPeriodUrlShowsReg
web_production: 1337
Static URL factor in search sessions in 1600 days. Logarithm of the number of shows. Localization to the level of countries.
|
DssmLogDwelltimeBigramsL2
web_production: 1354
DSSM model trained on clicks. Takes bigrams into account. Embeddings for documents are computed offline.
|
RankArtroz
web_production: 1355
Rank of the quality of texts on the host. The higher, the greater the likelihood that the host is full of articles - a rewriting, a bad copy of the content ordered on the exchanges of content. Burning stronger as the before the aggregation.
|
DssmVkPopularity
web_production: 1360
The probability that the VK.com host is popular for this request in accordance with the corresponding DSSM model.
|
DssmOnlinerPopularity
web_production: 1361
The likelihood that the Onliner.by host is popular for this request according to the corresponding DSSM model.
|
DssmRamblerPopularity
web_production: 1364
The probability that the Rambler.ru host is popular for this request in accordance with the corresponding DSSM model.
|
DssmExpertcenPopularity
web_production: 1365
The likelihood that the ExpertCen.ru host is popular for this request in accordance with the corresponding DSSM model.
|
DssmSunhomePopularity
web_production: 1366
The probability that the Sunhome.ru host is popular for this request according to the corresponding DSSM model.
|
UBLongPeriodVisitsSNProb
web_production: 1367
Static URL factor in browser logs for the maximum period. The percentage of traffic from social networks in all traffic from other hosts and search.
|
UBLongPeriodDirectHChildren90CntFromExtHost
web_production: 1368
Static URL factor in browser logs for the maximum period. The average number of direct descendants from the host on which they spent more than 90 seconds. The descendant is straight, only if there is a link from our page to the descendant and crossed it.
|
UUBLongPeriodDepthFromExtHost
web_production: 1369
Static URL factor in browser logs for the maximum period. The average maximum depth of wood with the root in the current URL is when the URL is visited from other hosts.
|
UBLongPeriodBrowseFrc
web_production: 1370
Static URL factor in browser logs for the maximum period. The number of times when the feather was transferred to the page to the total number of pages to which they switched from a sickle. The closer to 1, the more often the page was opened the only one in the session.
|
UBLongPeriodAvgSearchDuration600
web_production: 1371
Static URL factor in browser logs for the maximum period. The average length of search sessions, when they switched to the page from a sickle
|
UBLongPeriodSearchPercentEnd
web_production: 1372
Static URL factor in browser logs for the maximum period. The formula for calculating the factor we look at Wiki.
|
UBLongPeriodSearchPercentMiddle30
web_production: 1373
Static URL factor in browser logs for the maximum period. The formula for calculating the factor we look at Wiki.
|
UBLongPeriodVisit120Prob
web_production: 1374
Static URL factor in browser logs for the maximum period. The probability that the user will spend on the page> 120 seconds.
|
UBLongPeriodLeavesCnt
web_production: 1375
Static URL factor in browser logs for the maximum period. The number of leaves in URLA support. In this case, the leaves are a page from which there were no transitions.
|
UBLongPeriodDtUrlHChildrenCut600
web_production: 1376
Static URL factor in browser logs for the maximum period. The average time spent on the page and in all descendants of the page (URLS to which they switched) from the host. Cut off if the total DT is more than 10 minutes
|
UBLongPeriodMinTimeWhenPageShow
web_production: 1377
Static URL factor in browser logs for the maximum period. The minimum Unix Time when the page appeared in the logs for the first time.
|
UBLongPeriodDeltaAvgMinTimeWhenPageShow
web_production: 1378
Static URL factor in browser logs for the maximum period. The difference between the middle and minimum Unix Time when the page appeared in the logs.
|
UBLongPeriodLatitude
web_production: 1379
Static URL factor in browser logs for the maximum period. Current breadth where the page was viewed from.
|
UBLongPeriodLongitude
web_production: 1380
Static URL factor in browser logs for the maximum period. Current longitude where the page was viewed from.
|
UBLongPeriodDownloadsProb
web_production: 1381
Static URL factor in browser logs for the maximum period. The likelihood of leaps from the page
|
UBLongPeriodDownloadsImageProb
web_production: 1382
Static URL factor in browser logs for the maximum period. The likelihood of image jumps from the page
|
UBLongPeriodDownloadsTorrentProb
web_production: 1383
Static URL factor in browser logs for the maximum period. The probability of leap torrent file from the page
|
UBLongPeriodSearchPercentEndReg
web_production: 1384
Static URL factor in browser logs for the maximum period. The formula for calculating the factor we look at Wiki. Localization to the level of countries.
|
UBLongPeriodLeavesCntReg
web_production: 1385
Static URL factor in browser logs for the maximum period. The number of leaves in URLA support. In this case, the leaves are a page from which there were no transitions. Localization to the level of countries.
|
UBLongPeriodDtUrlHChildrenCut600Reg
web_production: 1386
Static URL factor in browser logs for the maximum period. The average time spent on the page and in all descendants of the page (URLS to which they switched) from the host. Cut if the total DT is more than 10 minutes. Localization to the level of countries.
|
MisspellLmNgrYandexDirectOriginal
web_production: 1387
Summary Skorov words of a request for a language model 3GRAMS-YANDEX-DIRECT.
|
MisspellLmRtlNgrWebMtOriginal
web_production: 1388
Summary of the Skorov words of the request by the Web-Mt language model.
|
UBLongPeriodRank
web_production: 1389
Static URL factor in browser logs for the maximum period. Rank, based on only UBLP meters, which allows you to find many SBR losses
|
AllMatchedWordWeightsSum
web_production: 1407
The normalized amount of the scales of the words of the request that met in the text of the document or links to it.
|
StringMatchedWordWeightsSum
web_production: 1408
The normalized amount of the scales of the words of the request that Equal_by_String in the text of the document or links to it.
|
AllMatchedWordWeightsSumText
web_production: 1409
The normalized amount of the scales of the words of the request that met in the text of the document.
|
AllMatchedWordWeightsSumLink
web_production: 1410
The normalized amount of the scales of the words of the request that met in the links to the document.
|
StringMatchedWordWeightsSumLink
web_production: 1411
The normalized amount of the scales of the words of the request that Equal_by_String in the links to the document.
|
AllMatchedWordFiltrationModelWeightsSum
web_production: 1412
The normalized scales for the IFILTRETRATIONMODEL words of the request that met in the text of the document or links to it.
|
StringMatchedWordFiltrationModelWeightsSum
web_production: 1413
The normalized scales for the IFILTRETRATIONMODEL Words of the request, which are Equal_by_String in the text of the document or links to it.
|
LemmaMatchedWordFiltrationModelWeightsSum
web_production: 1414
The normalized scales for the IFILTRETRATIONMODEL Words of the request, which Equal_by_lemma in the text of the document or links to it.
|
AllMatchedWordFiltrationModelWeightsSumLink
web_production: 1415
The normalized scales for the IFILTRETRATIONMODEL words of the request that met in links to the document.
|
StringMatchedWordFiltrationModelWeightsSumLink
web_production: 1416
The normalized scales for the IFILTRETRATIONMODEL Words of the request, which Equal_by_String in the links to the document.
|
DssmLanguageClassifierRusL2
web_production: 1425
Document DSSM model Language Classifier Rus.
|
DssmLanguageClassifierEngL2
web_production: 1426
Document DSSM model Language Classifier Eng.
|
DssmLanguageClassifierOthL2
web_production: 1427
Document DSSM model Language Classifier Other.
|
RandomLogQueryAvgNews
web_production: 1432
The average value of News for the year. It is calculated in offline.
|
RandomLogQueryAvgAddTime
web_production: 1433
ADDTIME average value for the year. It is calculated in offline.
|
RandomLogQueryAvgTxtHiRelSy
web_production: 1434
The average value of TXTHIRELSY for the year. It is calculated in offline.
|
RandomLogQueryAvgTextLike
web_production: 1435
The average TEXTLIKE value is for the year. It is calculated in offline.
|
RandomLogQueryAvgHasNoAllWordsTRSy
web_production: 1436
The average Hasnoallwordstersy value for the year. It is calculated in offline.
|
RandomLogQueryAvgIsForum
web_production: 1437
The average value of ISFORUM for the year. It is calculated in offline.
|
RandomLogQueryAvgHasPayments
web_production: 1438
The average value of Haspayments for the year. It is calculated in offline.
|
RandomLogQueryAvgYabarHostAvgTime2
web_production: 1439
The average value is Yabarhostavgtime2 for the year. It is calculated in offline.
|
RandomLogQueryAvgYabarUrlVisitors
web_production: 1440
The average value of Yabarurlvisitors for the year. It is calculated in offline.
|
RandomLogQueryAvgQueryDOwnerOnlyClickRate
web_production: 1441
The average value of QueryDowneronlyClickRate for the year. It is calculated in offline.
|
RandomLogQueryAvgDaterAge
web_production: 1442
The average value of Dateraage for the year. It is calculated in offline.
|
RandomLogQueryAvgLongestText
web_production: 1443
The average value of the LonGestText for the year. It is calculated in offline.
|
RandomLogQueryAvgDifferentInternalLinks
web_production: 1444
The average value is DifferentinTernallinks for the year. It is calculated in offline.
|
RandomLogQueryAvgQueryDOwnerOnlyClickRate_Reg
web_production: 1445
The average value of QueryDowneronlyClickRate_Rreg is for a year. It is calculated in offline.
|
RandomLogQueryAvgIsHub
web_production: 1446
The average ISHUB value is for the year. It is calculated in offline.
|
RandomLogQueryAvgBM25_0
web_production: 1448
The average value is BM25_0 on request per year. It is calculated in offline.
|
RandomLogQueryAvgBocm
web_production: 1449
The average value of BOCM for the year. It is calculated in offline.
|
RandomLogQueryAvgIsIndexPage
web_production: 1450
The average ISindexpage is for the year. It is calculated in offline.
|
RandomLogQueryAvgQueriesAvgCM2
web_production: 1451
The average value of queriesavgcm2 for the year. It is calculated in offline.
|
RandomLogQueryAvgBrowserHostDownloadProbability
web_production: 1452
The average value of BrowserhostDownloadProbabolyity for the year. It is calculated in offline.
|
RandomLogQueryAvgRegBrowserUserHub
web_production: 1453
The average value of Regbrowseruserhub for the year. It is calculated in offline.
|
RandomLogQueryAvgAuxTitleBM25
web_production: 1454
Auxtitlebm25 average value for the year. It is calculated in offline.
|
RandomLogQueryAvgQueryUrlCorrectedCtrXfactor
web_production: 1455
The average value of QuryurlCorrectedctrxFactor for the year. It is calculated in offline.
|
RandomLogQueryAvgQueryToDocAllSumFCountTextBm11Norm16384
web_production: 1456
The average value is QueryTodocallsumfcountTextbM11Norm16384 for the year. It is calculated in offline.
|
RandomLogQueryAvgXfDtShowAllSumWFSumWBodyMinWindowSize
web_production: 1457
The average value of the XFDTSHOWALLSUMWFSUMWBODYMINWIDESIZE for the year. It is calculated in offline.
|
RandomLogQueryClicksWeightedAvgIsMainPage
web_production: 1458
Maintened by clicks ISMainPage value for the year. It is calculated in offline.
|
RandomLogQueryClicksWeightedAvgYabarUrlAvgTime
web_production: 1459
Main -heated clicks of the Yabarurlavgtime value for the year. It is calculated in offline.
|
RandomLogQueryClicksWeightedAvgDifferentInternalLinks
web_production: 1460
Maintened by clicks DifferentinTernallinks for the year. It is calculated in offline.
|
RandomLogQueryDwelltimeWeightedAvgUrlDomainFraction
web_production: 1461
Malpanized Dwelltime-AMI Value of Urldomainfraction for the year. It is calculated in offline.
|
BM25FdPRFixedNoLinks
web_production: 1462
BM25FDPR with standardization on the average length of the document, depending on the language of the document. Only texts are used.
|
HistoricalAnnotationCount
web_production: 1465
Document annotations count in the whole history of the Search (DSSM AnnReg models helper)
|
HistoricalAnnWordCount
web_production: 1466
Document annotation words count in the whole history of the Search (DSSM AnnReg models helper)
|
HistoricalAnnRegionCount
web_production: 1467
Document annotation regions count in the whole history of the Search (DSSM AnnReg models helper)
|
DssmMainContentKeywords
web_production: 1472
Query-MainContentKeywords similarity, target: logDwellTime
|
DssmBoostingXfWeightQuerySelfSimilarity
web_production: 1477
Dssm Boosting query self similarity for XfWeight model.
|
DssmBoostingXfWeightKMeans5AvgTop02Score
web_production: 1478
Dssm Boosting AvgTop02Score aggregation for XfWeight model over 5-means centroids.
|
DssmBoostingXfWeightKMeans5AvgTop04Score
web_production: 1479
Dssm Boosting AvgTop04Score aggregation for XfWeight model over 5-means centroids.
|
DssmBoostingXfWeightKMeans5AvgTop02ScoreAvgClusterTop3Weighted
web_production: 1480
Dssm Boosting AvgTop02ScoreAvgClusterTop3Weighted aggregation for XfWeight model over 5-means centroids.
|
DssmBoostingXfWeightKMeans5AvgTop02ScoreQE
web_production: 1481
Dssm Boosting AvgTop02Score aggregation for XfWeight model over 5-means centroids (query as expansion).
|
DssmBoostingXfWeightKMeans5AvgTop02ScoreAvgClusterTop3WeightedQE
web_production: 1482
Dssm Boosting AvgTop02ScoreAvgClusterTop3Weighted aggregation for XfWeight model over 5-means centroids (query as expansion).
|
DssmBoostingXfOneQuerySelfSimilarity
web_production: 1483
Dssm Boosting query self similarity for XfOne model.
|
DssmBoostingXfOneKMeans1Score
web_production: 1484
Dssm Boosting Score aggregation for XfOne model over 1-means centroids.
|
DssmBoostingXfOneKMeans1ScaledSumWeight
web_production: 1485
Dssm Boosting ScaledSumWeight aggregation for XfOne model over 1-means centroids.
|
DssmBoostingXfOneKMeans1ScoreQE
web_production: 1486
Dssm Boosting Score aggregation for XfOne model over 1-means centroids (query as expansion).
|
DssmBoostingXfOneKMeans1ScoreAvgNearest1WeightedQE
web_production: 1487
Dssm Boosting ScoreAvgNearest1Weighted aggregation for XfOne model over 1-means centroids (query as expansion).
|
DssmBoostingXfOneKMeans1ScoreAvgNearest5WeightedQE
web_production: 1488
Dssm Boosting ScoreAvgNearest5Weighted aggregation for XfOne model over 1-means centroids (query as expansion).
|
DssmBoostingXfOneSeKMeans1Score
web_production: 1489
Dssm Boosting Score aggregation for XfOneSe model over 1-means centroids.
|
DssmBoostingXfOneSeKMeans1ScoreScaledSumWeighted
web_production: 1490
Dssm Boosting ScoreScaledSumWeighted aggregation for XfOneSe model over 1-means centroids.
|
DssmBoostingXfOneSeKMeans1ScoreAvgNearest5Weighted
web_production: 1491
Dssm Boosting ScoreAvgNearest5Weighted aggregation for XfOneSe model over 1-means centroids.
|
DssmBoostingCtrQuerySelfSimilarity
web_production: 1492
Dssm Boosting query self similarity for Ctr model.
|
DssmBoostingCtrKMeans1Score
web_production: 1493
Dssm Boosting Score aggregation for Ctr model over 1-means centroids.
|
DssmBoostingCtrKMeans1ScoreQE
web_production: 1494
Dssm Boosting Score aggregation for Ctr model over 1-means centroids (query as expansion).
|
DssmBoostingCtrKMeans1ScoreScaledSumWeightedQE
web_production: 1495
Dssm Boosting ScoreScaledSumWeighted aggregation for Ctr model over 1-means centroids (query as expansion).
|
DssmBoostingCtrKMeans1ScoreAvgNearest1WeightedQE
web_production: 1496
Dssm Boosting ScoreAvgNearest1Weighted aggregation for Ctr model over 1-means centroids (query as expansion).
|
DssmPageQualityRTHub
web_production: 1505
DSSM prediction (URL + Title), trained for the Page_QUALYY signal and implemented in RTHUB, the first slot.
|
DssmPageQualityRTHubSlot2
web_production: 1506
DSSM prediction (URL + Title), trained on the Page_QUALYY signal and implemented in RTHUB, the second slot.
|
DssmQueryEmbeddingCtrNoMinerPca0
web_production: 1507
The main components of the requesting Embling from the DSSMCTRNOMINER model
|
DssmQueryEmbeddingCtrNoMinerPca1
web_production: 1508
The main components of the requesting Embling from the DSSMCTRNOMINER model
|
DssmQueryEmbeddingCtrNoMinerPca2
web_production: 1509
The main components of the requesting Embling from the DSSMCTRNOMINER model
|
DssmQueryEmbeddingCtrNoMinerPca3
web_production: 1510
The main components of the requesting Embling from the DSSMCTRNOMINER model
|
DssmQueryEmbeddingCtrNoMinerPca4
web_production: 1511
The main components of the requesting Embling from the DSSMCTRNOMINER model
|
DssmQueryEmbeddingCtrNoMinerPca5
web_production: 1512
The main components of the requesting Embling from the DSSMCTRNOMINER model
|
RandomLogHostHasPaymentsAvg
web_production: 1524
AVG aggregation of HasPayments web factor using random log
|
RandomLogHostIsVideoQueryAvg
web_production: 1525
AVG aggregation of VideoQuery web factor using random log
|
RandomLogHostSyntQualityAvg
web_production: 1526
AVG aggregation of SyntQuality web factor using random log
|
RandomLogHostGeoRegionalityVNewPerc90
web_production: 1527
PERCENTALE_90 aggregation of GeoRegionalityVNew web factor using random log
|
RandomLogHostQClassDownloadAvg
web_production: 1528
AVG aggregation of QClassDownload web factor using random log
|
RandomLogHostIsMusicAvg
web_production: 1529
AVG aggregation of IsMusic web factor using random log
|
RandomLogHostQueryThEncyclopedicPerc25
web_production: 1530
PERCENTALE_25 aggregation of QueryThEncyclopedic web factor using random log
|
RandomLogHostCommercialOwnerRankRegAvg
web_production: 1531
AVG aggregation of CommercialOwnerRank_Reg web factor using random log
|
RandomLogHostYabarWordDNGIPerc25
web_production: 1532
PERCENTALE_25 aggregation of YabarWordDepthNodesGradientMin web factor using random log
|
RandomLogHostPopularSEFRCBrowserAvg
web_production: 1533
AVG aggregation of PopularSEFRCBrowser web factor using random log
|
RandomLogHostURLClicksMaxGeoRegionFRCRatioAvg
web_production: 1534
AVG aggregation of URLClicksMaxGeoRegionFRCRatio web factor using random log
|
RandomLogHostUBLongPeriodDirectHChildren90CntPerc90
web_production: 1535
PERCENTALE_90 aggregation of UBLongPeriodDirectHChildren90CntFromExtHost web factor using random log
|
RandomLogHostUBLongPeriodDtUrlHChildrenPerc90
web_production: 1536
PERCENTALE_90 aggregation of UBLongPeriodDtUrlHChildrenCut600Reg web factor using random log
|
RandomLogHostIsPictureAvg
web_production: 1537
AVG aggregation of IsPicture web factor using random log
|
RandomLogHostErratumLogQueryProbabilityAvg
web_production: 1538
AVG aggregation of ErratumLogQueryProbability web factor using random log
|
DssmQueryCountryToUrlEstimatedDistance
web_production: 1542
Predicted by demand and country, using a DSSM model, the length of the click from this country.
|
DssmRandomLogQueryAvgNews
web_production: 1543
The average for the year for the year predicted using the neural network.
|
DssmRandomLogQueryAvgAddTime
web_production: 1544
ADDTIME ADDTIME is predicted using a neural network for a year.
|
DssmRandomLogQueryAvgTxtHiRelSy
web_production: 1545
The average Txthirelesy value predicted using a neural network for the year.
|
DssmRandomLogQueryAvgTextLike
web_production: 1546
The average Textlike is predicted using a neural network for the year.
|
DssmRandomLogQueryAvgHasNoAllWordsTRSy
web_production: 1547
The average HasnoallwordStrsy is predicted using a neural network for a year.
|
DssmRandomLogQueryAvgIsForum
web_production: 1548
The average ISFORUM is predicted using a neural network for the year.
|
DssmRandomLogQueryAvgHasPayments
web_production: 1549
The average Haspayments is predicted using a neural network for the year.
|
DssmRandomLogQueryAvgYabarHostAvgTime2
web_production: 1550
The average value of Yabarhostavgtime2 for the year for the year.
|
DssmRandomLogQueryAvgYabarUrlVisitors
web_production: 1551
The average yabarurlvisitors is predicted using a neural network for the year.
|
DssmRandomLogQueryAvgQueryDOwnerOnlyClickRate
web_production: 1552
The average value of QueryDowneronlyClickRate for the year for the year.
|
DssmRandomLogQueryAvgDaterAge
web_production: 1553
The average Dateraage value for the year for a year predicted using a neural network.
|
DssmRandomLogQueryAvgLongestText
web_production: 1554
The average LonGestText is predicted using a neural network for the year.
|
DssmRandomLogQueryAvgDifferentInternalLinks
web_production: 1555
The average DifferentinTernallinks for the year for the year.
|
DssmRandomLogQueryAvgQueryDOwnerOnlyClickRate_Reg
web_production: 1556
The average value of QueryDowneronlyClickRate_Rreg is predicted using a neural network for a year.
|
DssmRandomLogQueryAvgBocm
web_production: 1560
The average BOCM value predicted using a neural network for the year.
|
DssmRandomLogQueryAvgIsIndexPage
web_production: 1561
The average ISindEXPAGE is predicted using a neural network for the year.
|
DssmRandomLogQueryAvgQueriesAvgCM2
web_production: 1562
The average value of QueriesavGCM2 for the year for the year predicted using a neural network.
|
DssmRandomLogQueryAvgBrowserHostDownloadProbability
web_production: 1563
The average BrowserhostdowLoadProbabolyity for the year for the year.
|
DssmRandomLogQueryAvgRegBrowserUserHub
web_production: 1564
The average value of Regbrowseruserhub for the year for a year predicted using a neural network.
|
DssmRandomLogQueryAvgAuxTitleBM25
web_production: 1565
The average AuxtitlebM25 average value for the year for the year.
|
DssmRandomLogQueryAvgQueryUrlCorrectedCtrXfactor
web_production: 1566
The average value of QuryurlCorrededCTRXFACTOR for the year for the year.
|
DssmRandomLogQueryAvgQueryToDocAllSumFCountTextBm11Norm16384
web_production: 1567
The average value of QueryTodoCallsumfcountTextbM11Norm16384 for the year for the year.
|
DssmRandomLogQueryAvgXfDtShowAllSumWFSumWBodyMinWindowSize
web_production: 1568
The average value of the XFDTSHOWALSUMWFSUMWBODYMINWINDOWSIZE for the year for the year.
|
DssmRandomLogQueryClicksWeightedAvgIsMainPage
web_production: 1569
The value of the ISMAINPAGE with clicks predicted using the neural network with clicks on request for the year.
|
DssmRandomLogQueryClicksWeightedAvgYabarUrlAvgTime
web_production: 1570
A mid Yabarurlavgtime value predicted using a neural network with clicks for a year.
|
DssmRandomLogQueryClicksWeightedAvgDifferentInternalLinks
web_production: 1571
DiffferentinTernallinks, which is predicted using a neural network, is a weighted net with clicks for a year.
|
DssmRandomLogQueryDwelltimeWeightedAvgUrlDomainFraction
web_production: 1572
The Malue Network DwellTime-AMI predicted using the neural network is the value of Urldomainfraction for the year.
|
Regionality5LocalizationProbability
web_production: 1589
The prediction of the probability that the request is localized in accordance with the regionality5 rule.
|
DssmBoostingXfOneSeAmSsHardKMeans1Score
web_production: 1597
Dssm Boosting Score aggregation for XfOneSeAmSsHard model over 1-means centroids.
|
DssmBoostingXfOneSeAmSsHardKMeans1ScoreAvgClusterTop3Weighted
web_production: 1598
Dssm Boosting ScoreAvgClusterTop3Weighted aggregation for XfOneSeAmSsHard model over 1-means centroids.
|
YellownessImgMax
web_production: 1600
Average by url maximum yellowness of teaser image
|
YellownessImgAvg
web_production: 1601
Average by url average yellowness of teaser image
|
YellowImgShare
web_production: 1602
Ratio of yellow images in teasers on host
|
YellowImgCount
web_production: 1603
Average yellow images count on host
|
TeasersCount
web_production: 1604
Average teasers count on host
|
TeasersArea
web_production: 1605
Average teasers area on host
|
YellownessTxtMin
web_production: 1606
Average by url minimum yellowness of teaser text
|
YellownessTxtAvg
web_production: 1607
Average by url average yellowness of teaser text
|
HasAdvClickableBG
web_production: 1608
Background is clickable advertisement
|
AdvNetsArea
web_production: 1609
Average ratio of adverts on screen
|
AdvNetsAreaFirstPage
web_production: 1610
Ratio of adverts on screen on main page
|
AdvNetsCount
web_production: 1611
Average count of adverts on screen
|
AdvTraffOutShareDesktop
web_production: 1612
Ratio of outgoing advertisement traffic to all traffic (desktop)
|
RTBTraffOutShareDesktop
web_production: 1613
Ratio of outgoing real-time bidding traffic to all traffic (desktop)
|
NewsAgencyRating
web_production: 1614
Rating of news agency from agencies.json (Yandex.News resource)
|
DssmBoostingXfOneSeAmSsHardQueryMutationAddFixedYearWordRenormedDistance
web_production: 1624
Characterizes the request for the degree of change from the addition of a fixed word (number of some year), DSSM model DSSMBOOSTINGXFONESEAMSARD is used
|
DssmBoostingXfOneSeAmSsHardQueryMutationAddOnlineWordRenormedDistance
web_production: 1625
Characterizes a request for the degree of change from the addition of a fixed word ('online' for Kirilitsa), DSSM model DSSMBOOSTINGXFONESEAMSARD is used
|
DssmBoostingXfOneSeAmSsHardQueryMutationDelSiteWordRenormedDistance
web_production: 1626
Characterizes the request for the degree of change from removing a fixed word ('site' for Kirilitsa), DSSM model DSSMBOOSTINGXFONESEAMSARD is used
|
DocSourceFresh
web_production: 1627
A document from the hearts with fresh
|
RandomLogWordMaxHasNoTr
web_production: 1628
For each word offline, the average Hasnotr meaning is calculated for 3 months. Further, in all words of the request, the maximum of this value is taken.
|
RandomLogWordMaxIsLJ
web_production: 1629
For each word offline, the average ISLJ value is calculated for 3 months. Further, in all words of the request, the maximum of this value is taken.
|
RandomLogWordMinBclmLite
web_production: 1631
For each word offline, the average BCLMLITE value is calculated for 3 months. Further, in all words of the request, a minimum of this value is taken.
|
RandomLogWordSkipStopWordsMaxDBM40
web_production: 1632
For each word offline, the average DBM40 value is calculated for 3 months. Further, for all non -feet, the words of the request are taken as a maximum of this value.
|
RandomLogWordSkipStopWordsMaxIsDesktopRequest
web_production: 1633
For each word offline, the average ISDESKTOPREQUEST value is calculated for 3 months. Further, for all non -feet, the words of the request are taken as a maximum of this value.
|
RandomLogWordMaxRLQAvgHasNoAllWordsTrSyn
web_production: 1634
For each word offline, the average value of RLQAVGHASNOLLWORDSTRSYN is calculated at the request for 3 months. Further, in all words of the request, the maximum of this value is taken.
|
RandomLogWordMaxDssmAggregatedAnnReg
web_production: 1635
For each word offline, the average DSSMAGGRETEDANNREG value is calculated at the request for 3 months. Further, in all words of the request, the maximum of this value is taken.
|
RandomLogWordMaxMetaNumUrlsPerHostFixed
web_production: 1636
For each word offline, the average meaning of MetanumurlSperhostfixed is calculated in demands in 3 months. Further, in all words of the request, the maximum of this value is taken.
|
RandomLogWordSkipStopWordsMaxSDIsNavMxQueryMax
web_production: 1637
For each word offline, the average value of MaxsdisnavmxqueryMax is calculated at the request for 3 months. Further, for all non -feet, the words of the request are taken as a maximum of this value.
|
RandomLogHostVisitsFromWikiAvg
web_production: 1638
AVG aggregation of VisitsFromWiki web factor using random log
|
RandomLogHostNavLinearPerc25
web_production: 1640
PERCENTALE_25 aggregation of NavLinear web factor using random log
|
RandomLogHostFoundPerc90
web_production: 1641
PERCENTALE_90 aggregation of Found web factor using random log
|
RandomLogHostSubqueryThMatchAvg
web_production: 1642
AVG aggregation of SubqueryThMatch web factor using random log
|
RandomLogHostSegmentWordPortionFromMainContentAvg
web_production: 1644
AVG aggregation of SegmentWordPortionFromMainContent web factor using random log
|