Slice: web_production
(1923 ranking factors)
Factors |
---|
PR
web_production: 0
Weight: 0.182867833093047 PageRank. The factor is remapped.
|
TR
web_production: 1
Text relevance (MaxFreq is the frequency of the most frequent word and serves as a measure of document length).
|
LR
web_production: 2
Weight: 0.049061648412321 Link relevance. The factor is remapped.
|
PrBonus
web_production: 3
Weight: 0.07124278745128 Priority bonus; priority 7 is the text priority. A binary factor: it is 0 for all single-word queries and 1 for almost all queries of two or more words, except for a very small number of results for which neither any link nor the text passed the quorum.
|
TRp1
web_production: 4
STRICT priority for TR (a text priority): all the query words occur somewhere in the document (and satisfy the contextual restrictions of the query, for example, both words must be in one sentence).
|
TRp2
web_production: 5
Weight: -0.109820338929289 PHRASE priority for TR (a text priority): all the query words occur consecutively in the document.
|
LRp1
web_production: 6
(Strict) All the query words occur in a single link.
|
LRp2
web_production: 7
Weight: 0.019119257307239 (Phrase) All the query words occur consecutively in a single link.
|
TRtitle
web_production: 8
Presence of the exact phrase (the query text) in the title (more precisely, in the first sentence of the document). Contextual restrictions and stop words are handled exactly as in TRp2, i.e. factor [8] is bounded above by factor [5].
|
TRhr
web_production: 9
There is a passage that passed the quorum in which all word positions are marked with BEST_RELEV relevance (title or meta keywords).
|
Removed_10
web_production: 10
|
News
web_production: 11
This is a news page (determined by the characteristic ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayaformula/tekushichiekomponenty/klassificacionnye?v=tkd#h45859-3 URL patterns))).
|
Shop
web_production: 12
Weight: 0.097713692186877 This is a store (determined by the characteristic ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayafformula/tekushhiekomponenty/klassificacionnye?v=tkd#h45859-4 URL patterns))). Not used (deprecated).
|
Cat
web_production: 13
This is a catalog (determined by the characteristic ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayafformula/tekushhiekomponenty/klassificacionnye? .
|
YaBar
web_production: 14
Weight: 0.027302374355601 Site traffic according to the bar - ((http://wiki.yandex-team.ru/andrejjkostjagin/yabarlog/hoststat data description)). The factor is remapped.
|
Long
web_production: 15
Weight: -0.084798680877042 Long document (the longer the document, the greater the value of the factor).
|
TRhitw
web_production: 16
HitWeight is a variant of text relevance in which the weights of all hits are considered equal (i.e. title bonuses and word proximity are not taken into account). The corresponding hits must satisfy the restrictions of the syntactic wizard, so the TRhitw factor can be assumed to be 0 if and only if SoftAndOk is 0.
|
LongQuery
web_production: 17
Weight: 0.030334786608805 The sum of the IDFs of the query words. The name is misleading: for example, this factor is larger for the query 'Gadyach' than for the query 'Moscow Peter Yekaterinburg Samara'.
|
PureText
web_production: 18
Long text without links.
|
Root
web_production: 19
This is the main (root) page of the site.
|
Removed20
web_production: 20
|
Removed21
web_production: 21
|
Geo
web_production: 22
Indicates that the region of the user and the region of the site match at the country level. Binary factor: 1 - match, 0 - no match. Based on ((http://wiki.yandex-team.ru/ Yandexposisk/ Classification of Sytraitniki/ Geographic/Sospolzanievpoysk Geoklassification of sites)).
|
SubqueryThMatch
web_production: 23
Match between the thematic spectra of the query and the document. The query topics are the output of ((http://wiki.yandex-team.ru/evgenijjkroxalev/subquery rules of the Subquerysearch wizard)). The topic of the document is taken from Yandex.Catalog.
|
SR
web_production: 24
Weight: 0.049845924868959 The composite Static Rank, assembled from static components according to a separate formula ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAfermula/tekushhiekomponenty/#oftnd1 *)).
|
TRref
web_production: 25
Factor for the number of refines. The query language has a user-refine feature (a word preceded by a percent sign). The idea is roughly 'it would be good if this word were in the document'. The only valuable use of this feature known to ((http://staff.yandex-team.ru/gulin Andrey Gulin)) is the query [%official %site name of the film]. Users do not know about this feature because it is not described in any documentation. It is planned to remove it from the query language, but words with the User_refine priority will remain in the wizard. The factor shows the maximum number of user_refine words found simultaneously within a single hit in the quorum. The count is taken to be from 0 to 3 (anything above 3 is treated as 3). This number is mapped to the half-open interval [0, 1).
|
TRboost
web_production: 26
The multiplier applied to some link factors (namely factors 6, 7, 47 and 66) when text relevance is 0 and there are few links.
|
TRLRlemma
web_production: 27
In textual relevance, Lemma coincides.
|
TrafgraphOutAll_share_d
web_production: 28
Remapped mascot feature TrafgraphOutAll_share_d
|
RelevSentsDssm
web_production: 29
DSSM model trained on reformulations; on the document side it uses the sentences relevant to the query.
|
FreshNewsDetectorPredict
web_production: 30
The value of the news detector calculated in the Hippo. Always 0 with a detector value less than the threshold.
|
LRHitNum100
web_production: 31
Weight: 0.033485833700259 The remapped number of query-word occurrences in all links to the URL.
|
LRHitNumGt16
web_production: 32
The document LR> 20 The number of words of the words of the request in the Links> 16, the factor about LR.
|
PctLinks
web_production: 33
Weight: -0.141668202468497 For documents with a high LR, the normalized link relevance excluding proximity; for documents with a low LR, 0.
|
HasLR
web_production: 34
URL High LR.
|
LinkQuality
web_production: 35
Weight: -0.001564275785704 The quality of incoming links (the 'bream' classifier). Broken, see [405].
|
AliceMusicTrackTitleCosineMatchMaxPrediction
web_production: 36
The value of the CosineMatchMaxPrediction factor for the AliceMusic stream.
|
NumLinks
web_production: 37
The number of incoming links. Remapped.
|
PopularQ
web_production: 38
The popularity of the request
|
TRUnmapped
web_production: 39
TR divided by a cube of the number of words in a request and transformed by a standard REMAPTR.
|
RusLang
web_production: 40
The language of the document is Russian.
|
AddTime
web_production: 41
Weight: 0.006691168756865 The time the page was added; a larger value means an older document. The square root of the time is taken and mapped to the interval [0, 1] so that 3+ years gives 1.
|
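The AddTime description can be read as a square-root remap clipped at three years. The sketch below is only one possible reading: the unit (days), the 365-day year and the order of scaling and root are assumptions, and `add_time_factor` is a hypothetical name, not an identifier from the source.

```python
import math

# Assumption: document age is measured in days and "3 years" is 3 * 365 days.
THREE_YEARS_DAYS = 3 * 365


def add_time_factor(age_days: float) -> float:
    """Square-root remap of document age into [0, 1]; 3+ years gives 1."""
    if age_days <= 0:
        return 0.0
    # Assumption: the age is scaled by three years before the root is taken.
    return min(math.sqrt(age_days / THREE_YEARS_DAYS), 1.0)
```

Under these assumptions a document that is a quarter of three years old gets 0.5, and anything older than three years saturates at 1.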
IsMainPage
web_production: 42
If this is the main page of the owner (most often a second-level domain, for example xxxx.ru), the factor is 1. For free hosting services, personal blogs, etc. (for example, LiveJournal, People.ru) third-level domains (such as xxxxx.narod.ru) also get a factor of 1.
|
AddTimeMP
web_production: 43
The time the main page of the owner (host?) was added. Remapped in the same way as AddTime.
|
AliceMusicTrackTitleAnnotationMaxValueWeighted
web_production: 44
The value of the AnnotationMaxValueWeighted factor for the AliceMusic stream.
|
QueryURLClicksPCTR
web_production: 45
How often they click in this URL for this request - CTR blasting for the correction factor
|
TextBM25
web_production: 46
Simple BM25 in text.
|
LinkBM25
web_production: 47
Simple BM25 for links; link weights are not taken into account.
|
TLBM25
web_production: 48
Weight: 0.031399776481102 Simple BM25 in text and links at the same time.
|
TLp1
web_production: 49
All the words of the request are in the text + links.
|
Adv
web_production: 50
Weight: -0.250928463672112 There is advertising on the site.
|
YandexAdv
web_production: 51
Weight: -0.094261219650513 On the site there is an advertisement for Yandex.
|
NoSpam
web_production: 52
The spam classifier for Picks from Antispam recognized the site as not (!) spam, i.e. 0 = spam, 1 = good.
|
TxtPair
web_production: 53
Weight: -0.020921642736537 Simple BM25 over pairs of words: we take all pairs of query words and count their occurrences in the text of the document. The weight of a pair is the sum of the weights of its words. It does not work if the query contains a stop word.
|
LnkPair
web_production: 54
The same as TxtPair, but for links; link weights are not taken into account.
|
TxtBreak
web_production: 55
BM25 over the number of sentences in the document in which the word occurs.
|
TxtHead
web_production: 56
Weight: -0.037878046829073 BM25 computed over the title only.
|
TxtHiRel
web_production: 57
BM25 computed only over high-relevance words ('significant' ones, with emphasis such as <b>, etc.).
|
Removed_58
web_production: 58
|
WordCount
web_production: 59
min(number of query words / 10, 1.f)
|
InvWordCount
web_production: 60
1 / (number of words in the query)
|
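The WordCount and InvWordCount formulas above are simple enough to write out directly. The function names below are hypothetical, and raising an error for an empty query is an assumption (the source does not say how zero words are handled).

```python
def word_count_factor(num_query_words: int) -> float:
    """WordCount: min(number of query words / 10, 1)."""
    return min(num_query_words / 10, 1.0)


def inv_word_count_factor(num_query_words: int) -> float:
    """InvWordCount: 1 / number of query words."""
    if num_query_words <= 0:
        # Assumption: an empty query is treated as invalid input.
        raise ValueError("query must contain at least one word")
    return 1.0 / num_query_words
```

WordCount saturates at 1 for queries of ten or more words, while InvWordCount keeps decreasing as the query grows.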
HasNoTR
web_production: 61
The document has no TR.
|
HasNoLR
web_production: 62
The document has no LR.
|
HasNoQueryURLShows
web_production: 63
For this URL and this query there is no click information: 1 - the query or the query-URL pair is not in the click database, 0 - the query-URL pair is in the click database.
|
HasNoQueryShows
web_production: 64
Weight: 0.205699196177282 For this query there is no click information: 1 - the query is not in the click database, 0 - the query is in the click database.
|
Hops
web_production: 65
The number of hops to the URL from the main page (the closer to the main page, the lower the value: 0 - the main page itself, 1 - cannot be reached from the main page, between 0 and 1 - can be reached from the main page). The usual value for the root of a host is 0.0039.
|
LogLR
web_production: 66
Weight: 0.026926509552263 Logarithm of LR, linearly mapped to [0, 1].
|
TxtPairEx
web_production: 67
Weight: -0.00667940021707 the presence of pairs of words in the exact form
|
TxtBreakEx
web_production: 68
Weight: 0.024006117828321 the number of sentences in which there are many words in the exact form
|
TxtHeadEx
web_production: 69
Weight: -0.03957553241619 the presence of words in the header in the exact form
|
TxtHiRelEx
web_production: 70
BM25 in the exact form
|
TxtBm25Ex
web_production: 71
Simple BM25 in the exact form.
|
TxtPairSy
web_production: 72
Weight: -0.022152880819573 the presence of pairs of words taking into account synonyms (> = txtpair)
|
TxtBreakSy
web_production: 73
Weight: -0.116819481337211 the number of sentences in which there are many words taking into account synonyms
|
TxtHeadSy
web_production: 74
Weight: -0.012919083353605 the presence of words in the header, taking into account synonyms
|
TxtHiRelSy
web_production: 75
Weight: -0.039215257302626 BM25 taking into account synonyms
|
TxtBm25Sy
web_production: 76
Simple BM25 taking into account synonyms.
|
QueryDOwnerClicksPCTR
web_production: 77
Weight: 0.219595036178226 How often they click in the URLs of this Domainid for this request - Ctr Domainid blasting for the correction factor
|
HasNoQueryDOwnerShows
web_production: 78
Weight: 0.160379344658431 For this DomainId and this query there is no click information: 1 - the query or the query-owner pair is not in the click database, 0 - the query-owner pair is in the click database.
|
OwnerClicksPCTR
web_production: 79
Weight: 0.231000481757815 The owner's clickness regardless of the request
|
Megafon
web_production: 80
The relative frequency of the query words in the links (1 - the query words occur often in links, 0.3 - rarely). More precisely, the value of this factor is pessimized when: TR = 0 && LR = 0 && (no single link contains all the query words) && (the quorum was not passed) && (at least one pair of query words occurs in the text).
|
XLRp0
web_production: 81
There are all the words of the request in the links
|
XLRp1
web_production: 82
There are all the words of the request in one link
|
XLRp2
web_production: 83
Weight: 0.0051601584234 There is a link that has passed quorum
|
XLRgood
web_production: 84
Weight: -0.00083343707893 What is the share of “good” links
|
XLRmanyBad
web_production: 85
How many “bad” links (bad = DPR = 0)
|
XLRmaxDpr
web_production: 86
Weight: -0.065082391728977 Maximum DPR links
|
XLRtfidf
web_production: 87
Ordinary TF*IDF over links. The frequency of a word in the links is multiplied by its inverse document frequency and summed over all words; the result is then normalized by the length of the document.
|
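The XLRtfidf description can be sketched as follows. This is an illustration only: whitespace tokenization, a precomputed IDF table passed in as a dictionary, and document length measured in words are all assumptions, and `link_tfidf` is a hypothetical name.

```python
from collections import Counter
from typing import Dict, Iterable, List


def link_tfidf(query_words: Iterable[str],
               link_texts: List[str],
               idf: Dict[str, float],
               doc_len: int) -> float:
    """Sum over query words of (frequency in links * IDF), divided by doc length."""
    # Count how often each word occurs across all incoming link texts.
    counts = Counter(w for text in link_texts for w in text.lower().split())
    score = sum(counts[w] * idf.get(w, 0.0) for w in query_words)
    # Assumption: an empty document yields 0 rather than an error.
    return score / doc_len if doc_len > 0 else 0.0
```

For example, `link_tfidf(["red", "shoes"], ["buy red shoes", "red shoes shop"], {"red": 1.0, "shoes": 2.0}, 3)` counts each word twice, giving (2*1.0 + 2*2.0) / 3 = 2.0.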
XLRrelev
web_production: 88
Link relevance according to Gulin
|
XLRrelev200
web_production: 89
Link relevance according to Gulin
|
XLRlogRelev
web_production: 90
Link relevance according to Gulin
|
BFexact
web_production: 91
The exact form of every query word occurs in the text/links
|
BFlemma
web_production: 92
The lemma of every query word occurs in the text/links
|
SoftAndOk
web_production: 93
The document passed SoftAnd under the restrictions of the syntactic wizard. Only for documents with text relevance. For single-word queries, always 1.
|
NewLinkQuality
web_production: 94
Weight: 0.021178675054476 Quality classifier of incoming links, version 2. Broken, see [407].
|
Ukrainian
web_production: 95
It is equal to one if the site has a Ukrainian geoist (i.e. 1 - Ukrainian site)
|
IsBlog
web_production: 96
Page from a blog hosting platform
|
IsLivejournal
web_production: 97
Page with Livejournal.com
|
Removed_98
web_production: 98
|
Spam2
web_production: 99
Alekseev's automatic spam classifier: the probability that the site is spam (0 - not spam, 1 - spam)
|
TextFeatures
web_production: 100
Weight: -0.016033504310566 Text quality. Computed by a rather complex formula
|
TextLike
web_production: 101
Weight: -0.094096848692163 Text quality (Alekseev's classifier)
|
Removed_102
web_production: 102
|
Removed_103
web_production: 103
|
YaBarCoreOwner
web_production: 104
The core of the audience of owners according to Yandex.Mrazusing
|
YaBarCoreHost
web_production: 105
The core of the audience of the hosts according to Yandex.Mrazusing
|
HasYaBarCore
web_production: 106
Does the host have a host
|
SpamKarma
web_production: 107
Weight: 0.008426829629948 The antispam team's spam karma: the probability that the host is spam; based on WHOIS information
|
MusicQ
web_production: 108
The musicality of the query. The output of Anton Konygin's wizard.
|
XLExactMatches
web_production: 109
The number of links that exactly coincide with a request
|
DocLen
web_production: 110
Weight: -0.065128132003719 Document length in sentences
|
UrlLen
web_production: 111
Weight: -0.001158034315755 The length of the URL, divided by 5
|
QueryNonCommerciality
web_production: 112
How commercial the query is, according to the dictionary of phrases from Direct: 0 - most commercial, 1 - least commercial.
|
HostSize
web_production: 113
Weight: -0.032004809610482 Raskovalov's host size, measured in documents, without deduplication (each duplicate counts in the factor as a separate document)
|
IsHTML
web_production: 114
Document type - HTML
|
LinkSpeed
web_production: 115
Weight: 0.009455905387837 The inverse of the variance of the times at which links with the query words appeared
|
XThLRrelev
web_production: 116
Link relevance, taking into account thematicity
|
XThLRrelev200
web_production: 117
Link relevance, taking into account thematicity
|
XThLRlogRelev
web_production: 118
Link relevance, taking into account thematicity
|
XLerfLRrelev
web_production: 119
Link relevance, taking into account the quality of each link
|
XLerfLRrelev200
web_production: 120
Link relevance, taking into account the quality of each link
|
XLerfLRlogRelev
web_production: 121
Weight: 0.060594485044371 Link relevance, taking into account the quality of each link
|
XLerfThLRlogRelev
web_production: 122
Link relevance, taking into account the quality of each link and thematicity of each link
|
XNonCommLRlogRelev
web_production: 123
Link relevance, taking into account the non-commerciality of each link
|
XNonCommThLRlogRelev
web_production: 124
Link relevance, taking into account the non-commerciality and thematicity of each link
|
XNonCommLerfLRlogRelev
web_production: 125
Link relevance, taking into account the non-commerciality and quality of each link
|
XNonCommLerfThLRlogRelev
web_production: 126
Link relevance, taking into account the non-commerciality, quality and thematicity of each link
|
GeoCityProxim
web_production: 127
Weight: 0.051465613603836 Indicates that the region mentioned in the query and the region of the found sites match at the level of areas. Binary factor: 1 - match, 0 - no match. Based on ((http://wiki.yandex-team.ru/ Yandexposisk/ Classification of Sytraitniki/ Geographic/Sospolzanievpoysk Geoklassification of sites)).
|
LinksWithWordsPercent
web_production: 128
Weight: -0.060922780495065 The percentage of incoming links with the words of the request
|
LinksWithAllWordsPercent
web_production: 129
Weight: -0.08383112850758 The percentage of incoming links with all the words of the request
|
PornoQuery
web_production: 130
Are there any words from Yweb/Pornofilter/Porno.query.
|
IsPorno
web_production: 131
Document from porn kitski
|
IsComm
web_production: 132
Weight: -0.066463228806236 A document from a commercial clay. Not used (deprecated)
|
IsFake
web_production: 133
Fast document
|
IsSEO
web_production: 134
The page title contains commercial vocabulary. Not used (deprecated)
|
IsWiki
web_production: 135
page from ru.wikipedia.org
|
IsEShop
web_production: 136
Commercial page (Savin's classifier)
|
GeoRegionProxim
web_production: 137
Weight: 0.082967074248567
|
HasNoAllWordsTRSy
web_production: 138
The document does not contain all the query words (up to synonyms)
|
NumWordsTRSy
web_production: 139
The percentage of the query words found in the document (up to synonyms)
|
HasAllWordsTRSy
web_production: 140
The document contains all the query words (up to synonyms)
|
NumWordsLR
web_production: 141
The percentage of the query words found in the links (up to synonyms)
|
HasAllWordsLR
web_production: 142
The links contain all the query words (up to synonyms)
|
PayDetectorPredict
web_production: 143
The value of the commerce detector calculated in the Hippo.
|
TxtInvPair
web_production: 144
TR over pairs of query words in reverse order
|
LnkInvPair
web_production: 145
LR over pairs of query words in reverse order
|
TxtSkipPair
web_production: 146
Weight: -0.077504878926916 TR over pairs of query words separated by one word, in texts
|
LnkSkipPair
web_production: 147
LR over pairs of query words separated by one word, in texts
|
NumWordsTRFm
web_production: 148
The percentage of the query words found in the text (up to word form)
|
HasAllWordsTRFm
web_production: 149
The document contains all the query words (up to word form)
|
QDiversity
web_production: 150
Weight: 0.046783126435468 The degree of concentration of the locations from which the query is issued
|
QBlog
web_production: 151
Whether the query contains blog vocabulary
|
XGeoLRlogRelev
web_production: 152
Weight: 0.009314594460961 log(LR, restricted to the country of the user)
|
XLerfGeoLRlogRelev
web_production: 153
Weight: 0.044511155721215 log(LerfLR, restricted to the country of the user)
|
NonCommercialQuery
web_production: 154
Binary non-commercial query flag: QueryNonCommerciality > 0.965.
|
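The NonCommercialQuery rule is a plain threshold on QueryNonCommerciality. A minimal sketch, with a hypothetical function name and the threshold taken from the description above:

```python
# Threshold as given in the factor description.
NON_COMMERCIAL_THRESHOLD = 0.965


def non_commercial_query(query_non_commerciality: float) -> int:
    """Return 1 when QueryNonCommerciality is strictly above the threshold, else 0."""
    return 1 if query_non_commerciality > NON_COMMERCIAL_THRESHOLD else 0
```

The comparison is strict, so a value of exactly 0.965 yields 0.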
XLExactMatchesMap
web_production: 155
The number of links that coincide with the text of the request (other Remap)
|
XLerfNormLRlogRelev
web_production: 156
XLerfLRlogRelev (normalized by the sum of the Lerf weights of all links rather than by the sum of their original weights)
|
XNonCommNormLRlogRelev
web_production: 157
Weight: 0.062474190501436 XNonCommLRlogRelev (normalized by the sum of the NonComm weights of all links rather than by the sum of their original weights)
|
XNonCommThNormLRlogRelev
web_production: 158
Link relevance, taking into account the non-commerciality and thematicity of each link
|
XNonCommLerfNormLRlogRelev
web_production: 159
XNonCommLerfLRlogRelev (normalized by the sum of the NonCommLerf weights of all links rather than by the sum of their original weights)
|
XNonCommLerfThNormLRlogRelev
web_production: 160
Link relevance, taking into account the non-commerciality, quality and thematicity of each link
|
Nevasca1
web_production: 161
Content borrowing. Not used. 'Host goodness' (from 0 to 1), calculated from how many hosts, and which ones, borrow content from this host.
|
Nevasca2
web_production: 162
Weight: -0.044963560309064 Content borrowing. Not used. 'Host badness' (from 0 to 1): proportional to the amount of secondary content on the host.
|
LinkAge
web_production: 163
Weight: 0.000426528744914 The average age of the links that contributed to LR: LinkAge = min(log(average age of links) / 7, 1); 3 years corresponds to 1
|
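The LinkAge formula can be sketched directly. The source does not state the unit or the logarithm base; the sketch assumes days and the natural logarithm, which is at least consistent with "3 years corresponds to 1", since ln(1095) is about 6.998. The function name and the handling of ages of one day or less are assumptions.

```python
import math


def link_age_factor(avg_age_days: float) -> float:
    """LinkAge = min(log(average link age) / 7, 1), assuming days and natural log."""
    if avg_age_days <= 1:
        # Assumption: ages of one day or less are clamped to 0 instead of going negative.
        return 0.0
    return min(math.log(avg_age_days) / 7.0, 1.0)
```

With these assumptions three years (1095 days) gives a value just under 1, and older link sets saturate at 1.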
TLen
web_production: 164
The length of the page text in words: TLen = map(number of words, 1/400), where map(x, y) = x*y / (1 + x*y)
|
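The TLen remap is fully specified by the formula above, so it can be written out as a short sketch. The helper is called `remap` here only to avoid shadowing Python's built-in `map`; both function names are hypothetical.

```python
def remap(x: float, y: float) -> float:
    """map(x, y) = x*y / (1 + x*y): squashes a non-negative value into [0, 1)."""
    return x * y / (1 + x * y)


def tlen(num_words: int) -> float:
    """TLen = map(number of words, 1/400)."""
    return remap(num_words, 1 / 400)
```

The scale 1/400 sets the midpoint: a 400-word page gets 0.5, a 1200-word page gets 0.75, and the value approaches 1 for very long pages without reaching it.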
IsUnreachable
web_production: 165
The page is unreachable by links from the main page.
|
XLangLRlogRelev
web_production: 166
LR, taking into account whether the language of the link matches the language of the query
|
XLerfLangLRlogRelev
web_production: 167
Weight: 0.000094696411924 LR, taking into account the coincidence of the language of the link and request and accuracy
|
QueryURLClicksFRC
web_production: 168
The ratio of the number of clicks on this URL to all clicks for the query
|
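QueryURLClicksFRC is a plain fraction of clicks. A minimal sketch, with a hypothetical function name; returning 0 when the query has no clicks at all is an assumption, since the source covers missing click data with separate HasNo... factors.

```python
def clicks_fraction(url_clicks: int, total_query_clicks: int) -> float:
    """Share of the clicks for a query that went to one URL."""
    if total_query_clicks == 0:
        # Assumption: no click data for the query maps to 0.
        return 0.0
    return url_clicks / total_query_clicks
```

For example, 25 clicks on the URL out of 100 clicks for the query gives 0.25.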
QueryDOwnerClicksFRC
web_production: 169
Weight: 0.214713693660762 The ratio of the number of clicks on this DomainId to all clicks for the query
|
QueryURLClicksPCTR_copy
web_production: 170
[Bug: A copy of factor 45] How often they click in this URL for this request - CTR blasting for a correction factor
|
DoppQueryUrlSessionClicksFRCCity
web_production: 171
The share (averaged over sessions) of the URLs clicked by the user that this URL accounts for. Computed over user sessions.
|
QueryURLClicksPCTR_Reg
web_production: 172
How often do they click in this URL for this request - CTR blasting for the correction factor, by small regions from Relev_regions.web.txt
|
QueryDOwnerClicksPCTR_Reg
web_production: 173
Weight: 0.047914113074106 How often they click in the URLs of this Domainid for this request - Ctr Domainid to the correction factor, by small regions from Relev_regions.web.txt
|
QueryURLClicksFRC_Reg
web_production: 174
Weight: 0.023610887210981 The ratio of the number of clicks on this URL to all clicks for the query, by small regions from Relev_regions.web.txt
|
QueryDOwnerClicksFRC_Reg
web_production: 175
Weight: 0.118638180985299 The ratio of the number of clicks on this Domainid to all clicks on request, by small regions from Relev_regions.web.txt
|
QueryURLClicksCombo_Reg
web_production: 176
Query URL Clicks Combo, in small regions from Relev_regions.web.txt
|
QueryDOwnerClicksCombo_Reg
web_production: 177
Weight: 0.160420713540373 Query DOwner Clicks Combo, in small regions from Relev_regions.web.txt
|
XLRCatalogRelev
web_production: 178
Weight: 0.0199886635755 LR over catalog descriptions
|
XLRYaCatalogRelev
web_production: 179
LR over descriptions in Yandex.Catalog
|
ExactWordOrderLen
web_production: 180
The length of the longest match of word forms between the text and the query
|
ExactWordOrderWeight
web_production: 181
The weight of the longest match of word forms between the text and the query
|
WordOrderLen
web_production: 182
The length of the longest match by lemma between the text and the query
|
WordOrderWeight
web_production: 183
The weight of the longest match by lemma between the text and the query
|
LinkMaxAge
web_production: 184
The maximum age of a significant cluster of links that contributed to LR
|
TRp1All
web_production: 185
Variant of the corresponding factor that takes stop words into account
|
LRp1All
web_production: 186
Variant of the corresponding factor that takes stop words into account
|
TLp1All
web_production: 187
Weight: 0.055767877134775 Variant of the corresponding factor that takes stop words into account
|
BFexactAll
web_production: 188
Variant of the corresponding factor that takes stop words into account
|
BFlemmaAll
web_production: 189
Weight: 0.059222635368125 Variant of the corresponding factor that takes stop words into account
|
PassageLegacyTR
web_production: 190
Weight: 0.038806477920761 TR of the best passage, i.e. how high-quality the snippet is
|
TxtBM25AttenSyn
web_production: 191
Weight: 0.075434934641649 Tr with discount for suggestions
|
MaxWordHostRank
web_production: 192
Weight: 0.230257144838931 Host rank by the most characteristic word of the query (usually this is the name of the site)
|
MaxWordHostClicks
web_production: 193
Weight: 0.345115883490577 DomAttr click-through rate for the most characteristic word. For example, for all queries that contain the word 'Wikipedia', users click on Wikipedia pages.
|
DomPhraseRank
web_production: 194
Host rank by individual words
|
DomPhraseClickRank
web_production: 195
Weight: 0.076343383792772 Domain clickability by words
|
IsForum
web_production: 196
The URL matches the forum_detector regular expression
|
AliceMusicTrackTitleAnnotationMatchWeightedValue
web_production: 197
The value of the AnnotationMatchWeightedValue factor for the AliceMusic stream
|
IsObsolete
web_production: 198
The URL contains an old date. Used to recognize old news. The factor is 1 if the URL contains a year <= 2007.
|
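The IsObsolete rule can be illustrated with a regular expression. How the real detector extracts a year from the URL is not described in the source; the sketch assumes a standalone four-digit number starting with 19 or 20, and `is_obsolete` is a hypothetical name.

```python
import re

# Assumption: a "year" is a four-digit run starting with 19 or 20
# that is not part of a longer run of digits.
_YEAR_RE = re.compile(r"(?<!\d)(19\d{2}|20\d{2})(?!\d)")


def is_obsolete(url: str, cutoff_year: int = 2007) -> int:
    """Return 1 if the URL contains a year less than or equal to the cutoff, else 0."""
    years = (int(y) for y in _YEAR_RE.findall(url))
    return int(any(y <= cutoff_year for y in years))
```

Under this reading `http://example.com/news/2006/05/item.html` is flagged, while a URL with only `2015` in it, or with a long numeric identifier such as `120061`, is not.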
TRWithStops
web_production: 199
Weight of maximum coincidence of forms in the text and request
|
LRWithStops
web_production: 200
Weight of maximum coincidence of forms in the text and request
|
HasPayments
web_production: 201
The page contains something about 'paid SMS'.
|
IsLinkPessimised
web_production: 202
The antispam team pessimized the site: all dynamic link factors are zeroed. Zerolnk.flt
|
EshopValue
web_production: 203
Weight: -0.123814718900663 The degree to which the page is a store
|
PornoValue
web_production: 204
Pornography of the page
|
TrafgraphOutAll_share_m
web_production: 205
Remapped mascot feature TrafgraphOutAll_share_m
|
TrafgraphOutAllSE_share_d
web_production: 206
Remapped mascot feature TrafgraphOutAllSE_share_d
|
TrafgraphOutAllSE_share_m
web_production: 207
Remapped mascot feature TrafgraphOutAllSE_share_m
|
NoExtClicksShare
web_production: 208
Remapped mascot feature NoExtClicksShare
|
CountersSearchTraffic1
web_production: 209
Weight: 0.024263431712643 Search traffic - transitions from search engines to the site (2nd formula)
|
CountersSearchTraffic2
web_production: 210
Weight: -0.057014032623374 Search traffic - transitions from search engines to the site (2nd formula)
|
DomPhraseYabar
web_production: 211
Weight: 0.085276276270387 Transitions to the site from search engines by individual words, according to the bar
|
AliceMusicArtistNameBclmMixPlainK000001
web_production: 212
The value of the BclmMixPlainK000001 factor for the AliceMusic stream
|
QueryUrlLCS
web_production: 213
The longest common substring of the URL and the query, normalized by the length of the URL
|
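QueryUrlLCS can be sketched with the standard dynamic-programming algorithm for the longest common substring. Reading "LCS" as substring (rather than subsequence), comparing case-insensitively, and returning 0 for an empty URL are assumptions; the function names are hypothetical.

```python
def longest_common_substring_len(a: str, b: str) -> int:
    """Length of the longest contiguous substring shared by a and b."""
    best = 0
    prev = [0] * (len(b) + 1)
    for ch_a in a:
        cur = [0] * (len(b) + 1)
        for j, ch_b in enumerate(b, start=1):
            if ch_a == ch_b:
                # Extend the common run that ended at the previous characters.
                cur[j] = prev[j - 1] + 1
                best = max(best, cur[j])
        prev = cur
    return best


def query_url_lcs(query: str, url: str) -> float:
    """Longest common substring length divided by the URL length."""
    if not url:
        return 0.0
    return longest_common_substring_len(query.lower(), url.lower()) / len(url)
```

For the query `wikipedia` and the URL `ru.wikipedia.org` the common substring has length 9 and the URL has length 16, so the sketch returns 9/16.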
OnlyUrl
web_production: 214
All matches are in the URL only; there are no matches in the text
|
GeoRelevRegionCity
web_production: 215
|
GeoRelevRegionRegion
web_production: 216
|
GeoRelevRegionCountry
web_production: 217
Weight: 0.084012276385059 Three levels of coincidence of the geography of the user and page
|
XLRGeoRelevRegionCity
web_production: 218
|
XLRGeoRelevRegionRegion
web_production: 219
|
XLRGeoRelevRegionCountry
web_production: 220
Weight: 0.042452794899003 Three levels of coincidence of the region of links and request
|
GeoCountryProxim
web_production: 221
Weight: 0.01317157982937 Geographical proximity
|
IsNavQuery
web_production: 222
Whether the query is navigational, judged by clicks on the results
|
MaxWordHostYaBar
web_production: 223
Weight: 0.315439457304752 The most characteristic word of the request corresponding to the site, according to the bar
|
FirstWordHostClicks
web_production: 224
Weight: 0.129641401501547 The click-through rate of the host for the first word of the query. Quite often the first (or last) word of the query clearly indicates the site on which the information should be sought.
|
AliceMusicArtistNameCMMatchTop5AvgMatch
web_production: 225
The value of the CMMatchTop5AvgMatch factor for the AliceMusic stream
|
QueryDOwnerYabarVisits
web_production: 226
Weight: 0.147136648195774
|
QueryDOwnerYabarVisitors
web_production: 227
Weight: 0.119512833156651
|
QueryDOwnerYabarAvgTime
web_production: 228
Weight: 0.122090633457258 The average, over users, of the user's continuous active time (in seconds) on the pages of the host after arriving for the query from the search engine (the factor depends on the pair (query, DomAttr)).
|
QueryDOwnerYabarAvgTime2
web_production: 229
The average, over users, of the user's continuous active time (in seconds) on the pages of the host after arriving for the query from the search engine (the factor depends on the pair (query, DomAttr)). According to the internal counter of Yandex.Bar/Elements/Browser.
|
QueryDOwnerYabarAvgActions
web_production: 230
The average, over users, of the number of active actions (clicks, keystrokes) during the user's continuous stay on the pages of the host after arriving for the query from the search engine (the factor depends on the pair (query, DomAttr)). According to the internal counter of Yandex.Bar/Elements/Browser.
|
QueryUrlYabarVisits
web_production: 231
|
QueryUrlYabarVisitors
web_production: 232
The number of unique visitors from search engines for a specific request
|
QueryUrlYabarAvgTime
web_production: 233
The average, over users, of the user's continuous active time (in seconds) on the page after arriving for the query from the search engine (the factor depends on the pair (query, URL)).
|
QueryUrlYabarAvgTime2
web_production: 234
The average, over users, of the user's continuous active time (in seconds) on the page after arriving for the query from the search engine (the factor depends on the pair (query, URL)). According to the internal counter of Yandex.Bar/Elements/Browser.
|
QueryUrlYabarAvgActions
web_production: 235
The average, over users, of the number of active actions (clicks, keystrokes) on the page after arriving for the query from the search engine (the factor depends on the pair (query, URL)).
|
DssmBertDistillSinsigMseBaseRegChain
web_production: 236
A pool of logs is labeled using BERT trained on Sinsig. The DSSM model is trained on this pool using BaseregionChain
|
DssmBertDistillRelevanceMseBaseRegChain
web_production: 237
A pool of PRS logs is labeled using BERT trained for relevance. The DSSM model is trained on this pool using BaseregionChain
|
AliceMusicArtistNamePerWordCMMaxMatchMin
web_production: 238
The value of the PerWordCMMaxMatchMin factor for the AliceMusic stream
|
AliceMusicArtistNameAttenV1_Bm15_K05
web_production: 239
The value of the AttenV1_Bm15_K05 factor for the AliceMusic stream
|
AliceMusicAlbumTitleAnnotationMaxValueWeighted
web_production: 240
The value of the AnnotationMaxValueWeighted factor for the AliceMusic stream
|
IsForeignQuery
web_production: 241
Request is not in Russian
|
IsForeignCluster
web_production: 242
The document is from a foreign cluster
|
PageRegionSizeIn
web_production: 243
Weight: 0.056552232052119 The size of the page's region
|
PageRegionInvSizeIn
web_production: 244
Weight: -0.006950709230428 The factor is inversely proportional to the size of the page region
|
QueryRegionSize
web_production: 245
The size of the region of the request
|
QueryRegionInvSize
web_production: 246
The factor is inversely proportional to the size of the request's region
|
GeoGeometryProxim
web_production: 247
Weight: -0.000843495929565 The geographical proximity of the user and the site
|
RingsHostRankBadnessOld
web_production: 248
Weight: -0.036532955371613 Characterizes promotion of the site via link rings. The value is the share of external links that belong to link rings and link-exchange networks.
|
YabarHostVisitors
web_production: 249
Weight: 0.085929172196314 The number of unique visitors, remapped exponentially
|
YabarHostSearchTraffic
web_production: 250
Weight: 0.00667848123376 The share of traffic from search engines
|
YabarHostInternalTraffic
web_production: 251
Weight: 0.071417326810502 The share of visits to the site that did not come through links (the address was typed manually or opened from bookmarks)
|
YabarHostAvgTime
web_production: 252
Weight: -0.007634608393132 The average active continuous time per user (in seconds) on the host's pages
|
YabarHostAvgTime2
web_production: 253
Weight: 0.074172193125966 The average active continuous time per user (in seconds) on the host's pages. Based on the internal counter of Yandex.Bar/Elements/Browser.
|
YabarHostAvgActions
web_production: 254
Weight: 0.127979729953137 The average number of active actions per user (clicks, keystrokes) during a continuous stay on the host's pages.
|
YabarHostBrowseRank
web_production: 255
Implementation of the algorithm described in the article ((http://research.microsoft.com/en-us/people/tyliu/fp032-liu.pdf))
|
YabarUrlVisits
web_production: 256
Weight: 0.067151098341326 The number of visits to the URL according to Yandex.Bar
|
YabarUrlVisitors
web_production: 257
Weight: 0.051057813309267 The number of unique visitors to the URL
|
YabarUrlAvgTime
web_production: 258
Weight: 0.003890338237824 The average time a user spends on the page. It is computed as the difference between adjacent transitions.
|
OwnerSatisfied4Rate
web_production: 259
Weight: 0.102548297661617 This is the SEA factor = s4_r / (k_r + 10), where s4_r is the number of clicks longer than 180 seconds and k_r is the total number of clicks. It is computed taking request reformulations into account.
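The formula above can be sketched in a few lines of Python. This is only an illustration of the stated expression s4_r / (k_r + 10); the function and parameter names are hypothetical, and the handling of reformulations mentioned in the description is not modelled.

```python
def satisfied4_rate(long_clicks: int, total_clicks: int, smoothing: int = 10) -> float:
    """Return s4_r / (k_r + 10) as given in the factor description.

    long_clicks  -- s4_r, the number of clicks longer than 180 seconds
    total_clicks -- k_r, the total number of clicks
    smoothing    -- the constant 10 from the formula; it damps the rate
                    for owners that have only a handful of clicks
    """
    if long_clicks < 0 or total_clicks < long_clicks:
        raise ValueError("expected 0 <= long_clicks <= total_clicks")
    return long_clicks / (total_clicks + smoothing)


# Two owners whose clicks are all long: the one with more data scores higher.
print(satisfied4_rate(2, 2))      # 2 / 12
print(satisfied4_rate(200, 200))  # 200 / 210
```

The additive constant in the denominator means an owner needs many long clicks before the value approaches 1.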
|
OwnerSatisfied4Rate_Reg
web_production: 260
This is the SEA factor = s4_r / (k_r + 10), where s4_r is the number of clicks longer than 180 seconds and k_r is the total number of clicks. It is computed taking request reformulations into account. Localized version.
|
UrlQueryVariety
web_production: 261
The degree of variety of the requests for which this URL is clicked
|
IsCommByKeywords
web_production: 262
Commercial page by keywords. Not used (deprecated)
|
DocIdfSum_broken
web_production: 263
IDF sums for various parts of the document. Broken; not used.
|
TitleIdfSum_broken
web_production: 264
Weight: 0.070074395872424 IDF sums for various parts of the document. Broken; not used.
|
HeadingIdfSum_broken
web_production: 265
Weight: 0.061031422056552 IDF sums for various parts of the document. Broken; not used.
|
NormalTextIdfSum_broken
web_production: 266
IDF sums for various parts of the document. Broken; not used.
|
XLRVideoRelev
web_production: 267
Link factor about the presence of a video on the page.
|
AuxTextBM25
web_production: 268
BM25 for the user's region for localized requests; for non-localized requests in KUB, the country is used. The request texts sent for the regions can be viewed in Relev_regions.txt in the wizard.
|
AuxLinkBM25
web_production: 269
The same, for link relevance
|
CommLinksSEOHosts
web_production: 270
Weight: -0.180963639077109 The share of incoming paid links, based on an algorithm for recognizing commercial links. The factor is remapped to [0, 1] if the share of such links is 50%, otherwise 0. ((http://wiki.yandex-team.ru/svetlanashorina/topseolinks selection of artificially boosted sites))
|
CommLinksSEOHostsPornoQuery
web_production: 271
Previous factor multiplied by Pornoquery
|
CommLinksSEOHostsNonComm
web_production: 272
Weight: 0.0033634994869 The CommLinksSEOHosts factor multiplied by NonCommercialQuery
|
TovarCategoryQuery
web_production: 273
The request mentions a product category. Not used (deprecated)
|
TovarCategoryVendor
web_production: 274
The request mentions a vendor. Not used (deprecated)
|
Diversity2
web_production: 275
Weight: 0.001181036676865 Geographical distribution of the request
|
NightQuery
web_production: 276
The request is submitted mainly at night
|
MorningQuery
web_production: 277
Weight: -0.013510450334814 The request is submitted mainly in the morning
|
DayQuery
web_production: 278
The request is submitted mainly in the afternoon
|
EveningQuery
web_production: 279
The request is submitted mainly in the evening
|
HourDiversity
web_production: 280
How pronounced the variation in request submissions is across different times of the day
|
LCor
web_production: 281
Weight: 0.038372460585705 Characterizes the frequency of words in links. The factor is large if the word that contributed to link relevance is rare in links.
|
SubqueryThMatchA
web_production: 282
Weight: 0.178646516342524 Match between the thematic spectra of the request and the document. Request themes are the output of the ((http://wiki.yandex-team.ru/evgenijjkroxalev/subquery rules of the SubquerySearch wizard)). The theme of the document is determined by an automatic classifier.
|
TRDocQuorum
web_production: 283
The weight of the request words that occur in the text
|
LRDocQuorum
web_production: 284
The weight of the request words that occur in the links
|
TRLRDocQuorum
web_production: 285
The weight of the request words that occur in the text and the links
|
OwnerSDiffClickEntropy
web_production: 286
Weight: -0.017928063556114 Entropy of the distribution of clicks
|
OwnerSDiffShowEntropy
web_production: 287
Weight: 0.032525279432611 Entropy of the distribution of shows
|
OwnerSDiffCSRatioEntropy
web_production: 288
Weight: -0.01129676986565 Entropy of the distribution of the clicks/shows ratio
|
XPornoLRlogRelev
web_production: 289
Porn score of the document based on link text
|
XPornoNormLRlogRelev
web_production: 290
Porn score of the document based on link text, with a different normalization
|
XPornoQuery
web_production: 291
Classifier of porn requests, using a different dictionary than PornoQuery
|
AliceMusicAlbumTitleAttenV1_Bm15_K05
web_production: 292
The value of the AttenV1_Bm15_K05 factor for the AliceMusic stream
|
GeoCountryCountryProxim
web_production: 293
The geographical proximity of the country of the site and the country of request
|
UrlDomainFraction
web_production: 294
Weight: 0.564095297143887 Trigram coverage between the domain and the request. Example: Chelyabinsk lottery - chelloto. We transliterate the request, find the trigrams that are covered (che, hel, lot, oto), and look at what share of all trigrams is covered.
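A minimal sketch of this trigram coverage in Python follows. It assumes the request is already transliterated, that trigrams are taken inside each word, and that the denominator is the set of distinct domain trigrams; the description leaves these details open, so all names and choices here are hypothetical.

```python
def letter_trigrams(word: str) -> set[str]:
    """Distinct three-letter sequences of a single word (lowercased)."""
    letters = "".join(ch for ch in word.lower() if ch.isalnum())
    return {letters[i:i + 3] for i in range(len(letters) - 2)}


def domain_trigram_coverage(translit_query: str, domain_label: str) -> float:
    """Share of the domain's trigrams that also occur in the request.

    Assumption: coverage is measured against the domain's trigrams;
    the source only says "what share of all trigrams is covered".
    """
    domain = letter_trigrams(domain_label)
    if not domain:
        return 0.0
    query: set[str] = set()
    for word in translit_query.split():
        query |= letter_trigrams(word)
    return len(domain & query) / len(domain)


# "chelyabinskoe loto" is an assumed transliteration of the example request.
coverage = domain_trigram_coverage("chelyabinskoe loto", "chelloto")
print(round(coverage, 3))  # 4 of the 6 domain trigrams are covered
```

Working on sets of distinct trigrams keeps the value in [0, 1] regardless of repetitions in the domain name.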
|
UrlPathAndParamsFraction
web_production: 295
Weight: -0.162220616846705 The same as the previous factor, but for the entire URL except the domain
|
SpecificalQuery
web_production: 296
The request is local-specific: it is often reformulated with an explicitly specified region. ((https://ml.yandex-team.ru/archive/thread1433892/#Message1433892 more))
|
JokerLen
web_production: 297
Text features are computed as if the page title were attached to each sentence of the page, so a title word and any other word are treated as being within one sentence of each other. Len is the maximum, over sentences (with the title attached), of the number of request words found in the sentence, relative to the length of the request. Example [Harms Circus Vertunov] for ((http://wiki.yandex-team.ru//h.yandex.net/?http%3A%2F%2FWWWWIKILIVRES.info%2FWIKI%2F%25D0%25A6%25D %25b8%25D1%2580%25D0%25D0%25A %25BC%25D1%2581%of this document))
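The idea of attaching the title to every sentence can be sketched as follows. This is a simplified illustration, not the production computation: it matches lowercase exact tokens only (no word forms or synonyms), and the function names, tokenizer and sample texts are all hypothetical.

```python
import re


def tokenize(text: str) -> list[str]:
    """Lowercase word tokens; a stand-in for real lemmatization."""
    return re.findall(r"\w+", text.lower())


def joker_len(query: str, title: str, sentences: list[str]) -> float:
    """Max over sentences of (request words found in sentence + title) / request length."""
    query_words = set(tokenize(query))
    if not query_words:
        return 0.0
    title_words = set(tokenize(title))
    best = 0.0
    for sentence in sentences:
        # The title acts as a "joker": its words count as part of every sentence.
        available = title_words | set(tokenize(sentence))
        best = max(best, len(query_words & available) / len(query_words))
    return best


sentences = [
    "The circus came to town.",
    "Vertunov walked onto the circus stage.",
]
# Invented sample page; only the second sentence plus the title covers all three words.
print(joker_len("kharms circus vertunov", "Daniil Kharms", sentences))
```

Without the title attached, no single sentence in this sample would contain the author's name, so the value would be lower.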
|
JokerWeight
web_production: 298
The ratio of the IDF sum of the words found in a sentence plus the title to the IDF sum of all words.
|
ExactJokerLen
web_production: 299
The same as JokerLen, but for exact word forms
|
ExactJokerWeight
web_production: 300
The same as JokerWeight, but for exact word forms
|
More120SecVisitsNotSearchShare
web_production: 301
Remapped mascot feature More120SecVisitsNotSearchShare
|
LnkBreak
web_production: 302
Weight: 0.078872214489662 Analogues of the corresponding text factors for links. BM25 over the number of links in which a match occurred.
|
LnkBm25Ex
web_production: 303
Simple BM25 over exact word forms in link texts
|
LnkPairSy
web_production: 304
Weight: 0.046891090311905 The presence of pairs of request words in links, taking synonyms into account
|
LnkBrkSy
web_production: 305
Weight: 0.035447186193336 The number of links that passed the threshold
|
LnkBm25Sy
web_production: 306
Simple BM25 over links, taking synonyms into account
|
VideoQuery
web_production: 307
Request about the video
|
OwnerClicksPCTR_Reg
web_production: 308
Weight: 0.166327421401765 The owner's clickability regardless of the request, computed separately per region
|
OwnerSDiffClickEntropy_Reg
web_production: 309
Weight: -0.160285061981584 Entropy of the distribution of clicks. Regionalized
|
OwnerSDiffShowEntropy_Reg
web_production: 310
Weight: 0.004768007631846 Entropy of the distribution of shows. Regionalized
|
OwnerSDiffCSRatioEntropy_Reg
web_production: 311
Weight: -0.023916010788926 Entropy of the distribution of the clicks/shows ratio. Regionalized
|
Adultness
web_production: 312
equals 2 * NastyContent
|
HostAdultness
web_production: 313
equals 2 * NastyContent
|
KCHostAdultness
web_production: 314
always zero
|
IsCom
web_production: 315
Weight: 0.276250497243267 Domain in the .com zone
|
IsUa
web_production: 316
Domain in the .ua zone
|
IsNotRu
web_production: 317
Weight: 0.081289466115302 Domain is not in the .ru zone
|
XLRMarketRelev
web_production: 318
LR by links from Yandex.Market
|
Poetry
web_production: 319
How poem-like the document is
|
PoetryQuad
web_production: 320
The maximum poem-likeness over quatrains
|
EngLang
web_production: 321
Document language - English
|
Has2ExactQueryParts
web_production: 322
The request is fully covered by two exact groups, each consisting of exact matches of consecutive request words ((http://wiki.yandex-team.ru/poiskovajaplatform/tr/coveragebygroups about coverage by groups))
|
HasLevensht1QueryFragment
web_production: 323
There is a group consisting of exact matches of request words that covers the request (possibly with one word skipped, added or replaced)
|
LargestSyInexactGroup
web_production: 324
Weight: -0.067337343351376 The share of the request covered by the longest group consisting of any hits (including word forms and synonyms), possibly with one word skipped, added or replaced
|
TimeProfilesMatchWD
web_production: 325
Characterizes the proximity of the time profiles of the request and the document on business days
|
TimeProfilesMatchWE
web_production: 326
Characterizes the proximity of the time profiles of the request and the document on weekends
|
CyrLang
web_production: 327
The document is in a language that uses Cyrillic script
|
GeoRegionalityU
web_production: 328
Request factors: the output of the ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayaformula/tekushhiekomponenty/georegionality classifier of request geolocalization))
|
GeoRegionalityR
web_production: 329
R (geo-relevant): regional results in the search results could be useful, but nothing more
|
GeoRegionalityV
web_production: 330
V (geo-vital): regional results are of fundamental importance
|
UrlHasNoDigits
web_production: 331
There are no digits in the URL
|
AliceMusicTrackArtistNamesAllWcmMaxMatch
web_production: 332
The AllWcmMaxMatch factor
|
AliceMusicTrackAlbumTitleCosineMatchMaxPrediction
web_production: 333
The value of the CosineMatchMaxPrediction factor for the AliceMusic stream
|
SynS1
web_production: 334
Shows how unnatural the text is from the point of view of the Russian language: an estimate of how likely the document text was produced by a synonymizer or another automatic generator. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAfermula/tekushhiekomponenty/antispam?v=1il#h58953-2 more))
|
SynFLremap1
web_production: 335
Weight: 0.002431406823392 Shows how unnatural the text is from the point of view of the Russian language: an estimate of how likely the document text was produced by a synonymizer or another automatic generator. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAfermula/tekushhiekomponenty/antispam?v=1il#h58953-2 more))
|
SynFLremap2
web_production: 336
Weight: 0.08033186404617 Shows how unnatural the text is from the point of view of the Russian language: an estimate of how likely the document text was produced by a synonymizer or another automatic generator. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAfermula/tekushhiekomponenty/antispam?v=1il#h58953-2 more))
|
OwnerSessNormDuration
web_production: 337
Weight: 0.126700168643196 nd/k: normalized time per click
|
UrlSessNormDurRate
web_production: 338
Weight: 0.025806639721603 nd/i
|
QueryDOwnerSessNormDuration
web_production: 339
CONTRY / K
|
QueryDOwnerWeightClick
web_production: 340
Weight: 0.202186193546053 w/k
|
QueryDOwnerOnlyClickRate
web_production: 341
Weight: 0.185032224423923 o/i
|
QueryDOwnerClickSummary
web_production: 342
Weight: 0.077454131996933 Selected formula
|
QueryDOwnerSatisfied4Rate
web_production: 343
Weight: 0.148292222594522 r_s4b/(r_k + 10)
|
SyntQuality
web_production: 344
Weight: 0.010872234578071 Does the request have a complete syntactic analysis
|
PageDate
web_production: 345
Weight: -0.034716206980983 The date of the document as stated on the page, remapped
|
VisitsPVisitors
web_production: 346
Remapped mascot feature VisitsPVisitors
|
RingsHostRankBadness2
web_production: 347
Additional factors about promotion of the site via link rings ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAformula/tekushhiekomponenty/antispam?v=181r#h58953-4))
|
RingsHostRankBadness3
web_production: 348
Weight: -0.02860873903883 Additional factors about promotion of the site via link rings ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAformula/tekushhiekomponenty/antispam?v=181r#h58953-4))
|
RingsHostRankBadness4
web_production: 349
Additional factors about promotion of the site via link rings ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAformula/tekushhiekomponenty/antispam?v=181r#h58953-4))
|
HasTextPos
web_production: 350
The document has textual relevance
|
QSegmentsBM25
web_production: 351
Weight: -0.059299975637935 BM25, where the selected segments of the request act as 'words'
|
QSegmentsWeight
web_production: 352
Weight: -0.057628362537565 'Weight' of the segments of the request in the text
|
SynPercentBadWordPairs
web_production: 353
An indicator of how unnatural the text is from the point of view of the Russian language. The number of bad word pairs in the text, mapped to the interval [0, 1] by the formula z/(z+10)
|
SynNumBadWordPairs
web_production: 354
The proportion of bad pairs among all pairs found in the table: z/(x+1), where z is the number of bad pairs in the text and x is the number of pairs from the text that were found in the ((http://wiki.yandex-team.ru/evgenijgrechnikov/testsynonimizers table of 2000 pairs))
|
NumLatinLetters
web_production: 355
Weight: -0.086731079136512 The number of Latin letters in the text (not counting markup), mapped to [0, 1] by the formula n/(n+100)
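Several factors in this list squeeze an unbounded count into [0, 1) with the same x/(x+a) shape. A short Python sketch of that mapping, using the constant 100 from the formula above, is shown below; the function name is hypothetical.

```python
def saturating_remap(x: float, a: float) -> float:
    """Map a non-negative count x into [0, 1) as x / (x + a).

    The constant a is the count at which the result reaches 0.5.
    """
    if x < 0 or a <= 0:
        raise ValueError("expected x >= 0 and a > 0")
    return x / (x + a)


# n/(n+100): a text with 100 Latin letters lands exactly in the middle.
for n in (0, 100, 900):
    print(n, saturating_remap(n, 100))
```

The mapping is monotonic and never reaches 1, so very large counts stay distinguishable from each other only weakly.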
|
RingsHostRankBadness1
web_production: 356
Weight: -0.036381245328354 Additional factors about promotion of the site via link rings ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAformula/tekushhiekomponenty/antispam?v=181r#h58953-4))
|
DocIdfSumFixed
web_production: 357
Fixed version of the corresponding _broken factor
|
TitleIdfSumFixed
web_production: 358
Weight: 0.047164043400143 Fixed version of the corresponding _broken factor
|
HeadingIdfSumFixed
web_production: 359
Weight: -0.068235863277027 Fixed version of the corresponding _broken factor
|
NormalTextIdfSumFixed
web_production: 360
Fixed version of the corresponding _broken factor
|
QueryURLClicksCombo
web_production: 361
A factor combined in a non-trivial way from FRC and pseudo-CTR
|
QueryDOwnerClicksCombo
web_production: 362
Weight: 0.369078039338024 A factor combined in a non-trivial way from FRC and pseudo-CTR
|
LRAmortizedByAge
web_production: 363
Weight: 0.003128580544172 Link relevance with a penalty for old links
|
RusWordsInText
web_production: 364
The number of words in the text (a word is whatever the lemmatizer extracted), mapped to [0, 1] by the formula x/(x+a)
|
RusWordsInTitle
web_production: 365
Weight: 0.03118624384934 The number of words of the Russian language in the title
|
MeanWordLength
web_production: 366
Weight: 0.019580616053835 The average length of the word
|
PercentWordsInLinks
web_production: 367
Weight: 0.057053549836014 The percentage of words inside <a> .. </a> tags among all words
|
PercentVisibleContent
web_production: 368
Weight: -0.032828345615772 The percentage of words outside tags (outside the angle brackets <>) among all words
|
PercentFreqWords
web_production: 369
Weight: -0.020210221137273 The percentage of words in the text that belong to the 200 most frequent words of the language
|
PercentUsedFreqWords
web_production: 370
Weight: -0.063976585802142 The number of the 500 most frequent words of the language that are used in the text, divided by 500
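The two frequent-word factors measure different things: one counts how much of the text is made of frequent words, the other how much of the frequent-word list appears in the text. A minimal Python sketch, with a toy four-word list standing in for the real 200- and 500-word lists (which are not given here), might look like this. Values are returned as fractions rather than percentages, and all names are hypothetical.

```python
def percent_freq_words(text_words: list[str], top_words: list[str]) -> float:
    """Share of the text's word occurrences that are in the frequent-word list."""
    if not text_words:
        return 0.0
    top = set(top_words)
    return sum(word in top for word in text_words) / len(text_words)


def percent_used_freq_words(text_words: list[str], top_words: list[str]) -> float:
    """Share of the frequent-word list that occurs at least once in the text."""
    top = set(top_words)
    if not top:
        return 0.0
    return len(top & set(text_words)) / len(top)


toy_top = ["the", "and", "of", "to"]          # placeholder for the real list
text = ["the", "cat", "and", "the", "dog"]
print(percent_freq_words(text, toy_top))       # 3 of 5 occurrences
print(percent_used_freq_words(text, toy_top))  # 2 of 4 list entries
```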
|
TrigramsProb
web_production: 371
Weight: -0.002170850269151 Logarithm of the geometric mean of the probabilities of the trigrams in the text (the probability of a trigram is the number of its occurrences in the text divided by the number of all trigrams). Mapped to [0, 1] by the formula -x (x+a)
|
TrigramsCondProb
web_production: 372
Weight: 0.026650508120317 Logarithm of the geometric mean of the conditional probabilities of the trigrams. The conditional probability of a trigram is its probability divided by the probability of the bigram formed by its first two words
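The two trigram statistics can be sketched in Python as below. The sketch assumes word trigrams, probabilities estimated from the text itself (as the TrigramsProb description states), and a geometric mean taken over trigram occurrences; the final remap to [0, 1] is left out because its exact form is not recoverable from the description. Function names are hypothetical.

```python
import math
from collections import Counter


def log_geo_mean_trigram_prob(words: list[str]) -> float:
    """Log of the geometric mean of in-text trigram probabilities."""
    trigrams = list(zip(words, words[1:], words[2:]))
    if not trigrams:
        return 0.0
    counts = Counter(trigrams)
    total = len(trigrams)
    # The log of a geometric mean is the arithmetic mean of the logs.
    return sum(math.log(counts[t] / total) for t in trigrams) / total


def log_geo_mean_trigram_cond_prob(words: list[str]) -> float:
    """Same, for P(trigram) / P(bigram of its first two words)."""
    trigrams = list(zip(words, words[1:], words[2:]))
    bigrams = list(zip(words, words[1:]))
    if not trigrams:
        return 0.0
    tri_counts, bi_counts = Counter(trigrams), Counter(bigrams)
    logs = []
    for t in trigrams:
        p_trigram = tri_counts[t] / len(trigrams)
        p_bigram = bi_counts[t[:2]] / len(bigrams)
        logs.append(math.log(p_trigram / p_bigram))
    return sum(logs) / len(logs)


sample = "to be or not to be or not to be".split()
print(log_geo_mean_trigram_prob(sample))
print(log_geo_mean_trigram_cond_prob(sample))
```

Repetitive text concentrates probability on a few trigrams, which raises the first value toward zero; that is the kind of signal a naturalness check can pick up.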
|
DoppDOwnerPCTR
web_production: 373
An analogue of the QueryDOwnerClicksPCTR factor; it differs in that requests are normalized by doppelgangers (details of this normalization: ((http://staff.yandex-team.ru/finder Andrei Plakhov)), code: yandex/doppelganges)
|
DoppDOwnerPCTR_Reg
web_production: 374
An analogue of the QueryDOwnerClicksPCTR factor; it differs in that requests are normalized by doppelgangers (details of this normalization: ((http://staff.yandex-team.ru/finder Andrei Plakhov)), code: yandex/Doppelganges). Localized according to Relev_regions.web.txt
|
DoppUrlPCTR
web_production: 375
An analogue of the QueryUrlClicksPCTR factor; it differs in that requests are normalized by doppelgangers (details of this normalization: ((http://staff.yandex-team.ru/finder Andrei Plakhov)), code: Yandex/Doppelganges)
|
DoppUrlPCTR_Reg
web_production: 376
An analogue of the QueryUrlClicksPCTR factor; it differs in that requests are normalized by doppelgangers (details of this normalization: ((http://staff.yandex-team.ru/finder Andrei Plakhov)), code: Yandex/Doppelganges). Localized according to Relev_regions.web.txt
|
UrlBM25
web_production: 377
Weight: 0.066890922161289 BM25 over the URL
|
HasBigPicture
web_production: 378
The page has a big picture
|
MatrixNet
web_production: 379
Weight: 0.114624515228977 MatrixNet applied to all factors: the formula (tg_unized, so that it does not enter any formulas)
|
DaterAge
web_production: 380
Weight: -0.207437366708906 The difference between the current date and the date of the document as determined by the dater. 1 means the document's date equals the current date; 0 means the document is 10 years old or more. If the date is not determined, the factor equals 0. Note: ((1 - DaterAge)*60)^2 = the age of the page in days.
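The relation ((1 - DaterAge)*60)^2 = age in days can be checked with a small Python sketch. The forward function follows the stated identity directly; the inverse and its clamping to [0, 1] are an assumption derived from the stated endpoints (1 for today, 0 for ten years or more). Function names are hypothetical.

```python
import math


def dater_age_to_days(dater_age: float) -> float:
    """Age of the page in days, from the stated identity."""
    return ((1.0 - dater_age) * 60.0) ** 2


def days_to_dater_age(age_days: float) -> float:
    """Assumed inverse: 1 - sqrt(days) / 60, clamped to [0, 1]."""
    value = 1.0 - math.sqrt(max(age_days, 0.0)) / 60.0
    return min(1.0, max(0.0, value))


print(dater_age_to_days(1.0))   # a document dated today
print(dater_age_to_days(0.0))   # 3600 days, roughly ten years
print(days_to_dater_age(900))   # 900 days old maps to the midpoint
```

Because of the square root, the factor changes quickly for fresh documents and slowly for old ones. A value of 0 is ambiguous: it covers both very old documents and documents with no detected date.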
|
IsHardPessimization
web_production: 381
Hard pessimization (a.k.a. PR = 0). A binary factor, computed in Antispam
|
CInDegree1
web_production: 382
Host factors that detect sites artificially boosted with links: the second and third incoming degrees ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAformula/tekushhiekomponenty/antispam?v=181rh58958953))
|
CInDegree2
web_production: 383
Weight: 0.000692523218694 Host factors that detect sites artificially boosted with links: the second and third incoming degrees ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAformula/tekushhiekomponenty/antispam?v=181rh58958953))
|
NumNonRussianLinks
web_production: 384
The number of incoming links without Russian letters. Remapped.
|
TextMaxForms
web_production: 385
Weight: -0.015212586791057 The maximum number of forms over all words of the request: the max over all request words of number_of_forms_for_word/64
|
TextWeightedForms
web_production: 386
Weight: 0.022803839020796 The sum of the numbers of forms weighted by word weights: the sum over all request words of number_of_forms_for_word/64*word_weight; remap of the form x/(1 + x).
|
TextForms
web_production: 387
Weight: -0.008656938143421 The unweighted sum of the numbers of forms: the sum over all request words of number_of_forms_for_word/64/number_of_words
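The three form-count aggregations above can be sketched in Python as follows. The inputs (a per-word count of forms and a per-word weight) are assumptions about what the factors receive; how forms are counted is not described here, and the function names are hypothetical.

```python
def text_max_forms(form_counts: list[int]) -> float:
    """Max over request words of forms/64."""
    return max(form_counts, default=0) / 64


def text_weighted_forms(form_counts: list[int], word_weights: list[float]) -> float:
    """Weighted sum of forms/64, then the x/(1 + x) remap."""
    x = sum(n / 64 * w for n, w in zip(form_counts, word_weights))
    return x / (1 + x)


def text_forms(form_counts: list[int]) -> float:
    """Unweighted sum of forms/64, divided by the number of request words."""
    if not form_counts:
        return 0.0
    return sum(n / 64 for n in form_counts) / len(form_counts)


forms = [2, 4]          # invented counts for a two-word request
weights = [0.5, 0.25]   # invented word weights
print(text_max_forms(forms))
print(text_weighted_forms(forms, weights))
print(text_forms(forms))
```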
|
LinkMaxForms
web_production: 388
The maximum number of forms in all words of the request
|
LinkWeightedForms
web_production: 389
Weight: 0.096811143316269 The sum of the numbers of forms weighted by word weights
|
LinkForms
web_production: 390
The unweighted sum of the numbers of forms
|
TR_W1
web_production: 391
Analogues of the factors of the same name, the weight of the word = 1
|
XLR_W1
web_production: 392
Analogues of the factors of the same name, the weight of the word = 1
|
TextBM25_Fm_W1
web_production: 393
Analogues of the factors of the same name, the weight of the word = 1
|
TextBM25_Sy_W1
web_production: 394
Analogues of the factors of the same name, the weight of the word = 1
|
LinkBM25_W1
web_production: 395
Analogues of the factors of the same name, the weight of the word = 1
|
TLBM25_W1
web_production: 396
Analogues of the factors of the same name, the weight of the word = 1
|
QSegmentsBreaks
web_production: 397
Weight: 0.017641843798363 Request segments are parts of the request that are themselves frequent requests. The factor shows how often the segments are broken up in the text: 0 means all words are found only within the indicated segments, 1 means all occurrences break the segments
|
AliceMusicTrackLyricsCMMatchTop5AvgMatch
web_production: 398
The value of the CMMatchTop5AvgMatch factor for the AliceMusic stream
|
NumeralsPortion
web_production: 399
The shares of different parts of speech in the text. The share of numerals (among all words whose part of speech was recognized)
|
ParticlesPortion
web_production: 400
Weight: -0.012429221647235 The share of particles
|
AdjPronounsPortion
web_production: 401
Weight: -0.005976754416269 The share of pronominal adjectives
|
AdvPronounsPortion
web_production: 402
Weight: -0.001250755074786 The share of pronominal adverbs
|
VerbsPortion
web_production: 403
The share of verbs
|
FemAndMasNounsPortion
web_production: 404
Weight: 0.011650367441796 The share of words that can be both masculine and feminine nouns, but not neuter, among all nouns (examples: 'hummingbirds' is a case of indeterminate gender that can be resolved either way; 'Alexander' is homonymy).
|
LinkQualityFixed
web_production: 405
Weight: 0.013112575551553 Quality of incoming links (hauser classifier), corrected
|
HasLinkQualityFixed
web_production: 406
Whether LinkQuality was computed for this page (it is not computed if there are few links), corrected
|
NewLinkQualityFixed
web_production: 407
Weight: 0.021178675054476 Quality classifier of incoming links, version 2, corrected
|
IsOrg
web_production: 408
Weight: -0.018278527670779 The request is the name of the organization (example: Gazprom, Gazprom) ((http://wiki.yandex-team.ru/arsengadzhikurbanov/warees Description))
|
AliceMusicArtistNameCMMatchTop5AvgMatchValue
web_production: 409
The value of the CMMatchTop5AvgMatchValue factor for the AliceMusic stream
|
LongestText
web_production: 410
Weight: 0.069696682544392 The size of the largest text segment (from the factor [18] puretext)
|
SmartUkrainian
web_production: 411
|
SmartBelorussian
web_production: 412
|
LRWithoutRare
web_production: 413
Weight: -0.011221458184058 Link relevance without taking into account rare words
|
DifferentInternalLinks
web_production: 414
Weight: 0.096447224363928 The number of different internal links to the page
|
HasDeterminedCities
web_production: 415
Weight: 0.165031403865939 The city is defined for the site
|
GeoRegionalityUNew
web_production: 416
Request factors: the output of the ((http://wiki.yandex-team.ru/PoiskovajaPlatforma/Lingvistika/ZaprosnyjeFactory/LocalizovannyjeZaprosy classifier of request geolocalization)), a new version of factors [328]-[330]. U: localizing the request is meaningless;
|
GeoRegionalityRNew
web_production: 417
Request factors: the output of the ((http://wiki.yandex-team.ru/PoiskovajaPlatforma/Lingvistika/ZaprosnyjeFactory/LocalizovannyjeZaprosy classifier of request geolocalization)), a new version of factors [328]-[330]. R (geo-relevant): regional results in the search results could be useful, but nothing more;
|
GeoRegionalityVNew
web_production: 418
Request factors: the output of the ((http://wiki.yandex-team.ru/PoiskovajaPlatforma/Lingvistika/ZaprosnyjeFactory/LocalizovannyjeZaprosy classifier of request geolocalization)), a new version of factors [328]-[330]. V (geo-vital): regional results are of fundamental importance.
|
AliceMusicArtistNamePerWordCMMaxPredictionMin
web_production: 419
The value of the PerWordCMMaxPredictionMin factor for the AliceMusic stream
|
UkrainPageRank
web_production: 420
Weight: 0.087122791007993 Ukrainian Page Rank
|
QClassDownload
web_production: 421
= 1 - v. Download formula. Class requests: download/watch online/play/photo/listen
|
QClassBrandnames
web_production: 422
The result of the classifier of the request - in the request there are words from the corresponding dictionary. brand
|
QClassDisease
web_production: 423
Medication Dictionary
|
QClassKak
web_production: 424
question
|
QClassMoscow
web_production: 425
Specific request for Moscow
|
QClassOAO
web_production: 426
Weight: -0.005085205304656 organization
|
QClassPorno
web_production: 427
porn
|
QClassTravel
web_production: 428
trips
|
VideoRating
web_production: 429
The popularity of the video clip; the value comes from Video
|
PeriodicLinkDatesPercent
web_production: 430
Weight: 0.013900531929943 The periodicity of links to the site
|
LinkAlmostPeriod
web_production: 431
The number of almost-periodic links
|
QDOwnerStatPower
web_production: 432
Weight: -0.025355498987515 The number of shows of the owner for the request, normalized as x/(100 + x).
|
QUrlStatPower
web_production: 433
Weight: -0.194376876842978 The number of shows of the URL for the request, normalized as x/(100 + x).
|
HasLiRuCounter
web_production: 434
The presence of a LiveInternet counter
|
OwnerReqsPopularity
web_production: 435
Weight: 0.209508533629415 The popularity of the owner in requests
|
DssmYaMusicASREarlyBindingCe
web_production: 436
A DSSM model with early binding, trained on reformulations and fine-tuned on ASR hypotheses of music requests to Alice
|
DssmBertDistillSinsigCeCountryRegChain
web_production: 437
A model trained on a PRS log pool to predict the output of a BERT model that was trained on sinsig_ce with a threshold value of 0.5, using a chain of regions up to the country
|
DssmYaMusicEarlyBindingCe
web_production: 438
A DSSM model with early binding, trained on reformulations and fine-tuned on music requests to Alice
|
SecondIndegDistrXi
web_production: 439
Weight: -0.01085051113308 Eleven factors based on the statistical properties of the distributions of the incoming degrees of the vertices that link to a fixed vertex of the host graph. ((Http://wiki.yandex-team.ru/jandekpoisk/kachestvopoiska/obshayaformula/tekushhiekmponenty/HostdDEGRE))
|
PiracyDetectorPredict
web_production: 440
The value of the piracy detector, computed in Begemot.
|
AliceMusicUrlTypeIsAlbum
web_production: 441
The type of the canonicalized Yandex Music URL is album
|
FirstValidTs10Days
web_production: 442
Computed as (10 - x), where x is the age of the document in days (continuous), measured from the document's first-valid time in Samovar
|
HostInQuery
web_production: 443
The host of the document is recognized in the request
|
VitalHostInQuery
web_production: 444
URL consists only of the host, which is recognized in the request
|
YandexNewsStoryUrl
web_production: 445
The URL is a Yandex News story
|
RcSpylogUrlRationalSigmoidD1T240
web_production: 446
URL feature computed from rapid clicks spy_log counters with decay of 1 day
|
RcSpylogUrlRationalSigmoidD1T240Frozen
web_production: 447
URL feature computed from rapid clicks spy_log counters with decay of 1 day
|
RcSpylogUrlRationalSigmoidD0_5T30
web_production: 448
URL feature computed from rapid clicks spy_log counters with decay of 0.5 days
|
RcSpylogUrlRationalSigmoidD0_5T30Frozen
web_production: 449
URL feature computed from rapid clicks spy_log counters with decay of 0.5 day
|
Timestamp
web_production: 450
Computed as (80 - x) / 80, where x is the age of the document in hours. The factors make sense only for the fast-robot base (the last 80 hours). Not used in ranking. Used in disconnecting.
|
AddTimeFull
web_production: 451
Computed as (80 - x) / 80, where x is the age of the document in hours. The factors make sense only for the fast-robot base (the last 80 hours). Not used in ranking. Used in disconnecting.
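The linear freshness formula shared by Timestamp and AddTimeFull can be illustrated with a short Python sketch. The function name is hypothetical; no clamping is applied because the description only defines the value inside the 80-hour window.

```python
def freshness(age_hours: float, window_hours: float = 80.0) -> float:
    """(80 - x) / 80, where x is the document age in hours."""
    return (window_hours - age_hours) / window_hours


for hours in (0, 20, 80):
    print(hours, freshness(hours))
```

A document older than the window would get a negative value under this literal reading, which matches the remark that the factors are meaningful only for the last 80 hours.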
|
Swbm25
web_production: 452
Weight: 0.019740981979634 An elaborate BM25 in a sliding window. The window size is set in sentences. 'Jokers' are used for headings and the beginning of the document. Morphological proximity and the structure of the text are taken into account. The weight of the window decays with distance from the beginning of the document.
|
PositionLanguageModel
web_production: 453
Weight: -0.032269052994315 A factor estimating whether a good snippet can be produced.
|
TxtPair_W1
web_production: 454
Weight: -0.016932610010322 Simple BM25 over pairs of words: we take all pairs of request words and count their occurrences in the text of the document. Weight = 1. It does not work if there is a stop word in the request
|
AuraDocLogShared
web_production: 455
Weight: -0.097686304848915 Logarithm of the number of shingles on which this document is not unique
|
AuraDocLogAuthor
web_production: 456
Weight: -0.097277529611975 Logarithm of the number of shingles on which this owner of the document is recognized as the author
|
AuraDocMeanSharedWeight
web_production: 457
Weight: -0.110593487056685 The average weight of the non-unique shingles of this document
|
MarketQualityRating
web_production: 458
Mascot feature MarketQualityRating
|
Medical2HostQuality
web_production: 459
Medical host quality for new marks.
|
Medical2HostQualityFresh
web_production: 460
Medical host quality for new marks for experiments.
|
FinLawHostQuality
web_production: 461
Finance or law host quality for new marks.
|
FinLawHostQualityFresh
web_production: 462
Finance or law host quality for new marks for experiments.
|
SosHostQuality
web_production: 463
Finance or law host quality for new marks.
|
SosHostQualityFresh
web_production: 464
Finance or law host quality for new marks for experiments.
|
CsDocumentationHost
web_production: 465
Factor for host in list of documentation cs hosts for experiments
|
Remved_466
web_production: 466
|
RegHostRank
web_production: 467
Weight: 0.156712439907419 Computed in the same way as the HostRank factor, but on a subgraph of the owner graph consisting of the owners in this region rather than on the whole graph. Belonging to the region is determined by the TLD, or by the presence in the index of pages of this owner that the GEO or Geoa classifier assigns to this region. Mapped in the same way as the HostRank factor, from 0 to 1 with 256 gradations
|
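The mapping "from 0 to 1 with 256 gradations" mentioned for RegHostRank can be pictured as one-byte quantization. The sketch below is only an illustration: it assumes uniformly spaced levels, which the description does not state, and the function names are hypothetical.

```python
def quantize_256(value: float) -> int:
    """Map a value to one of 256 levels (0..255), clipping to [0, 1] first.

    Assumption: levels are uniformly spaced; the source only says
    "from 0 to 1 with 256 gradations".
    """
    clipped = min(max(value, 0.0), 1.0)
    return round(clipped * 255)


def dequantize_256(level: int) -> float:
    """Map a level in 0..255 back to a value in [0, 1]."""
    return level / 255
```

With this scheme a stored factor takes exactly one byte, at the cost of a resolution of about 0.004.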
RegIsWiki
web_production: 468
The document is from the Wikipedia language edition that corresponds to the user's region
|
LanguageCompliance
web_production: 469
Weight: 0.054576897612176 The language of the document corresponds to the language language
|
CountryPopularQ
web_production: 470
The popularity of the request within the country
|
CountryQDiversity
web_production: 471
Weight: 0.03718037385465 The degree of concentration of the locations from which the query is issued (within the country)
|
CountryQDiversity2
web_production: 472
Weight: -0.00120970063307 Geographical distribution of the request within the country
|
CountryHour
web_production: 473
The hour at which this query is issued most often
|
CountryHourDiversity
web_production: 474
How pronounced the variation in query volume is across different times of day (within the country)
|
Removed_475
web_production: 475
|
NationalDomain
web_production: 476
The country of the document (domain) and the country of the user coincide ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayaFormula/tekushhiekomponijafaktorov#national
|
IsPornoAdvert
web_production: 477
The page contains porn advertising
|
RcSpylogUrlRationalSigmoidD3T120
web_production: 478
URL feature computed from rapid clicks spy_log counters with decay of 3 days
|
CountryQueryRegionality
web_production: 479
Weight: 0.012081787040108 Country-level localization classifier: how strongly the query implies the context of the country
|
NumSlashes
web_production: 480
Weight: 0.050576094170344 The number of slashes in Url
|
BM25FdPR_obsolete
web_production: 481
Weight: 0.054156294329288 BM25 with different parameters for different fields, including incoming anchor text. The weight of the text of links pointing to the page is normalized depending on the delta PageRank of the links
|
WatchVideo
web_production: 482
The presence of a built-in video player on the page
|
DownloadVideo
web_production: 483
Video for downloading
|
RcSpylogUrlRationalSigmoidD3T120Frozen
web_production: 484
URL feature computed from rapid clicks spy_log counters with decay of 3 days
|
RcSpylogUrlRationalSigmoidD14T300
web_production: 485
URL feature computed from rapid clicks spy_log counters with decay of 14 days
|
SubRelevance
web_production: 486
A service factor that was needed for site search and will still be needed in the future.
|
GskUrlModel
web_production: 487
Weight: 0.013412340418363 The factor is computed from the text of the URL using the sequence classifier Quality/Seq/GSK
|
UrlTrigrams
web_production: 488
Weight: 0.064310714968383 A model in which each trigram is trained on '+' and '-' URLs. It does not depend on the query.
|
RcSpylogUrlRationalSigmoidD14T300Frozen
web_production: 489
URL feature computed from rapid clicks spy_log counters with decay of 14 days
|
RcSpylogAge
web_production: 490
Age of rapid clicks spy_log update, in seconds
|
RcSpylogFreshness
web_production: 491
Freshness of rapid clicks spy_log update
|
YmwFull
web_production: 492
Weight: -0.044940112806396 The size of the smallest piece of text that contains all the query words found in the document. Not used now. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAformula/tekushhiekomponenty/ymw Read more))
|
Bclm
web_production: 493
Weight: 0.030786458206337 Buettcher, Clarke and Lushman factor (modified) ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAformula/tekushichiekomponenty/bclm more))
|
QueryCommercialityMx
web_production: 494
Weight: 0.103903118421863 A measure of how 'commercial' the query is. A composite factor computed by a Matrixnet formula from the purchase vocabulary in Direct, user queries, and additional intent dictionaries. For queries with intent to buy the factor tends to 1; for product queries, to 0.6; for queries with no intent to buy (reviews, etc.), to 0. ((http://wiki.yandex-team.ru/Faktorydljanovogokatorazaprosov Factors of the classifier)) ((http://wiki.yandex-team.ru/jandekspoisk/antispam/antiseo/klassifikATORCHESSKIXZAPROSOV its structure))
|
FieldLM
web_production: 495
Weight: 1.36522746e-7 Unigram language model. The language model is built from the document and smoothed with the general language model. When building the document model, information about the field of the document in which the query word occurred (Title, Head or Plain Text) is used
|
GeoCityUrlRegionCity
web_production: 496
Match between the geography determined from the URL of the document and the city of the query (IP or LR)
|
GeoCityUrlRegionRegion
web_production: 497
Match between the geography determined from the URL of the document and the region of the query (IP or LR)
|
GeoCityUrlRegionCountry
web_production: 498
Weight: -0.168645758020604 Match between the geography determined from the URL of the document and the country of the query (IP or LR). Relevant for Russia and Ukraine.
|
GeoCityUrlGeoCityCity
web_production: 499
Match between the geography determined from the URL of the document and the city in the query (GEOCITY rule)
|
PayAppDetectorPredict
web_production: 500
The value of the chopped commerce detector, calculated in the hippo.
|
TitleTrigramsQuery
web_production: 501
Weight: 0.112928770384249 Computes how well the query is covered by the letter trigrams of the document title
|
TitleTrigramsTitle
web_production: 502
Computes how well the document title is covered by the letter trigrams of the query
|
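One plausible reading of the two title trigram factors above is a set-coverage ratio over letter trigrams, taken in each direction. The sketch below illustrates that reading only; the exact normalization, lowercasing and handling of spaces are assumptions, and the function names are hypothetical.

```python
def letter_trigrams(text: str) -> set[str]:
    """Return the set of 3-character substrings of the lowercased text."""
    lowered = text.lower()
    return {lowered[i:i + 3] for i in range(len(lowered) - 2)}


def trigram_coverage(target: str, source: str) -> float:
    """Share of the target's unique trigrams that also occur in the source."""
    target_set = letter_trigrams(target)
    if not target_set:
        return 0.0
    return len(target_set & letter_trigrams(source)) / len(target_set)


def title_trigrams_query(query: str, title: str) -> float:
    """How much of the query is covered by the title (assumed direction)."""
    return trigram_coverage(query, title)


def title_trigrams_title(query: str, title: str) -> float:
    """How much of the title is covered by the query (assumed direction)."""
    return trigram_coverage(title, query)
```

Working on letter trigrams rather than whole words makes such a measure tolerant of inflection and small spelling differences.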
InlinksModel
web_production: 503
Probabilistic model built on the texts of incoming links
|
QueryWordSequencesTR
web_production: 504
Weight: -0.11860635115951 Computes a sum over sequences of more than two query words that occur within one sentence; normalized by the length of the document.
|
QueryWordSequencesLR
web_production: 505
Computes a sum over sequences of more than two query words that occur within one link; normalized by the number of links.
|
OwnerNavQuota
web_production: 506
Weight: 0.189743110446303 The share of clicks for navigation requests
|
GeoRelevAlienCity
web_production: 507
Weight: 0.084699401575226 The result has a geography of the user at the city level ([415] == 1 && [215] == 0)
|
GeoVQueryInUserCity
web_production: 508
Request geovitality for results from the user region
|
GeoVQueryInAlienCity
web_production: 509
Request geovitality for results not from the user region
|
HostReliability
web_production: 510
Weight: -0.045942748393758 The share of URLs that respond without errors
|
DmozThemeMatchAll
web_production: 511
Match between the thematic spectra (according to DMOZ) of the query and the document. The theme of the query is determined by ((http://wiki.yandex-team.ru/jandekspoisk/zarubezhnyjjinternet/dmozqueryClassifier1 the DmozTheme wizard rule))
|
DmozThemeMatchBest
web_production: 512
Match between the thematic spectra (according to DMOZ) of the query and the document. The theme of the query is determined by the best result of ((http://wiki.yandex-team.ru/jandekspoisk/zarubezhnyjjinternet/dmozqueryClassifier1 the DmozTheme wizard rule)). The theme of the document is determined by the automatic classifier
|
Mpsa
web_production: 513
Weight: 0.093045433292429 Evaluates the minimum distance between pairs of query words, taking into account how far the pair is from the beginning of the document (Minimal Pair Size with Attenuation). Pairs are all consecutive bigrams of query words, so the number of pairs equals the number of query words minus 1. Accordingly, the factor only makes sense for queries of more than one word. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayaformula/Tekushhiekomponenty/MPSA MPSA))
|
Bclm2
web_production: 514
It differs from BCLM in that the weights of all words are considered the same. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAformula/tekushhiekomponenty/bclm2 BCLM2))
|
AbsolutePLM
web_production: 515
Text relevance based on a language model that takes absolute position into account. We slide a window of 20 words over the text, build a language model on each window (that is, a probability distribution over the words of the Russian language) and compute the probability of generating the query. The model is penalized for distance from the beginning of the document.
|
PageRegionCoverage
web_production: 516
Weight: -0.063761467432684
|
PageRegionSize
web_production: 517
Weight: -0.030877746812643 The size of the page region
|
PageRegionRelCoverage
web_production: 518
Weight: -0.000832706989751
|
RcSpylogFreshnessAtReq
web_production: 519
Freshness of rapid clicks spy_log update, calculated at the request time
|
IsGeo
web_production: 520
Weight: -0.027287688639737 Passes to the base search, under the name ISGEO, the maximum weight of a geo-object found in the query. A geo-object is an object of category GEO, Geo1, Geoaddr, Geoaddr1, Landmark or Landmark1 (see ((http://wiki.yandex-team.ru/alekseysokirko/queryobjects query object extraction))). ((http://wiki.yandex-team.ru/arsengadzhikurbanov/wares Read more))
|
IsMusic
web_production: 521
Passes to the base search, under the name ISMUSIC, the maximum weight of an object of category Music or Music1 found in the query (see ((http://wiki.yandex-team.ru/alekseysokirko/queryobjects SOM-OV))). ((http://wiki.yandex-team.ru/arsengadzhikurbanov/warees more))
|
BclmLite
web_production: 522
Modification of the BCLM2 factor, lightweight for use in tulle. The main difference is that BCLMLite does not use absolute offsets of words relative to the beginning of the document. Instead, the factor works with ordinary positions of the form <sentence_number, position_in_sentence>. Proximity between words is taken into account only within a sentence. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayaFormula/tekushichiekomponenty/bclmlite bclmlite))
|
NearbyQuery
web_production: 523
For this query, results in the immediate vicinity of the user matter ([pharmacies], [children's clinic])
|
CityQuery
web_production: 524
Weight: -0.091993052812036 For this query, results within the city matter (the bulk of localized queries)
|
AdmQuery
web_production: 525
For this query, results from the user's region matter ([airport], [dairy])
|
NumLinksFromMP
web_production: 526
The number of incoming links from main pages
|
YmwFull2
web_production: 527
Weight: -0.044940112806396 Fixed YMWFull. It differs from the previous version only in its behavior on 2-word queries. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAformula/tekushhiekomponenty/ymw Read more))
|
FullQuorum
web_production: 528
Binary factor, every word of the request is in the text or in the links
|
AuxCTextBM25
web_production: 529
'Country praets' (AUXQC)
|
AuxCLinkBM25
web_production: 530
'Country praets' (AUXQC)
|
Soft404
web_production: 531
Page - '404' (share of tokens '404' in relation to the total number of tokens on the page)
|
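The Soft404 description (share of '404' tokens among all tokens on the page) reduces to a simple ratio. The sketch below assumes tokens are runs of word characters; the real tokenizer is not described in the source, and the function name is hypothetical.

```python
import re


def soft404_share(page_text: str) -> float:
    """Share of tokens equal to '404' among all tokens of the page.

    Assumption: a token is a maximal run of word characters.
    """
    tokens = re.findall(r"\w+", page_text)
    if not tokens:
        return 0.0
    return sum(token == "404" for token in tokens) / len(tokens)
```

A short error page such as "Error 404 page 404" scores high, while a long article that merely mentions the number scores close to zero.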
RcSpylogUrlRationalSigmoidD1T240AtReq
web_production: 532
URL feature computed at the request time from rapid clicks spy_log counters with decay of 1 day
|
DBM25
web_production: 533
BM25 in which the word weights are machine-learned
|
QueryWordCohesionTR
web_production: 534
Weight: -0.053739168786067 Evaluates how closely the query words are grouped together in the text of the document, regardless of their order. ((http://wiki.yandex-team.ru/sergejjkrylov/queryWordCohesionTR Description))
|
OwnerSessNormDuration_Reg
web_production: 535
ND/K normalized time to click
|
RcSpylogUrlRationalSigmoidD0_5T30AtReq
web_production: 536
URL feature computed at the request time from rapid clicks spy_log counters with decay of 0.5 days
|
QueryDOwnerSessNormDuration_Reg
web_production: 537
CONTRY / K
|
QueryDOwnerWeightClick_Reg
web_production: 538
Weight: 0.115262514353577 w/k
|
QueryDOwnerOnlyClickRate_Reg
web_production: 539
Weight: 0.179216994410993 o/i
|
QueryDOwnerClickSummary_Reg
web_production: 540
Weight: 0.054680076158058 Selected formula
|
QueryDOwnerSatisfied4Rate_Reg
web_production: 541
Weight: 0.07148176099275 r_s4b/(r_k + 10)
|
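The expression `r_s4b/(r_k + 10)` given for QueryDOwnerSatisfied4Rate_Reg has the shape of a smoothed ratio: the constant 10 in the denominator pulls the value toward 0 when the counts are small. The source does not define `r_s4b` or `r_k`, so the sketch below treats them as opaque non-negative counters; the function name is hypothetical.

```python
def satisfied4_rate(r_s4b: float, r_k: float) -> float:
    """Evaluate r_s4b / (r_k + 10) exactly as written in the description.

    The meaning of the two counters is not given in the source; they are
    assumed here to be non-negative numbers.
    """
    return r_s4b / (r_k + 10)


# With few observations the value stays low even if the raw ratio is 1;
# with many observations it approaches the raw ratio.
few = satisfied4_rate(2, 2)
many = satisfied4_rate(2000, 2000)
```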
SegmentAuxAlphasInText
web_production: 542
Weight: 0.010581678208134 Number of letters in the AUX segment
|
SegmentAuxSpacesInText
web_production: 543
Weight: -0.011681967583253 The number of spaces in the AUX segment
|
SegmentContentCommasInText
web_production: 544
The number of commas in the Content segment
|
IsShop
web_production: 545
Weight: -0.133931985443449 The page is a store. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAformula/tekushhiekomponenty/opisanijafaktorov#SSHOP Description)). Not used (deprecated)
|
XLRGeoRelevRegionNatDomain
web_production: 546
Weight: 0.013370500669584
|
AuraDocLogOrigin
web_production: 547
Logarithm of the number of shingles in the document that the site owner added as original texts in ((http://wiki.yandex-team.ru/jandekspoisk/jekosistema/marketingPr/webmasters/plan/vtorcontect the originality plugin)). It does not participate in the formula; it is needed for separating duplicates
|
AuraDocMeanFltAuthorSource
web_production: 548
The average filtered number of authorship sources of the document. It does not participate in the formula; it is needed for separating duplicates
|
QueryRefTrigramQuery
web_production: 549
Weight: 0.054926147793071 ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAformula/tekushhiekomponenty/opisanijafaktorov#queryreftrigrams Description))
|
QueryRefTrigramReferences
web_production: 550
Weight: -0.096496414873675 ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAformula/tekushhiekomponenty/opisanijafaktorov#queryreftrigrams Description))
|
IdfVariance
web_production: 551
Weight: 0.025691573951246 Variance of the IDFs of the words
|
UrlNGramsModel
web_production: 552
Weight: 0.055185094441888 Urlngramsmodel ranking factor in ERF
|
NationalLanguage
web_production: 553
The language of the document corresponds to the country of the query
|
OwnerIsCommercial
web_production: 554
|
GeoCountryUrlRegionCountry
web_production: 555
|
GeoCountryUrlGeoCountry
web_production: 556
|
NumLinksFromSegmentContent
web_production: 557
Weight: 0.094045741102708
|
Locm
web_production: 558
Weight: -0.070483297609751 The order of words in links.
|
UrlQueryVariety_Reg
web_production: 559
Weight: -0.020628033510418 The degree of variety of the queries for which this URL is clicked, computed per region
|
UrlSessNormDurRate_Reg
web_production: 560
Weight: 0.025328925792111 nd/i
|
FiltrationSegments
web_production: 561
The share of the segments of the request present in the text
|
LanguageGoodForTurkey
web_production: 562
The language of the document is one of those acceptable for Turkey (Turkish, English, German, French, Arabic, Azerbaijani), or the document has zero length. At the search stage it is computed only for Isrealgeolocal queries.
|
DBM25_2
web_production: 563
A variation on the theme of ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayaformula/tekushhiekomponenty/DBM25 dBM25)); see the linked page.
|
GeoDispersion
web_production: 564
Document links dispersion
|
QueryDownerEnoughClicked
web_production: 565
Weight: -0.118870879105496 Both the number of clicks on the owner and the number of clicks for the query exceed 5
|
BM25FdPRFixed
web_production: 566
Weight: 0.058870258158539 BM25FDPR normalized by the average document length, which depends on the language of the document. ((http://wiki.yandex-team.ru/bm25frework Test results))
|
LanguagePopularity
web_production: 567
The popularity of the language of the document. Number from 0 to 1. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayaformula/tekushhiekomponenty/languaguaguagepopalarity))
|
QueryDOwnerWeightedSumFRCAndBM25FdPRFixed
web_production: 568
Weight: 0.087850313290757 The weighted sum of the factors QueryDownerClicksFRC and BM25FDPRFIXED with weights 0.358449 and 0.184922, respectively. The '565' in the name of the factor should not be taken literally; it is legacy or a typo.
|
QueryDOwnerWeightedSumMaxWHRAndOnlyClickRate
web_production: 569
Weight: 0.152953808712409 The weighted sum of factors 192 and 341 with weights 0.298942 and 0.454625, respectively.
|
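Factors 568 and 569 are both described as sums of two other factors taken with fixed coefficients. The arithmetic can be sketched as below; only the coefficients come from the descriptions above, while the input factor values and the helper name are made up for illustration.

```python
def weighted_sum(values, weights):
    """Return sum(value * weight) over paired inputs of equal length."""
    if len(values) != len(weights):
        raise ValueError("values and weights must have the same length")
    return sum(value * weight for value, weight in zip(values, weights))


# Coefficients quoted in the descriptions of factors 568 and 569.
WEIGHTS_568 = (0.358449, 0.184922)
WEIGHTS_569 = (0.298942, 0.454625)

# Hypothetical input factor values, chosen only to show the computation.
combined_568 = weighted_sum((1.0, 1.0), WEIGHTS_568)
combined_569 = weighted_sum((0.5, 0.2), WEIGHTS_569)
```

Since neither pair of coefficients sums to 1, such a combination is a linear blend and not an average of its inputs.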
RcSpylogUrlRationalSigmoidD3T120AtReq
web_production: 570
URL feature computed at the request time from rapid clicks spy_log counters with decay of 3 days
|
RcSpylogUrlRationalSigmoidD14T300AtReq
web_production: 571
URL feature computed at the request time from rapid clicks spy_log counters with decay of 14 days
|
Tocm
web_production: 572
Weight: -0.005028751679547 Evaluates how the order of words in the title differs from their order in the query
|
RelevGeoLinksPercent
web_production: 573
Weight: -0.069803680024687
|
LangDispersion
web_production: 574
Dispersion of languages in XMAP
|
HasMisspell
web_production: 575
There is a typo in the request
|
DBM30Smerch
web_production: 576
A variation on the theme of ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayaformula/tekushhiekomponenty/DBM25 dBM25)); see the linked page.
|
IsUrlForClickDeboost
web_production: 577
The URL is known to be shown too often despite very low relevance (according to BERT and/or BM25)
|
UrlLinkPercent
web_production: 578
Weight: 0.089404211238337 The ratio of the number of incoming links whose text is a URL to the total number of incoming links
|
DssmBertDistillL2
web_production: 579
A pool of logs is marked with BERT trained on Sinsig. DSSM model is trained on this pool using BaseregionChain
|
NumNonLettersInUrl
web_production: 580
Weight: -0.011207582653854 The number of non-letter characters in the URL
|
UrlLen2
web_production: 581
Weight: 0.007908808762912 The length of the URL, accurate to the character. Disabled in production.
|
IsHub
web_production: 582
Weight: 0.097073501164592 The page is a hub
|
StaticTitleComm
web_production: 583
The degree of commerciality of the page title. Not used (deprecated)
|
StaticTitleBM25Ex
web_production: 584
Weight: 0.016179974819787 BM25 of the page title against the text of the page
|
StaticTitleLRBM25
web_production: 585
Weight: 0.038263040612831 BM25 of the page title against the texts of links to the page
|
SeoInPayLinks
web_production: 586
Weight: -0.028595315195293 The number of paid SEO links between hosts
|
USLongPeriodUrlMobileDt180Avg
web_production: 587
Static URL factor for search sessions for 1600 days calculated on mobile sessions. Average dwelltime, and DwellTime is cut from the session if more than 180 seconds
|
USLongPeriodUrlMobileLongClickProb
web_production: 588
Static URL factor for search sessions for 1600 days calculated on mobile sessions. The probability that a click on the URL lasts more than 120 seconds
|
USLongPeriodUrlMobileLossesProb
web_production: 589
Static URL factor for search sessions for 1600 days calculated on mobile sessions. The probability that the URL is not clicked given that at least one URL below it is clicked.
|
USLongPeriodUrlMobileDt3600AvgReg
web_production: 590
Static URL factor for search sessions for 1600 days calculated on mobile sessions. Average dwelltime, and DwellTime is cut from the session if more than 3600 seconds. Localization to the level of countries.
|
USLongPeriodUrlMobileDt180AvgReg
web_production: 591
Static URL factor for search sessions for 1600 days calculated on mobile sessions. Average dwelltime, and DwellTime is cut from the session if more than 180 seconds. Localization to the level of countries.
|
HpDetectorPredict
web_production: 592
The value of the health detector calculated in the Hippo.
|
IsFeedListing
web_production: 593
OffersBase feature for ecoboost.
|
IsFeedMain
web_production: 594
OffersBase feature for ecoboost.
|
IsFeedStratocaster
web_production: 595
OffersBase feature for ecoboost.
|
IsFeedAny
web_production: 596
OffersBase feature for ecoboost.
|
TitleInLinksTrigrams
web_production: 597
Weight: -0.076334972364641 The share of unique trigrams in the trigrams of links
|
LinksInTitleTrigrams
web_production: 598
Weight: 0.019301158836494 Share of unique trigrams of links in trigrams header
|
TrashAdv
web_production: 599
The greasy of the page
|
MetrikaUrlVisits
web_production: 600
Similar to Yabarurlvisits
|
UrlGeoAdms
web_production: 601
The document URL corresponds to the user ((http://wiki.yandex-team.ru/jandekspoisk/kacheStvopoiska/geo/regnavquerispoisk/KacheStvopoiska/GEO/RENAVAVQURIES))
|
UrlGeoCity
web_production: 602
The document URL corresponds to the city of the user
|
RegNavQuery
web_production: 603
Regional navigational query: the user's region has one or more navigational results for it
|
YabarUrlLcAc
web_production: 604
Weight: -0.046030869083841 The number of sessions in which the URL was the last one, divided by the number of sessions in which the URL appeared
|
SOMaxSumSourceRank
web_production: 605
Weight: 0.061675217167197 The sum of the maximum values of Sourcerank's for each incoming link, taking into account the uniqueness of the owner.
|
DBM35
web_production: 606
Weight: 0.046757967567051 BM25 over texts and links with special weights for the level of match (exact form, lemma, synonym)
|
TRLRQuorumFm
web_production: 607
Weight: -0.062810308974889 The weight of the query words that are present in the text in the exact form
|
TRLRQuorumLemma
web_production: 608
Weight: -0.003021983245146 The weight of the query words that are present in the text, matched up to lemma
|
TRLRQuorumSyn
web_production: 609
The weight of the words of the request that is in the text
|
IsHum
web_production: 610
Weight: 0.003622338166697 Passes to the base search, under the name ISHUM, the maximum weight of an object of category Hum or Hum1 found in the query (see ((http://wiki.yandex-team.ru/alekseysokirko/queryobjects SOM-OV))). ((http://wiki.yandex-team.ru/arsengadzhikurbanov/Wares#ishum more))
|
IsText
web_production: 611
Passes to the base search, under the name ISTEXT, the maximum weight of an object of category TEXT or Text1 found in the query (see ((http://wiki.yandex-team.ru/alekseysokirko/queryobjects SOM-OV))). ((http://wiki.yandex-team.ru/arsengadzhikurbanov/Wares#istext more))
|
IsPicture
web_production: 612
Passes to the base search, under the name Ispicture, the maximum weight of an object of category Picture or Picture1 found in the query (see ((http://wiki.yandex-team.ru/alekseysokirko/queryobjects SOM-OV))). ((http://wiki.yandex-team.ru/arsengadzhikurbanov/Wares#ispicture))
|
MaxOne
web_production: 613
Weight: -0.059871381556405 Returns the maximum degree of household objects in the request under the name Wmaxone (see ((http://wiki.yandex-team.ru/alekseysokirko/queryobjects SOM-OV))). ((http://wiki.yandex-team.ru/arsengadzhikurbanov/Wares#maxone more))
|
MinOne
web_production: 614
Weight: 0.113671587879567 Returns the maximum degree of household objects in the request under the name Wminone (see ((http://wiki.yandex-team.ru/alekseysokirko/queryobjects SOM-OV))). ((http://wiki.yandex-team.ru/arsengadzhikurbanov/Wares#minone more))
|
OqBm25Str
web_production: 615
Weight: 0.125675707803009 BM25 on the request for Domattr index
|
OqBm25Lem
web_production: 616
Weight: 0.129668929638366 BM25 on the request for Domattr index
|
OqBm25Syn
web_production: 617
Weight: 0.112334631253023 BM25 on the request for Domattr index
|
OqBclmWeighted
web_production: 618
Weight: 0.105135837056982 BCLM for the Domattr Index
|
OqBclmPlain
web_production: 619
Weight: 0.254915495706702 BCLM on the request of the owners index
|
LinksAlive
web_production: 620
Estimates whether the document is 'alive', judging by the links pointing to it.
|
SmallWindow
web_production: 621
Maximum total weight of the query words within a window of 50 words
|
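The SmallWindow description (the largest total weight of query words inside a 50-word window) maps naturally onto a sliding-window sum. The sketch below is an illustration only: it assumes every occurrence of a query word contributes its weight, which the source does not specify, and the names `small_window` and `word_weights` are hypothetical.

```python
def small_window(doc_tokens, word_weights, window=50):
    """Maximum total weight of query words over all windows of `window` tokens.

    doc_tokens: list of document tokens in order.
    word_weights: mapping from query word to its weight.
    Assumption: repeated occurrences inside a window are all counted.
    """
    weights = [word_weights.get(token, 0.0) for token in doc_tokens]
    current = sum(weights[:window])  # weight of the first window
    best = current
    for i in range(window, len(weights)):
        # Slide by one token: add the entering token, drop the leaving one.
        current += weights[i] - weights[i - window]
        best = max(best, current)
    return best
```

The running sum keeps the cost linear in the document length regardless of the window size.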
MetrikaUrlVisitors
web_production: 622
Similar to Yabarurlvisitors
|
MetrikaUrlAvgTime
web_production: 623
Similar to Yabarurlavgtime
|
MetrikaUrlCoreAudience
web_production: 624
Weight: -0.057658302748215 The core audience of the page, for pages that carry a Metrika counter
|
RegexMaxClickPercent
web_production: 625
The share of clicks on this URL among all clicks for similar queries
|
RegexCtr
web_production: 626
Corrected CTR of this URL for all similar queries
|
DomPhraseClickRankBi
web_production: 627
Weight: 0.209866937086235 Click-through of the domain by bigrams (excluding thesaurus expansions of queries)
|
DomPhraseYabarBi
web_production: 628
Weight: 0.20518490511548 Transitions to the site from search engines by bigrams, according to the Bar (excluding thesaurus expansions of queries)
|
LastWordHostClicks
web_production: 629
Weight: 0.06275358178297 Click-through of the host by the last word of the query (excluding thesaurus expansions of queries)
|
HostHasFeedUrls
web_production: 630
OffersBase feature for ecoboost.
|
IsFeedOffer
web_production: 631
OffersBase feature for ecoboost.
|
HostEcomKernel1
web_production: 632
Business kernel.
|
HostEcomKernel2
web_production: 633
Business kernel.
|
HostEcomKernel3
web_production: 634
Business kernel.
|
RcSearchBaseUrlRationalSigmoidD1TM600AtReq
web_production: 635
URL feature computed at the request time from rapid clicks search counters with decay of 1 day
|
SynSetLocm
web_production: 636
Weight: -0.070483297609751 A copy of the ((http://wiki.yandex-team.ru/JandeksPoisk/KachestvoPoiska/ObshayaFormula/TekushhieKomponenty/Locm LOCM)) factor for ((http://wiki.yandex-team.ru/JandeksPoisk/KachestvoPoiska/ObshayaFormula/TekushhieKomponenty/Synset synsets)).
|
SynSetLinkBM25
web_production: 637
A copy of the LinkBM25 factor for ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayafformula/tekushhiekomponenty/synset synsets)).
|
RcSearchBaseUrlContrastD30Odd0_9_X_D30T1AtReq
web_production: 638
URL feature computed at the request time from rapid clicks search counters with decay of 30 days
|
Removed_639
web_production: 639
|
DmozQueryBestTheme
web_production: 640
Weight: -0.000807198317231 The most likely theme of the query, determined by ((http://wiki.yandex-team.ru/jandekspoisk/zarubezhnyjjinternet/dmozqueryClassifier1 the DmozTheme wizard rule)); only the most popular topics are taken into account (but more of them than in the DmozQueryThemes factor). The factor contains the probability that the query matches the theme, with each topic assigned its own interval within the segment [0..1]
|
DmozQueryThemes
web_production: 641
The theme of the query, determined by ((http://wiki.yandex-team.ru/jandekspoisk/zarubezhnyjjinternet/dmozqueryClassifier1 the DmozTheme wizard rule)); only a few of the most popular topics are taken into account.
|
DiversityCategNeedPhoto
web_production: 642
0 or 1, depending on whether the query carries the clearly expressed Need_photo intent from Diversity
|
DiversityCategNeedMap
web_production: 643
0 or 1, depending on whether the query carries the clearly expressed Need_map intent from Diversity
|
LongQuerySyn
web_production: 644
Weight: 0.058415162135787 The factor is an analogue of LongQuery (the sum of the IDFs of the query words), but with 'correct' accounting for synonyms. Specifically, for each word the minimum IDF (i.e. that of the most frequent term) among the word and its synonyms is taken.
|
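The LongQuerySyn description (a sum of query-word IDFs in which each word is represented by the smallest IDF among itself and its synonyms) can be sketched as follows. The data structures, the treatment of words with no known IDF, and the function name are assumptions made for illustration.

```python
def long_query_syn(query_words, idf, synonyms):
    """Sum over query words of the minimum IDF among the word and its synonyms.

    idf: mapping term -> IDF value.
    synonyms: mapping query word -> list of its synonyms.
    Assumption: terms missing from `idf` are skipped.
    """
    total = 0.0
    for word in query_words:
        candidates = [word] + list(synonyms.get(word, []))
        known = [idf[term] for term in candidates if term in idf]
        if known:
            # The minimum IDF belongs to the most frequent variant.
            total += min(known)
    return total
```

For example, with IDFs `{"car": 2.0, "auto": 1.5, "red": 3.0}` and "auto" listed as a synonym of "car", the query `["red", "car"]` contributes 3.0 for "red" and 1.5 for "car".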
UrlHasShortCountryNameToken
web_production: 645
The URL contains a token that coincides with the short name of the user's country. The factor is computed only on the EU stream.
|
TurkeyPageRank
web_production: 646
Personalized Turkish Pagerank
|
ExpectedFound
web_production: 647
Expected number of documents found for the query
|
FooterInLinksTrigrams
web_production: 648
The share of unique trigrams of a footer fragment in trigrams of links
|
LinksInFooterTrigrams
web_production: 649
The share of unique trigrams of links among the trigrams of a footer fragment
|
ErratumLogQueryProbability
web_production: 650
Double logarithm of the probability of the query under the language model of the Erratum typo-correction service
|
UrlIsMarketOffer
web_production: 651
URL is an offer in the latest version of the market base.
|
DBM40
web_production: 652
A variation on the theme of ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayaformula/tekushhiekomponenty/DBM25 dBM25)); see the linked page.
|
Removed_653
web_production: 653
|
BM25_0
web_production: 654
Variation on the topic BM25
|
BM25_1
web_production: 655
Variation on the topic BM25
|
BM25_0123
web_production: 656
Variation on the topic BM25
|
QueryUrlCorrectedCtr
web_production: 657
'Corrected' clicks computed using Requestaggregatelib
|
QueryUrlCorrectedCtr_Reg
web_production: 658
'Corrected' clicks computed using Requestaggregatelib. Regional version
|
YabarUrlVisits_Reg
web_production: 659
Regional visit count of the URL according to Yandex.Bar
|
MetrikaUrlHostVisitTime
web_production: 660
The average time a user spends on the host after an external entry (from another, non-search site) via a specific URL
|
MetrikaUrlHostVisitDepth
web_production: 661
The average 'depth' (the number of transitions within the host) of a user's visit to the host after an external entry (from another, non-search site) via a specific URL
|
DBMNumbers
web_production: 662
DBM separately by numbers
|
DBMGeo
web_production: 663
DBM separately by geo-objects of request
|
DBMSubstantive
web_production: 664
DBM separately by nouns
|
AvgSessionLen
web_production: 665
The average length of the logical session in which there was a request
|
NHopTextBclmWeighted
web_production: 666
BCLM (Weighted) by Hopes texts.
|
YabarUrlDownloads
web_production: 667
Estimate of the probability of downloads from the document
|
Bocm
web_production: 668
Evaluates how well the positions of words in the sentences of the document correspond to the positions of words in the query.
|
HostUserLeakage
web_production: 669
User outflow coefficient from the search after a visit to the site
|
FioMatch
web_production: 670
The document contains a name from the request.
|
IsIndexPage
web_production: 671
The page is index.(html/php/aspx?/...), without CGI parameters. Computed for all duplicates.
|
IsIndexPageSoft
web_production: 672
The page is index.(html/php/aspx?/...), possibly with CGI parameters. Computed for all duplicates.
|
IsOwner
web_production: 673
Whether the host is the owner, roughly host == Owner(host).
|
MinPathLen
web_production: 674
The minimum length of Pathandquery over all semi-duplicates.
|
XLerfGeoLRlogRelevCnt
web_production: 675
Regionalized (only links from the country of request are taken) variant of the Xlerfgeolrlogrelev factor
|
XNonCommLerfNormLRlogRelevCnt
web_production: 676
Regionalized (only links from the country of request are taken) variant of the factor XNONCOMMLERFNORMLRLOGRELAV
|
LocmCnt
web_production: 677
Regionalized (only links from the country of request are taken) Variant of Locm factor
|
XLRrelevCnt
web_production: 678
Regionalized (only links from the country of request are taken) variant of factor xlrrelev
|
XLerfLRrelev200Cnt
web_production: 679
Regionalized (only links from the country of request are taken) variant of factor Xlerflrrelev200
|
NavLinear
web_production: 680
((http://wiki.yandex-team.ru/jandekspoisk/antispam/polunavigacionnyezaprosy#faktornnostiparyurl-zapros Classifier)) of vital [query, URL] pairs: the URL is vital for the query if the factor value is > 0.5
|
RankComGoodness
web_production: 681
Classifier for estimates of commercial sites
|
HasDownloadLinkOnFile
web_production: 682
The document has a direct link to the file
|
HasDownloadLinkOnFileHosting
web_production: 683
The document has a link to filehosting
|
DiversityCategDownload
web_production: 684
0 or 1: whether the query is tagged with this intent
|
DiversityCategReview
web_production: 685
0 or 1: whether the query is tagged with this intent
|
DiversityCategWatch
web_production: 686
0 or 1: whether the query is labeled with this category tag
|
QrTur
web_production: 687
Predicted share of "good" (at least two different cities and frequency >= 10) references to the query with geography in Turkey
|
QueryThEncyclopedic
web_production: 688
Output of the lexical query classifier that predicts the probability of a click on a page of topic 3561
|
QueryThVideohosting
web_production: 689
Output of the lexical query classifier that predicts the probability of a click on a page of topic 3973
|
IsNavMxQuery
web_production: 690
Rank 'navigation'
|
QueryUrlYabarVisits_Reg
web_production: 691
Regional visits from search engines for a specific query
|
ClickedWithAnotherSEClicks
web_production: 692
Clicks on URLs shown in the results for queries after which users went on to search in other search engines
|
ShowsWithAnotherSEClicks
web_production: 693
Impressions of URLs in the results for queries after which users went on to search in other search engines
|
CommercialOwnerRank_Reg
web_production: 694
Classifier of the commerciality of the site
|
HostIsMarketOffer
web_production: 695
In the latest version of the market base there are offers from this host.
|
BclmMax
web_production: 696
Proximity of the query words to the heaviest (highest-weight) word.
|
UrlPronRegexpMatch
web_production: 697
The URL matches the regular expression configured in the daemon
|
HasUserReviews
web_production: 698
The document contains user review/comment
|
RegexMaxClickPercentReg
web_production: 699
Share of clicks on this URL among all clicks for similar queries, country version; see ((http://wiki.yandex-team.ru/development/poisk/Arcadia/indexregex Indexregex))
|
RegexCtrReg
web_production: 700
Corrected CTR of this URL over all similar queries, country version; see ((http://wiki.yandex-team.ru/development/poisk/Arcadia/indexregex Indexregex))
|
Found
web_production: 701
Average number of documents found for the query
|
YabarWordDepthNodesGradientMin
web_production: 702
The angle in Depth-Nodes space, computed over words only (minimum over all words)
|
DBM15Wares
web_production: 703
|
RankComGoodnessBar
web_production: 704
Classifier that approximates the quality of commercial sites based on user behavior data
|
DocCreateMonth
web_production: 705
Document creation time with month precision: 1.0 is the current month, 0 is 10 years ago or older. Temporarily disabled
|
DocUpdateMonth
web_production: 706
Document update time with month precision: 1.0 is the current month, 0 is 10 years ago or older. Temporarily disabled
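
The descriptions of DocCreateMonth and DocUpdateMonth give only the two endpoints of the scale (1.0 for the current month, 0 for 10 years ago or older). The sketch below assumes a linear scale between them, which the source does not confirm; `month_freshness` and `MAX_AGE_MONTHS` are hypothetical names.

```python
from datetime import date

MAX_AGE_MONTHS = 120  # 10 years, the lower bound named in the description


def month_freshness(doc_date: date, today: date) -> float:
    """Map a document date to [0, 1] with month granularity.

    Assumption: the value decays linearly with age in months;
    only the endpoints (1.0 and 0) come from the factor description.
    """
    age_months = (today.year - doc_date.year) * 12 + (today.month - doc_date.month)
    # Clamp future dates to "current month" and very old dates to 10 years.
    age_months = min(max(age_months, 0), MAX_AGE_MONTHS)
    return 1.0 - age_months / MAX_AGE_MONTHS
```

The day of the month is ignored on purpose, since the factor is described as having month precision.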
|
XLRSourceRank
web_production: 707
|
XLRMainPage
web_production: 708
|
DaterStatsYearNormLikelihood
web_production: 709
Likelihood function of the distribution of years in the document. Temporarily disabled
|
HostNumSovetnik
web_production: 710
Number of Sovetnik URLs
|
LcmVar
web_production: 711
Variance of the number of words in the links.
|
DaterStatsAverageSourceSegment
web_production: 712
The arithmetic mean position of dates in the document. Temporarily disabled
|
DBM15Wares2
web_production: 713
|
Cabm
web_production: 714
BM with attenuation in the text of catalog links.
|
BeastNqUrlMeanPos
web_production: 715
Average position of the URL for the normalized query
|
BeastNqOwnerMeanPos
web_production: 716
Average position of the Domattr for the normalized query
|
BeastUrlMeanPos
web_production: 717
Average position of the URL over all queries
|
BeastHostMeanPos
web_production: 718
Average position of the host over all queries
|
BeastUrlNumQueries
web_production: 719
Number of requests for URL
|
BeastHostNumQueries
web_production: 720
Number of requests for host
|
YabarHostBrowseRank_Reg
web_production: 721
Implementation of the algorithm described in the article ((http://research.microsoft.com/en-us/people/tyliu/fp032-liu.pdf)), computed over large regions (tube)
|
Removed_722
web_production: 722
|
SegmentWordPortionFromMainContent
web_production: 723
Share of the document's words that come from segments with score > 2.
|
UrlDomainSimilarityFixed
web_production: 724
|
TotalDups
web_production: 725
|
RankBoostGoodness
web_production: 726
The rank of site quality used for boosts of the Moscow commercial formula
|
QueryDOwnerClicksFRCRegGeo
web_production: 727
|
QueryURLClicksFRCRegGeo
web_production: 728
|
LanguageDistribution
web_production: 729
|
UrlShowsWithNextPageClicksP1
web_production: 730
|
UrlShowsWithNextPageClicksP10
web_production: 731
The factor is used in Selectionrank. TG_UNUSED: should not be included in the formulas to avoid feedback
|
QueryURLClicksPCTRYear
web_production: 732
|
QueryURLClicksPCTRPreviousYear
web_production: 733
|
SmallWindowAttenuation
web_production: 734
|
RcSearchBaseUrlRationalSigmoidD3T120AtReq
web_production: 735
URL feature computed at the request time from rapid clicks search counters with decay of 3 days
|
OwnerCTRWithNextPageClicksP10
web_production: 736
|
CommRus
web_production: 737
Weight of the document according to a single-word dictionary of commercial vocabulary
|
WikiLinkCount
web_production: 738
|
UrlInLinksTrigramsStatic
web_production: 739
|
LinksInUrlTrigramsStatic
web_production: 740
|
UkrIsQueryLang
web_production: 741
Indicates that the query is in Ukrainian
|
QueriesAvgCM2
web_production: 742
Average query commerciality
|
QiQueryCount
web_production: 743
Number of queries in the group of frequent queries similar to the given one
|
QiUrlFreqWeightedFRC
web_production: 744
FRC over the group of frequent queries similar to the given one, averaged via the sums of clicks and impressions
|
QiUrlFreqWeightedFRCReg
web_production: 745
FRC over the group of frequent queries similar to the given one, averaged via the sums of clicks and impressions, according to regional statistics
|
RcSearchBaseUrlRationalSigmoidD1TM600Frozen
web_production: 746
URL feature computed from rapid clicks search frozen counters with decay of 1 day
|
WordHostWikiSum
web_production: 747
Relative popularity of the Word-Host pair, where Word is a word from the title of a Wikipedia article and Host is a host that the article links to.
|
RegWordHostClicksSum
web_production: 748
Relative clickability of the CountryId-Word-Host triple according to Yandex search data.
|
RegWordHostYabarSum
web_production: 749
Relative clickability of the CountryId-Word-Host triple according to bar and Similargroup data for popular search engines.
|
RegexMaxClickPercentYabarReg
web_production: 750
Share of clicks on this URL among all clicks for similar queries, computed from popular search engine data
|
YabarHostSurfTrDpNdLeafLn
web_production: 751
Length of the leaf in Depth-Nodes space, computed for hosts
|
YabarHostSurfTrNdTmGrDsp
web_production: 752
Variance of the angle in Nodes-Time space, computed for hosts
|
YabarHostSurfTrNdTmLeafLn90
web_production: 753
0.9-quantile of the leaf length in Nodes-Time space, computed for hosts
|
WordHostDownloadProbability
web_production: 754
Query-averaged probability of downloading a file from the host after a click.
|
NastyContent
web_production: 755
Content ugliness factor.
|
SynnormURLPCTR
web_production: 756
CTR according to click data; the query is normalized by synsets
|
SynnormURLPCTRReg
web_production: 757
Regional CTR according to click data; the query is normalized by synsets
|
UrlQueryTrigramsStatic
web_production: 758
Static trigram intersection of the URL and the queries by which users visited the URL.
|
AdvAspam
web_production: 759
|
HasPornoQuery
web_production: 760
Output of the adult rules for the wizard.
|
QUBm15Weighted
web_production: 761
Weighted BM15 of the query against an index document that consists of the list of queries from which users navigated to the page.
|
WeightedSumIsIndexPageBocm
web_production: 762
|
WeightedSumIsIndexPageIsNavMxQuery
web_production: 763
|
BrowserHostDownloadProbability
web_production: 764
Probability of a download from the host after a click (from bar logs).
|
NHopChainsCountFrc
web_production: 765
Number of chains for the query / (number of chains in which the URL participated + number of chains for the query).
|
NHopIsFinal
web_production: 766
Number of chains in which the URL was the last one, normalized by the total number of chains in which this URL appeared.
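
The two chain ratios (NHopChainsCountFrc and NHopIsFinal) can be written out as a minimal sketch. The function and parameter names are hypothetical, and returning 0.0 for an empty denominator is an assumption; the source does not say how missing data is handled.

```python
def nhop_chains_count_frc(query_chains: int, url_chains: int) -> float:
    """query_chains / (url_chains + query_chains), per the description.

    query_chains: number of chains for the query.
    url_chains: number of chains in which the URL participated.
    """
    denom = url_chains + query_chains
    # Assumption: no chains at all yields 0.0 rather than an error.
    return query_chains / denom if denom else 0.0


def nhop_is_final(final_chains: int, url_chains: int) -> float:
    """Share of the URL's chains in which it was the last element."""
    # Assumption: a URL seen in no chains yields 0.0.
    return final_chains / url_chains if url_chains else 0.0
```

For example, with 30 chains for the query and 10 chains containing the URL, the first ratio is 30 / 40 = 0.75.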
|
VisitsFromWiki
web_production: 767
Number of transitions to URL from Wikipedia
|
RcSearchBaseUrlContrastD30Odd0_9_X_D30T1Frozen
web_production: 768
URL feature computed from rapid clicks search frozen counters with decay of 30 days
|
RegBrowserUserHub
web_production: 769
Indicator of how hub-like the page is (how many pages bar users navigate to from it).
|
AuxTitleBM25
web_production: 770
TextBM25 computed over the title for the name of the user's region; similar to factor 268.
|
Bclmf
web_production: 771
BCLM for Annotation index, doc text and links.
|
NoProductsProbability
web_production: 772
DSSM prediction, from URL + title, of the probability that there is no product on the page.
|
PopularSEFRCBrowser
web_production: 773
FRC for a popular search engine, from browser logs
|
LogCtrMean
web_production: 774
Weighted mean of log(query_clicks)/log(query_shows) for given host. Weights are proportional to log(query_shows) + 0.2.
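
The LogCtrMean formula above can be made concrete with a short sketch. Several details are assumptions, since the description leaves them open: the natural logarithm is used, queries with zero clicks or fewer than two impressions are skipped because the ratio is undefined for them, and a host with no usable queries gets 0.0. The name `log_ctr_mean` is hypothetical.

```python
import math
from typing import Iterable, Tuple


def log_ctr_mean(stats: Iterable[Tuple[int, int]]) -> float:
    """Weighted mean of log(clicks)/log(shows) over a host's queries.

    stats: (query_clicks, query_shows) pairs for one host.
    Each query is weighted by log(query_shows) + 0.2.
    """
    weighted_sum = 0.0
    weight_total = 0.0
    for clicks, shows in stats:
        # Assumption: skip queries where the ratio is undefined
        # (log(0) for zero clicks, log(shows) == 0 for a single show).
        if clicks < 1 or shows < 2:
            continue
        log_shows = math.log(shows)
        weight = log_shows + 0.2
        weighted_sum += weight * (math.log(clicks) / log_shows)
        weight_total += weight
    return weighted_sum / weight_total if weight_total else 0.0
```

Because the weight grows with log(query_shows), frequently shown queries dominate the mean, while the + 0.2 term keeps rarely shown queries from being ignored entirely.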
|
QueryUrlNhopTotalFrc
web_production: 775
Number of transitions from the query to the URL found in hop chains, normalized by the total number of transitions for the query.
|
QueryUrlNhopIsFinal
web_production: 776
Probability that the URL is the last one in a hop chain for the query.
|
OneProductProbability
web_production: 777
DSSM prediction, from URL + title, of the probability that there is exactly one product on the page.
|
ManyProductsProbability
web_production: 778
DSSM prediction, from URL + title, of the probability that there are many products on the page.
|
RcSearchBaseUrlRationalSigmoidD3T120Frozen
web_production: 779
URL feature computed from rapid clicks search frozen counters with decay of 3 days
|
GeoCityUrlHasCity
web_production: 780
A city-level geo-binding is determined for the URL according to the rules of BUKI-1125
|
GeoCityUrlHasCountry
web_production: 781
A country-level geo-binding is determined for the URL according to the rules of BUKI-1125
|
GeoRelevRegionCityGeoa
web_production: 782
GeoRelevRegionCity factor by the geoa attribute
|
GeoRelevRegionRegionGeoa
web_production: 783
GeoRelevRegionRegion factor by the geoa attribute
|
GeoGeometryProximGeoa
web_production: 784
GeoGeometryProxim factor by the geoa attribute
|
GeoRelevAlienCityGeoa
web_production: 785
GeoRelevAlienCity factor by the geoa attribute
|
GeoVQueryInUserCityGeoa
web_production: 786
GeoVQueryInUserCity factor by the geoa attribute
|
GeoVQueryInAlienCityGeoa
web_production: 787
GeoVQueryInAlienCity factor by the geoa attribute
|
PageRegionSizeGeo
web_production: 788
PageRegionSize factor by the geo attribute
|
PageRegionCoverageGeo
web_production: 789
PageRegionCoverage factor by the geo attribute
|
PageRegionCoverageAdresa
web_production: 790
PageRegionCoverage factor by the adresa attribute
|
GeoRelevRegionCityAdresa
web_production: 791
GeoRelevRegionCity factor by the adresa attribute
|
DoppQueryUrlSessionClicksFRC
web_production: 792
Share (on average per session) that this URL makes up of the URLs clicked for this query. Computed over user sessions.
|
OwnerIsActualShop
web_production: 793
The owner is a store
|
OwnerIsService
web_production: 794
The owner is a service
|
NHopTextBclmPlane
web_production: 795
BCLM (Plane) over texts from hops.
|
SameQueryReturnFRCBrowser
web_production: 796
FRC over transitions from queries that the user issued several times
|
QueryURLISBMCTR
web_production: 797
Average weight of impressions on the first page; a click weighs 1, a non-click is weighted according to the SBM_GAMMAS table
|
QueryURLISBMCTRReg
web_production: 798
Average weight of impressions on the first page; a click weighs 1, a non-click is weighted according to the SBM_GAMMAS table. Regional version
|
RegexBeastPositionReg
web_production: 799
Half-sum of the URL's position estimate and the median position over all similar queries, according to Beast data
|
RcSpylogHostRationalSigmoidD3T0AtReq
web_production: 800
Host feature computed at the request time from rapid clicks spy_log counters with decay of 3 days
|
RcSpylogHostRationalSigmoidD3DTM3600AtReq
web_production: 801
Host feature computed at the request time from rapid clicks spy_log counters with decay of 3 days
|
RcSpylogHostRationalSigmoidD14T0AtReq
web_production: 802
Host feature computed at the request time from rapid clicks spy_log counters with decay of 14 days
|
RcSpylogHostRationalSigmoidD14DTM3600AtReq
web_production: 803
Host feature computed at the request time from rapid clicks spy_log counters with decay of 14 days
|
RcSpylogHostRationalSigmoidedCTRD3DT0TM3600AtReq
web_production: 804
Host feature computed at the request time from rapid clicks spy_log counters with decay of 3 days
|
RcSpylogHostRationalSigmoidedCTRD14DT0TM3600AtReq
web_production: 805
Host feature computed at the request time from rapid clicks spy_log counters with decay of 14 days
|
RcSpylogHostRationalSigmoidD3T0Frozen
web_production: 806
Host feature computed from rapid clicks spy_log counters with decay of 3 days
|
RcSpylogHostRationalSigmoidD3DTM3600Frozen
web_production: 807
Host feature computed from rapid clicks spy_log counters with decay of 3 days
|
RcSpylogHostRationalSigmoidD14T0Frozen
web_production: 808
Host feature computed from rapid clicks spy_log counters with decay of 14 days
|
RcSpylogHostRationalSigmoidD14DTM3600Frozen
web_production: 809
Host feature computed from rapid clicks spy_log counters with decay of 14 days
|
RcSpylogHostRationalSigmoidedCTRD3DT0TM3600Frozen
web_production: 810
Host feature computed from rapid clicks spy_log counters with decay of 3 days
|
RcSpylogHostRationalSigmoidedCTRD14DT0TM3600Frozen
web_production: 811
Host feature computed from rapid clicks spy_log counters with decay of 14 days
|
CommercialDssmOddLike
web_production: 812
Finetuned reformulations DSSM to commercial clicked bargain odd-like target from visit log
|
DistributorHosts
web_production: 813
Is legal video distributor
|
OneProductProbabilityAvg
web_production: 814
Average value of feature OneProductProbability
|
ManyProductsProbabilityAvg
web_production: 815
Average value of feature ManyProductsProbability
|
PayDetectorPredictAvg
web_production: 816
Average value of feature PayDetectorPredict
|
OwnerIsPartner
web_production: 817
The owner is a partner
|
ShopInShopUrl
web_production: 818
The document is a ShopInShop page
|
QueryConversionDetectorPredict
web_production: 819
Query conversion value computed in Hippo.
|
FioFromOriginalRequestBodyChain0Wcm
web_production: 820
Factor for the person's name (FIO) from the original query, computed over the document body. Algorithm: Chain0Wcm
|
ProductOfferAnyAvailable
web_production: 821
At least one offer from the parsed schema has an "available" status.
|
ProductOfferNoProducts
web_production: 822
There is not a single offer in the parsed schema.
|
BadYtierUrl
web_production: 823
The URL from Ytier is known to have low-quality content
|
NormYtierUrl
web_production: 824
The URL from Ytier is known to have content of acceptable quality
|
GoodYtierUrl
web_production: 825
The URL from Ytier is known to have good-quality content
|
BestYtierUrl
web_production: 826
The URL from Ytier is known to have excellent-quality content
|
HostIsEcomPurchase
web_production: 827
The host has an ecom purchase.
|
HostIsVisitLogsPurchase
web_production: 828
The host has a purchase according to the visit log.
|
YandexMarketProductUrl
web_production: 829
URL is a product on the market.
|
YandexMarketProductIncludeOfferidUrl
web_production: 830
URL is a product on the market and has an OfferId.
|
ShopInShopCPAUrl
web_production: 831
URL is a ShopInShop CPA page.
|