Tag: TG_REARR_USE
(728 ranking factors)
Factors |
---|
TR
web_production: 1
Text relevance (MaxFreq is the frequency of the most frequent word, which serves as a measure of document length).
|
TRp1
web_production: 4
Strict priority for TR (a text priority): all the query words occur somewhere in the document (and satisfy the query's contextual restrictions, for example, both words must be in one sentence).
|
TRp2
web_production: 5
Weight: -0.109820338929289 Phrase priority for TR (a text priority): all the query words occur consecutively in the document.
|
TRhr
web_production: 9
There was a passage that passed the quorum in which all word positions are marked as having Best_relev relevance (title or meta keywords).
|
News
web_production: 11
This is news (determined by ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayaformula/tekushichiekomponenty/klassificacionnye?v=tkd#h45859-3 patterns in the URL))).
|
Long
web_production: 15
Weight: -0.084798680877042 Long document (the longer the document, the greater the value of the factor).
|
TRhitw
web_production: 16
HitWeight: a variant of text relevance in which the weights of all hits are considered equal (i.e., bonuses for title and word proximity are not taken into account). The hits must still pass the restrictions of the syntactic wizard, i.e. we can assume that the TRhitw factor is 0 if and only if Softandok is 0.
|
LongQuery
web_production: 17
Weight: 0.030334786608805 Sum of the IDFs of the query words. The name is misleading: for example, for the query 'Gadyach' this factor will be larger than for the query 'Moscow Peter Yekaterinburg Samara'.
|
PureText
web_production: 18
Long text without links.
|
Root
web_production: 19
This is the main page (site root).
|
HasLR
web_production: 34
URL High LR.
|
PopularQ
web_production: 38
The popularity of the request
|
AddTime
web_production: 41
Weight: 0.006691168756865 Time the page was added; a larger value means an older document. The square root is taken of the time mapped to the interval [0, 1], so that 3+ years gives 1.
|
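The AddTime remap described above could be sketched as follows. This is only an illustration: the function name, the day-based input, the exact saturation point of 3 * 365 days, and the order of operations (scale first, then square root) are assumptions, since the source gives only the verbal description.

```python
import math

# Assumption: "3+ years" is taken as 3 * 365 days.
THREE_YEARS_DAYS = 3 * 365


def add_time_factor(age_days: float) -> float:
    """Map a page age in days to [0, 1] and take the square root.

    Ages of three years or more saturate at 1.
    """
    scaled = min(max(age_days, 0.0) / THREE_YEARS_DAYS, 1.0)
    return math.sqrt(scaled)
```

Under these assumptions the square root makes the factor grow quickly for young pages and flatten out as the page approaches three years.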
IsMainPage
web_production: 42
If this is the main page of the owner (most often a second-level domain, for example xxxx.ru), the factor is 1. For free hosting, personal blogs, etc. (for example LiveJournal, People.ru, etc.), third-level domains (such as xxxxx.narod.ru) also get a factor of 1.
|
AddTimeMP
web_production: 43
Time the main page of the owner (host?) was added. Remapped in the same way as AddTime.
|
QueryURLClicksPCTR
web_production: 45
How often this URL is clicked for this query: CTR adjusted by a correction factor.
|
YandexAdv
web_production: 51
Weight: -0.094261219650513 On the site there is an advertisement for Yandex.
|
NoSpam
web_production: 52
The spam classifier for Picks from Antispam recognized the site as not (!) spam, i.e. 0 = spam, 1 = good.
|
TxtHead
web_production: 56
Weight: -0.037878046829073 BM25 computed over headings only.
|
WordCount
web_production: 59
min(number of query words / 10, 1.f)
|
InvWordCount
web_production: 60
1 / number_of_words_in_query
|
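The two word-count factors above are simple enough to write out directly. A minimal sketch, with hypothetical function names; the behavior for an empty query is an assumption, as the source does not define it.

```python
def word_count_factor(num_query_words: int) -> float:
    """WordCount: min(number of query words / 10, 1)."""
    return min(num_query_words / 10, 1.0)


def inv_word_count_factor(num_query_words: int) -> float:
    """InvWordCount: 1 / number of query words.

    Assumption: an empty query yields 0.0 rather than raising an error.
    """
    if num_query_words <= 0:
        return 0.0
    return 1.0 / num_query_words
```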
HasNoLR
web_production: 62
The document has no LR.
|
HasNoQueryURLShows
web_production: 63
No click information for this URL for this query: 1 = the query or the query-URL pair is absent from the click database, 0 = the query-URL pair is in the click database.
|
HasNoQueryShows
web_production: 64
Weight: 0.205699196177282 No click information for this query: 1 = the query is absent from the click database, 0 = the query is in the click database.
|
Hops
web_production: 65
The number of hops to the URL (the fewer hops, i.e. the closer to the main page, the lower the value: 0 = the main page, 1 = cannot be reached from the main page, 0 < reachable from the main page < 1). The normal value for the root of a host is 0.0039.
|
TxtPairEx
web_production: 67
Weight: -0.00667940021707 the presence of pairs of words in the exact form
|
TxtBreakEx
web_production: 68
Weight: 0.024006117828321 the number of sentences in which there are many words in the exact form
|
TxtHeadEx
web_production: 69
Weight: -0.03957553241619 the presence of words in the header in the exact form
|
TxtBm25Ex
web_production: 71
Simple BM25 in the exact form.
|
TxtHeadSy
web_production: 74
Weight: -0.012919083353605 the presence of words in the header, taking into account synonyms
|
TxtBm25Sy
web_production: 76
Simple BM25 taking into account synonyms.
|
QueryDOwnerClicksPCTR
web_production: 77
Weight: 0.219595036178226 How often the URLs of this Domainid are clicked for this query: the Domainid CTR adjusted by a correction factor.
|
XLRgood
web_production: 84
Weight: -0.00083343707893 What is the share of “good” links
|
IsBlog
web_production: 96
Page from a blog hosting service
|
IsLivejournal
web_production: 97
Page from livejournal.com
|
TextFeatures
web_production: 100
Weight: -0.016033504310566 Text quality. Computed by a rather complex formula.
|
TextLike
web_production: 101
Weight: -0.094096848692163 Text quality (classifier Alekseeva)
|
YaBarCoreHost
web_production: 105
The core audience of the host according to Yandex.Bar usage data
|
MusicQ
web_production: 108
How music-related the query is. The output of Anton Konygin's wizard.
|
DocLen
web_production: 110
Weight: -0.065128132003719 Document length in sentences
|
UrlLen
web_production: 111
Weight: -0.001158034315755 The length of the URL, divided by 5
|
QueryNonCommerciality
web_production: 112
Commerciality of the query according to the dictionary of phrases from Direct: 0 = most commercial, 1 = least commercial.
|
XNonCommLRlogRelev
web_production: 123
Link relevance, taking into account the non-commerciality of each link
|
LinksWithAllWordsPercent
web_production: 129
Weight: -0.08383112850758 The percentage of incoming links with all the words of the request
|
PornoQuery
web_production: 130
Whether the query contains any words from Yweb/Pornofilter/Porno.query.
|
IsPorno
web_production: 131
Document from porn kitski
|
IsFake
web_production: 133
Fast document
|
IsWiki
web_production: 135
page from ru.wikipedia.org
|
IsEShop
web_production: 136
Commercial page (Classifier Savina)
|
GeoRegionProxim
web_production: 137
Weight: 0.082967074248567
|
NumWordsTRSy
web_production: 139
The percentage of query words found in the document (matching up to synonyms)
|
HasAllWordsTRFm
web_production: 149
The document contains all the query words (matching up to the word form)
|
QDiversity
web_production: 150
Weight: 0.046783126435468 The degree of concentration of the locations from which the query is issued
|
XLerfGeoLRlogRelev
web_production: 153
Weight: 0.044511155721215 log (leerflr, narrowed to the country of the user)
|
NonCommercialQuery
web_production: 154
Binary non-commercial query flag: QueryNonCommerciality > 0.965.
|
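The binarization above amounts to a single threshold comparison. A minimal sketch; the function and constant names are hypothetical, and only the 0.965 threshold and the strict inequality come from the description.

```python
# Threshold taken from the factor description; the name is an assumption.
NON_COMMERCIAL_THRESHOLD = 0.965


def non_commercial_query(query_non_commerciality: float) -> int:
    """Return 1 when QueryNonCommerciality is strictly above the threshold."""
    return 1 if query_non_commerciality > NON_COMMERCIAL_THRESHOLD else 0
```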
QueryURLClicksFRC
web_production: 168
The ratio of the number of clicks on this URL to all clicks for the query
|
QueryDOwnerClicksFRC
web_production: 169
Weight: 0.214713693660762 The ratio of the number of clicks on this Domainid to all clicks for the query
|
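The two FRC (fraction of clicks) factors above share the same shape: clicks on one key (a URL or a Domainid) divided by all clicks for the query. A minimal sketch, assuming the click counts for one query are available as a plain dictionary; the data layout, the function name and the zero-clicks fallback are illustrative assumptions, not the production click database.

```python
def clicks_frc(clicks_for_query: dict, key: str) -> float:
    """Fraction of all clicks for a query that went to `key`.

    `clicks_for_query` maps a URL (or a Domainid) to its click count.
    Assumption: a query with no clicks at all yields 0.0.
    """
    total = sum(clicks_for_query.values())
    if total == 0:
        return 0.0
    return clicks_for_query.get(key, 0) / total
```

For example, with three clicks on one URL and one on another, `clicks_frc({"a.ru/x": 3, "b.ru/y": 1}, "a.ru/x")` evaluates to 0.75.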
QueryURLClicksPCTR_copy
web_production: 170
[Bug: a copy of factor 45] How often this URL is clicked for this query: CTR adjusted by a correction factor.
|
DoppQueryUrlSessionClicksFRCCity
web_production: 171
What share (on average per session) of the URLs the user clicked through to is this URL. Computed over user sessions.
|
QueryURLClicksPCTR_Reg
web_production: 172
How often this URL is clicked for this query: CTR adjusted by a correction factor, computed per small region from Relev_regions.web.txt
|
QueryURLClicksFRC_Reg
web_production: 174
Weight: 0.023610887210981 The ratio of the number of clicks on this URL to all clicks for the query, computed per small region from Relev_regions.web.txt
|
QueryDOwnerClicksFRC_Reg
web_production: 175
Weight: 0.118638180985299 The ratio of the number of clicks on this Domainid to all clicks for the query, computed per small region from Relev_regions.web.txt
|
ExactWordOrderLen
web_production: 180
The length of the longest match of exact word forms between the text and the query
|
ExactWordOrderWeight
web_production: 181
The weight of the longest match of exact word forms between the text and the query
|
LinkMaxAge
web_production: 184
The maximum age of a significant cluster of links that contributed something to LR
|
PassageLegacyTR
web_production: 190
Weight: 0.038806477920761 TR of the best passage: how good a snippet it makes
|
IsForum
web_production: 196
The URL matches the forum_detector regular expression
|
GeoRelevRegionRegion
web_production: 216
|
GeoRelevRegionCountry
web_production: 217
Weight: 0.084012276385059 Three levels of coincidence of the geography of the user and page
|
XLRGeoRelevRegionCity
web_production: 218
|
XLRGeoRelevRegionRegion
web_production: 219
|
GeoCountryProxim
web_production: 221
Weight: 0.01317157982937 Geographical proximity
|
IsNavQuery
web_production: 222
Whether the query is navigational, judged by clicks on the results
|
MaxWordHostYaBar
web_production: 223
Weight: 0.315439457304752 The query word most characteristic of the site, according to Yandex.Bar data
|
QueryDOwnerYabarAvgTime
web_production: 228
Weight: 0.122090633457258 Average over users of the active continuous time (in seconds) the user spends on the host's pages after arriving from the search engine for the query (the factor depends on the pair (query, Domattr)).
|
QueryDOwnerYabarAvgTime2
web_production: 229
Average over users of the active continuous time (in seconds) the user spends on the host's pages after arriving from the search engine for the query (the factor depends on the pair (query, Domattr)). Based on the internal Yandex.Bar/Elements/Browser counter.
|
QueryDOwnerYabarAvgActions
web_production: 230
Average over users of the number of active actions (clicks, key presses) during the user's continuous stay on the host's pages after arriving from the search engine for the query (the factor depends on the pair (query, Domattr)). Based on the internal Yandex.Bar/Elements/Browser counter.
|
QueryUrlYabarVisitors
web_production: 232
The number of unique visitors from search engines for a specific request
|
IsForeignQuery
web_production: 241
Request is not in Russian
|
IsForeignCluster
web_production: 242
foreign cluster document
|
GeoGeometryProxim
web_production: 247
Weight: -0.000843495929565 The geographical proximity of the user and the site
|
YabarHostInternalTraffic
web_production: 251
Weight: 0.071417326810502 The share of visits to the site that did not come via links (typed by hand or opened from bookmarks)
|
YabarHostAvgTime
web_production: 252
Weight: -0.007634608393132 Average over users of the active continuous time (in seconds) the user spends on the host's pages
|
YabarHostAvgTime2
web_production: 253
Weight: 0.074172193125966 Average over users of the active continuous time (in seconds) the user spends on the host's pages. Based on the internal Yandex.Bar/Elements/Browser counter.
|
YabarUrlAvgTime
web_production: 258
Weight: 0.003890338237824 Average over users of the time the user spends on the page. Computed as the difference between adjacent transitions.
|
UrlQueryVariety
web_production: 261
The degree of variety of the queries for which this URL is clicked
|
AuxTextBM25
web_production: 268
BM25 over the name of the user's region for localized queries; for non-localized queries in KUB, the country is used. The query texts sent for the regions can be viewed in Relev_regions.txt in the wizard.
|
AuxLinkBM25
web_production: 269
The same for link relevance
|
TovarCategoryQuery
web_production: 273
The query mentions a product category. Not used (deprecated).
|
TovarCategoryVendor
web_production: 274
The query mentions a vendor. Not used (deprecated).
|
Diversity2
web_production: 275
Weight: 0.001181036676865 Geographical distribution of the request
|
NightQuery
web_production: 276
The query is issued mainly at night
|
MorningQuery
web_production: 277
Weight: -0.013510450334814 The query is issued mainly in the morning
|
DayQuery
web_production: 278
The query is issued mainly during the day
|
EveningQuery
web_production: 279
The query is issued mainly in the evening
|
HourDiversity
web_production: 280
How unevenly the query is issued across different times of day
|
LCor
web_production: 281
Weight: 0.038372460585705 Characterizes the frequency of words in links. The factor is large if the word that contributed to link relevance is rare in links.
|
XPornoNormLRlogRelev
web_production: 290
How pornographic the document is according to the text of its links, with a different normalization
|
XPornoQuery
web_production: 291
Porn query classifier, using a different dictionary than PornoQuery
|
UrlDomainFraction
web_production: 294
Weight: 0.564095297143887 Trigram coverage between the domain and the query. (Chelyabinsk lottery - chelloto. We transliterate the query, find the trigrams that are covered (che, hel, lot, olo), and look at what share of all trigrams is covered.)
|
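The trigram coverage described above could be sketched like this. It is an illustration under stated assumptions: the query is already transliterated, trigrams are taken per query word, and the share is computed over the trigrams of the domain name. The source does not say which side is the denominator, and all function names here are hypothetical.

```python
def char_trigrams(text: str) -> set:
    """Return the set of character trigrams of a string."""
    return {text[i:i + 3] for i in range(len(text) - 2)}


def url_domain_fraction(domain_label: str, translit_query: str) -> float:
    """Share of the domain's trigrams that also occur in the query words.

    Assumption: coverage is normalized by the number of distinct
    domain trigrams.
    """
    domain_tris = char_trigrams(domain_label)
    if not domain_tris:
        return 0.0
    query_tris = set()
    for word in translit_query.split():
        query_tris |= char_trigrams(word)
    return len(domain_tris & query_tris) / len(domain_tris)
```

With the hypothetical transliteration "chelyabinsk loto" and the domain label "chelloto", four of the six domain trigrams (che, hel, lot, oto) are covered, so the sketch yields 4/6.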
UrlPathAndParamsFraction
web_production: 295
Weight: -0.162220616846705 The same as the previous factor, but for the entire URL except the domain
|
VideoQuery
web_production: 307
Request about the video
|
Poetry
web_production: 319
The poetry of the document
|
PoetryQuad
web_production: 320
The maximum poetry of the quatrain
|
EngLang
web_production: 321
Document language - English
|
Has2ExactQueryParts
web_production: 322
The query is fully covered by two exact groups, each consisting of exact matches of consecutive query words ((http://wiki.yandex-team.ru/poiskovajaplatform/tr/coveragebygroups about coverage by groups))
|
HasLevensht1QueryFragment
web_production: 323
There is a group consisting of exact matches of query words that covers the query (possibly with one word omitted, added or replaced)
|
CyrLang
web_production: 327
The language of the document is Cyrillic
|
SynFLremap1
web_production: 335
Weight: 0.002431406823392 Shows how unnatural the text is from the point of view of the Russian language. An estimate of how likely it is that the document text was produced by a synonymizer or generated automatically. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAfermula/tekushhiekomponenty/antispam?v=1il#h58953-2 more))
|
SynFLremap2
web_production: 336
Weight: 0.08033186404617 Shows how unnatural the text is from the point of view of the Russian language. An estimate of how likely it is that the document text was produced by a synonymizer or generated automatically. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAfermula/tekushhiekomponenty/antispam?v=1il#h58953-2 more))
|
OwnerSessNormDuration
web_production: 337
Weight: 0.126700168643196 ND/K normalized time to click
|
UrlSessNormDurRate
web_production: 338
Weight: 0.025806639721603 nd/i
|
QueryDOwnerSessNormDuration
web_production: 339
CONTRY / K
|
QueryDOwnerWeightClick
web_production: 340
Weight: 0.202186193546053 w/k
|
SyntQuality
web_production: 344
Weight: 0.010872234578071 Whether the query has a complete syntactic parse
|
HasTextPos
web_production: 350
The document has textual relevance
|
SynNumBadWordPairs
web_production: 354
The share of bad pairs among all pairs found in the table: Z/(X+1), where Z is the number of bad pairs in the text and X is the number of ((http://wiki.yandex-team.ru/evgenijgrechnikov/testsynonimizers 2000-navigable)) pairs
|
NormalTextIdfSumFixed
web_production: 360
Previous factors - fixed
|
QueryURLClicksCombo
web_production: 361
A factor combined in a clever way from FRC and pseudo-CTR
|
PercentWordsInLinks
web_production: 367
Weight: 0.057053549836014 The percentage of the number of words inside the tag <a> .. </a> from the number of all words
|
MatrixNet
web_production: 379
Weight: 0.114624515228977 MatrixNet applied to all factors, i.e. the formula (tg_unized, so that it does not enter any formulas)
|
DaterAge
web_production: 380
Weight: -0.207437366708906 The difference between the current date and the document date determined by the dater: 1 = the document date equals the current date, 0 = the document is 10 years old or more; if the date is not determined, the factor is 0. Note: ((1 - DaterAge) * 60)^2 = age of the page in days.
|
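The relation ((1 - DaterAge) * 60)^2 = age in days can be inverted to get the factor from an age. The sketch below assumes that this inverse is how the factor is produced and that values are clipped at 0; the function names are hypothetical. Note that 60^2 = 3600 days, which is close to the "10 years or more" bound in the description.

```python
import math
from typing import Optional


def dater_age(age_days: Optional[float]) -> float:
    """DaterAge from a document age in days; 0.0 when the date is unknown."""
    if age_days is None:
        return 0.0
    # Inverse of ((1 - f) * 60) ** 2 = age_days, clipped at 0 (assumption).
    return max(0.0, 1.0 - math.sqrt(max(age_days, 0.0)) / 60.0)


def age_days_from_dater_age(factor: float) -> float:
    """Recover the page age in days, as stated in the description."""
    return ((1.0 - factor) * 60.0) ** 2
```

Because an undetermined date also maps to 0, the value 0 alone cannot distinguish a very old document from one without a date.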
TextWeightedForms
web_production: 386
Weight: 0.022803839020796 The sum of the numbers of forms weighted by word weights: the sum over all query words of number_of_forms_for_word / 64 * word_weight; remapped as x/(1 + x).
|
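The TextWeightedForms formula above can be written out as a short function. A minimal sketch: the input layout (one pair of form count and weight per query word) and the function name are assumptions made for illustration.

```python
def text_weighted_forms(words: list) -> float:
    """Sum n_forms / 64 * weight over query words, then remap x -> x / (1 + x).

    `words` is a list of (number_of_forms, word_weight) pairs,
    one pair per query word (hypothetical layout).
    """
    x = sum(n_forms / 64 * weight for n_forms, weight in words)
    return x / (1 + x)
```

The x/(1 + x) remap keeps the result in [0, 1) for non-negative inputs, however many forms are found.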
LinkWeightedForms
web_production: 389
Weight: 0.096811143316269 The sum of the numbers of forms, weighted by word weights
|
TR_W1
web_production: 391
Analogue of the factor of the same name, with word weight = 1
|
TextBM25_Fm_W1
web_production: 393
Analogue of the factor of the same name, with word weight = 1
|
IsOrg
web_production: 408
Weight: -0.018278527670779 The request is the name of the organization (example: Gazprom, Gazprom) ((http://wiki.yandex-team.ru/arsengadzhikurbanov/warees Description))
|
LongestText
web_production: 410
Weight: 0.069696682544392 The size of the largest text segment (from the factor [18] puretext)
|
HasDeterminedCities
web_production: 415
Weight: 0.165031403865939 The city is defined for the site
|
GeoRegionalityUNew
web_production: 416
Query factors: the output of the ((http://wiki.yandex-team.ru/PoiskovajaPlatforma/Lingvistika/ZaprosnyjeFactory/LocalizovannyjeZaprosy query geolocalization classifier)), a new version of factors [328]-[330]. U: geo-independent queries, for which regional sites are meaningless;
|
GeoRegionalityRNew
web_production: 417
Query factors: the output of the ((http://wiki.yandex-team.ru/PoiskovajaPlatforma/Lingvistika/ZaprosnyjeFactory/LocalizovannyjeZaprosy query geolocalization classifier)), a new version of factors [328]-[330]. R: geo-relevant queries, for which regional results in the search results could be useful, but nothing more;
|
GeoRegionalityVNew
web_production: 418
Query factors: the output of the ((http://wiki.yandex-team.ru/PoiskovajaPlatforma/Lingvistika/ZaprosnyjeFactory/LocalizovannyjeZaprosy query geolocalization classifier)), a new version of factors [328]-[330]. V: geo-dependent queries, for which regional results are of fundamental importance.
|
QClassDownload
web_production: 421
= 1 - v. Download formula. Class requests: download/watch online/play/photo/listen
|
QClassBrandnames
web_production: 422
Query classifier output: the query contains words from the corresponding dictionary. Brand names.
|
QClassDisease
web_production: 423
Medication Dictionary
|
QClassKak
web_production: 424
question
|
QClassMoscow
web_production: 425
Specific request for Moscow
|
QClassOAO
web_production: 426
Weight: -0.005085205304656 organization
|
QClassPorno
web_production: 427
porn
|
QClassTravel
web_production: 428
trips
|
QDOwnerStatPower
web_production: 432
Weight: -0.025355498987515 The number of impressions of the owner for the query, normalized as x/(100 + x).
|
QUrlStatPower
web_production: 433
Weight: -0.194376876842978 The number of impressions of the URL for the query, normalized as x/(100 + x).
|
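Both StatPower factors use the same x/(100 + x) normalization of an impression count. A minimal sketch with a hypothetical function name:

```python
def stat_power(impressions: float) -> float:
    """Normalize an impression count with x / (100 + x)."""
    return impressions / (100 + impressions)
```

The normalization reaches 0.5 at 100 impressions and approaches 1 as the count grows, so it mainly separates pairs with little statistics from pairs with a lot.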
Timestamp
web_production: 450
Computed as (80 - x) / 80, where x is the age of the document in hours. The factors make sense only for the fast-robot index (the last 80 hours). Not used in ranking. Used for separating duplicates.
|
AddTimeFull
web_production: 451
Computed as (80 - x) / 80, where x is the age of the document in hours. The factors make sense only for the fast-robot index (the last 80 hours). Not used in ranking. Used for separating duplicates.
|
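The (80 - x) / 80 formula shared by Timestamp and AddTimeFull could be sketched as below. Clamping to 0 for documents older than 80 hours is an assumption: the description only says the factors are meaningful within the last 80 hours. The names are hypothetical.

```python
# Window size from the description, in hours; the name is an assumption.
FRESH_WINDOW_HOURS = 80


def timestamp_factor(age_hours: float) -> float:
    """(80 - x) / 80 for a document age x in hours.

    Assumption: ages beyond the 80-hour window are clamped to 0.0.
    """
    return max(0.0, (FRESH_WINDOW_HOURS - age_hours) / FRESH_WINDOW_HOURS)
```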
PositionLanguageModel
web_production: 453
Weight: -0.032269052994315 A factor indicating whether a good snippet can be produced.
|
AuraDocLogAuthor
web_production: 456
Weight: -0.097277529611975 Logarithm of the number of shingles on which this owner of the document is recognized as the author
|
RegIsWiki
web_production: 468
A document from the language section of Wikipedia corresponding to the user region
|
LanguageCompliance
web_production: 469
Weight: 0.054576897612176 The language of the document corresponds to the language language
|
CountryPopularQ
web_production: 470
The popularity of the request within the country
|
CountryQDiversity
web_production: 471
Weight: 0.03718037385465 The degree of centralization of the points from which the request is set (inside the country)
|
CountryQDiversity2
web_production: 472
Weight: -0.00120970063307 Geographical distribution of the request within the country
|
IsPornoAdvert
web_production: 477
The page contains porn advertising
|
NumSlashes
web_production: 480
Weight: 0.050576094170344 The number of slashes in Url
|
BM25FdPR_obsolete
web_production: 481
Weight: 0.054156294329288 BM25 with different parameters for different fields, including incoming anchor text. The weight of the text of links pointing to the page is normalized depending on the links' delta PageRank.
|
WatchVideo
web_production: 482
The presence of an embedded video player on the page
|
SubRelevance
web_production: 486
A service factor that was needed for site search and will be needed again in the future.
|
YmwFull
web_production: 492
Weight: -0.044940112806396 The size of the minimum piece of text, including all the words of the request found in the document. Not used now. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAformula/tekushhiekomponenty/ymw Read more))
|
Bclm
web_production: 493
Weight: 0.030786458206337 Buettcher, Clarke and Lushman factor (modified) ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAformula/tekushichiekomponenty/bclm more))
|
QueryCommercialityMx
web_production: 494
Weight: 0.103903118421863 A measure of how 'commercial' the query is. It is a MatrixNet formula computed over factors from the purchase-phrase dictionary in Direct, factors over user queries, and additional intent dictionaries. For queries with an intent to buy the factor tends to 1; for product queries, to 0.6; for queries with no intent to buy (reviews, etc.), to 0. ((http://wiki.yandex-team.ru/Faktorydljanovogokatorazaprosov Factors of the classifier)) ((http://wiki.yandex-team.ru/jandekspoisk/antispam/antiseo/klassifikATORCHESSKIXZAPROSOV its structure))
|
FieldLM
web_production: 495
Weight: 1.36522746e-7 Unigram language model. The language model is built from the document and smoothed with the general language model. When building the document model, information about which document field the query word occurred in (title, head or plain text) is used.
|
GeoCityUrlRegionCity
web_production: 496
Match between the geography determined from the document URL and the city of the query (IP or LR)
|
GeoCityUrlRegionCountry
web_production: 498
Weight: -0.168645758020604 Match between the geography determined from the document URL and the country of the query (IP or LR). Relevant for Russia and Ukraine.
|
GeoCityUrlGeoCityCity
web_production: 499
Match between the geography determined from the document URL and the city named in the query (GEOCITY rule)
|
TitleTrigramsQuery
web_production: 501
Weight: 0.112928770384249 Coverage of the query by character trigrams of the document title
|
TitleTrigramsTitle
web_production: 502
Coverage of the document title by character trigrams of the query
|
OwnerNavQuota
web_production: 506
Weight: 0.189743110446303 The share of clicks for navigation requests
|
GeoRelevAlienCity
web_production: 507
Weight: 0.084699401575226 The result has a geography of the user at the city level ([415] == 1 && [215] == 0)
|
Mpsa
web_production: 513
Weight: 0.093045433292429 Estimates the minimum distance between pairs of query words, taking into account how far the pair is from the beginning of the document (Minimal Pair Size with Attenuation). Pairs are understood to be all consecutive bigrams of query words. Thus the number of pairs equals the number of query words minus 1, so the factor makes sense only for queries of more than one word. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayaformula/tekushhiekomponenty/MPSA MPSA))
|
NearbyQuery
web_production: 523
When responding to a request, the results are important in close proximity ([pharmacies], [children's clinic])
|
CityQuery
web_production: 524
Weight: -0.091993052812036 When answering a request, the results within the city are important (the bulk of localized queries)
|
AdmQuery
web_production: 525
When responding to a request, the results from the region of the user ([airport], [dairy]) are important
|
YmwFull2
web_production: 527
Weight: -0.044940112806396 Fixed YMWFull. It differs from the previous version only by behavior on 2 -word queries. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAformula/tekushhiekomponenty/ymw Read more))
|
FullQuorum
web_production: 528
Binary factor, every word of the request is in the text or in the links
|
AuraDocLogOrigin
web_production: 547
Logarithm of the number of shingles in the document that the site owner added as original texts in the ((http://wiki.yandex-team.ru/jandekspoisk/jekosistema/marketingPr/webmasters/plan/vtorcontect originality plugin)). Not used in the formula; needed for separating duplicates.
|
NationalLanguage
web_production: 553
The document language matches the country of the query
|
FiltrationSegments
web_production: 561
The share of the segments of the request present in the text
|
DBM25_2
web_production: 563
A variation on the theme of DBM25; see ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayaformula/tekushhiekomponenty/DBM25 dBM25)).
|
IsUrlForClickDeboost
web_production: 577
It is known about URL that it is shown too often with very low relevance (according to Bert and/or BM25)
|
YabarUrlLcAc
web_production: 604
Weight: -0.046030869083841 The number of sessions in which the URL was the last one, relative to the number of sessions in which the URL appeared
|
DBM35
web_production: 606
Weight: 0.046757967567051 BM25 over texts and links with special weights for the level of match (form, lemma, synonym)
|
TRLRQuorumFm
web_production: 607
Weight: -0.062810308974889 The weight of the words of the request that is in the text in the exact form
|
TRLRQuorumLemma
web_production: 608
Weight: -0.003021983245146 The weight of the words of the request that is in the text with an accuracy to lemma
|
TRLRQuorumSyn
web_production: 609
The weight of the words of the request that is in the text
|
SmallWindow
web_production: 621
Maximum total weight of query words within a window of 50 words
|
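A sliding-window computation in the spirit of SmallWindow might look like the sketch below. It assumes that every occurrence of a query word inside the window contributes its weight (the source does not say whether repeated occurrences count once or many times) and that the document is already tokenized; the function and parameter names are hypothetical.

```python
def small_window(doc_words: list, query_weights: dict, window: int = 50) -> float:
    """Maximum total weight of query words in any `window` consecutive words.

    Assumption: each occurrence of a query word adds its weight.
    """
    weights = [query_weights.get(word, 0.0) for word in doc_words]
    # Sum of the first window, then slide one word at a time.
    best = current = sum(weights[:window])
    for i in range(window, len(weights)):
        current += weights[i] - weights[i - window]
        best = max(best, current)
    return best
```

The running sum makes the scan linear in document length, rather than recomputing each 50-word window from scratch.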
MetrikaUrlAvgTime
web_production: 623
Similar to YabarUrlAvgTime
|
NavLinear
web_production: 680
((http://wiki.yandex-team.ru/jandekspoisk/antispam/polunavigacionnyezaprosy#faktornnostiparyurl-zapros Classifier)) of vital [query, URL] pairs: the URL is vital for the query if the factor value is > 0.5
|
DiversityCategDownload
web_production: 684
0 or 1 - whether the request is matured by the tickt
|
QrTur
web_production: 687
The predicted share of 'good' (at least two different cities and frequency >= 10) occurrences of the query with geography in Turkey
|
IsNavMxQuery
web_production: 690
Rank 'navigation'
|
UrlDomainSimilarityFixed
web_production: 724
|
QueryURLClicksPCTRYear
web_production: 732
|
QueryURLClicksPCTRPreviousYear
web_production: 733
|
HasPornoQuery
web_production: 760
The output of the Adult rules for the wizard.
|
AuxTitleBM25
web_production: 770
TextBM25 computed over the title against the name of the user's region; analogous to factor 268.
|
Medical2UrlQuality
web_production: 1227
Neural model of content quality for medical subjects
|
Medical2UrlQualityFresh
web_production: 1244
Neural model of content quality for medical topics (for fresh documents)
|
FinLawUrlQuality
web_production: 1247
Neural model of content quality for financial and legal topics
|
FinLawUrlQualityFresh
web_production: 1249
Neural model of content quality for financial and legal topics (for fresh documents)
|
SosUrlQuality
web_production: 1268
Neural model of content quality for SOS topics
|
SosUrlQualityFresh
web_production: 1270
Neural model of content quality for SOS topics (for fresh documents)
|
UrlHostFraction
web_production: 1271
Copy of the old version of factor No. 294. Added for use on the L3 stage only. Trigram coverage between the domain and the query. (Chelyabinsk lottery - chelloto. We transliterate the query, find the trigrams that are covered (che, hel, lot, olo), and look at what share of all trigrams is covered.)
|
AliceMusicUrlTypeIsTrack
web_production: 1559
The type of the canonicalized Yandex Music URL is track
|
UnexpectedTrashUrlQuality
web_production: 1656
Neural document model for detecting unexpected trash
|
TolokaBasedPornQueryClassificationSigmoid
web_production: 1767
Sigmoid-normalized value of the text porn query classifier based on Toloka porn labels
|
TolokaBasedPornQueryClassificationBinary
web_production: 1768
Binarized value of the text porn query classifier based on Toloka labels
|
WebClassificationBasedPornQueryClassification
web_production: 1769
The value of the text porn query classifier based on the web classifier and additional dictionaries
|
WebClassificationBasedPornQueryClassificationBinary
web_production: 1770
Binarized (using networks) value of the text porn query classifier based on the web classifier and additional dictionaries
|
DirtyLanguageInQuery
web_production: 1771
The presence of obscene vocabulary in the query: 0 = absent, 0.5 = mild, 1 = hard
|
PornMarkersInQuery
web_production: 1772
The presence of porn markers in the query (0 = present, 1/3 = absent, 1 = 'gray' query)
|
AdultnessProd
web_production: 1774
Document-level porn classifier, features from the document text
|
AdultnessUrl
web_production: 1775
Document-level porn classifier, features from the document URL
|
NastyImageValue
web_production: 1776
Document-level porn classifier, features from the document's images (the information is taken from the image index)
|
NastyVideo
web_production: 1777
Document-level porn classifier, features from the document's video (the information is taken from the video index)
|
NastyHost
web_production: 1778
Host-level porn classifier, features based on how pornographic the queries are for which the host was shown and clicked.
|
UnexpectedTrashUrlQualityFresh
web_production: 1909
Neural document model for detecting unexpected trash (for fresh documents)
|
PopularQ
begemot_query_factors: 3
The popularity of the request
|
QDiversity
begemot_query_factors: 4
The degree of concentration of the locations from which the query is issued
|
Diversity2
begemot_query_factors: 5
Geographical distribution of the request
|
HourDiversity
begemot_query_factors: 6
How unevenly the query is issued across different times of day
|
NightQuery
begemot_query_factors: 9
The query is issued mainly at night
|
MorningQuery
begemot_query_factors: 10
The query is issued mainly in the morning
|
DayQuery
begemot_query_factors: 11
The query is issued mainly during the day
|
EveningQuery
begemot_query_factors: 12
The query is issued mainly in the evening
|
CountryPopularQ
begemot_query_factors: 13
The popularity of the request within the country
|
CountryQDiversity
begemot_query_factors: 14
The degree of centralization of the points from which the request is set (inside the country)
|
CountryQDiversity2
begemot_query_factors: 15
Geographical distribution of the request within the country
|
MusicQ
begemot_query_factors: 18
How music-related the query is. The output of Anton Konygin's wizard.
|
QueryNonCommerciality
begemot_query_factors: 19
Commerciality of the query according to the dictionary of phrases from Direct: 0 = most commercial, 1 = least commercial.
|
QueryCommercialityMx
begemot_query_factors: 20
A measure of how 'commercial' the query is. It is a MatrixNet formula computed over factors from the purchase-phrase dictionary in Direct, factors over user queries, and additional intent dictionaries. For queries with an intent to buy the factor tends to 1; for product queries, to 0.6; for queries with no intent to buy (reviews, etc.), to 0. ((http://wiki.yandex-team.ru/Faktorydljanovogokatorazaprosov Factors of the classifier)) ((http://wiki.yandex-team.ru/jandekspoisk/antispam/antiseo/klassifikATORCHESSKIXZAPROSOV its structure))
|
IsNavMxQuery
begemot_query_factors: 21
Rank 'navigation'
|
NonCommercialQuery
begemot_query_factors: 22
Binary non-commercial request flag: QueryNonCommerciality > 0.965.
|
PornoQuery
begemot_query_factors: 23
Whether the request contains any words from Yweb/Pornofilter/Porno.query.
|
XPornoQuery
begemot_query_factors: 24
Porn request classifier; uses a different dictionary than PornoQuery
|
IsNavQuery
begemot_query_factors: 26
Whether the request is navigational, judged by clicks on the results
|
IsForeignQuery
begemot_query_factors: 27
Request is not in Russian
|
VideoQuery
begemot_query_factors: 28
Request about the video
|
GeoRegionalityUNew
begemot_query_factors: 32
Request factors - the result of the ((http://wiki.yandex-team.ru/PoiskovajaPlatforma/Lingvistika/ZaprosnyjeFactory/LocalizovannyjeZaprosy request geolocalization classifier)) - a new version of factors [328]-[330]: U - showing regional sites for the request is meaningless;
|
GeoRegionalityRNew
begemot_query_factors: 33
Request factors - the result of the ((http://wiki.yandex-team.ru/PoiskovajaPlatforma/Lingvistika/ZaprosnyjeFactory/LocalizovannyjeZaprosy request geolocalization classifier)) - a new version of factors [328]-[330]: R - geo-relevant - regional results in the search output could be useful, but nothing more;
|
GeoRegionalityVNew
begemot_query_factors: 34
Request factors - the result of the ((http://wiki.yandex-team.ru/PoiskovajaPlatforma/Lingvistika/ZaprosnyjeFactory/LocalizovannyjeZaprosy request geolocalization classifier)) - a new version of factors [328]-[330]: V - regional results are of fundamental importance.
|
QrTur
begemot_query_factors: 38
Predicted share of “good” (at least two different cities and frequency >= 10) references to the request with geography in Turkey
|
NearbyQuery
begemot_query_factors: 39
When answering the request, results in the immediate vicinity are important ([pharmacies], [children's clinic])
|
CityQuery
begemot_query_factors: 40
When answering a request, the results within the city are important (the bulk of localized queries)
|
AdmQuery
begemot_query_factors: 41
When answering the request, results from the user's oblast or region are important ([airport], [dairy])
|
IsOrg
begemot_query_factors: 44
The request is the name of an organization (example: Gazprom) ((http://wiki.yandex-team.ru/arsengadzhikurbanov/warees Description))
|
QClassDownload
begemot_query_factors: 51
= 1 - v. Download formula. Request class: download/watch online/play/photo/listen
|
QClassBrandnames
begemot_query_factors: 52
Result of the request classifier (the request contains words from the corresponding dictionary): brand
|
QClassDisease
begemot_query_factors: 53
Medication Dictionary
|
QClassKak
begemot_query_factors: 54
question
|
QClassMoscow
begemot_query_factors: 55
Specific request for Moscow
|
QClassOAO
begemot_query_factors: 56
organization
|
QClassPorno
begemot_query_factors: 57
porn
|
QClassTravel
begemot_query_factors: 58
trips
|
DiversityCategDownload
begemot_query_factors: 61
0 or 1 - whether the request is matured by the tickt
|
HasPornoQuery
begemot_query_factors: 79
The result of the wizard's Adult rules.
|
TovarCategoryQuery
begemot_query_factors: 215
The request mentions a product category. Not used (deprecated)
|
TovarCategoryVendor
begemot_query_factors: 216
The request mentions a vendor. Not used (deprecated)
|
IsMedicalQuery
begemot_query_factors: 281
Classifier prediction that the request is medical
|
IsLawQuery
begemot_query_factors: 282
Classifier prediction that the request is legal
|
IsFinancialQuery
begemot_query_factors: 283
Prediction of the classifier that the request is financial
|
IsSosQuery
begemot_query_factors: 284
Classifier prediction that the request is on an SOS topic
|
IsNavigationalQuery
begemot_query_factors: 285
Classifier prediction that the request is navigational
|
IsExpectedSafeAnswerQuery
begemot_query_factors: 287
Classifier prediction that the request is "white" (that is, shock content should not be shown for it)
|
HasTrackInQuery
begemot_query_factors: 298
Request contains the word "song" or "track"
|
IsAliceMusicQuery
begemot_query_factors: 299
Musical request from Alice
|
IsMobileStoreQuery
begemot_query_factors: 300
Classifier prediction that a URL from a mobile app store should be shown for the request
|
IsServicePlusQuery
begemot_query_factors: 301
Classifier prediction that the request is especially commercial
|
IsApplianceRepairQuery
begemot_query_factors: 304
Classifier of a request for equipment repair services: office, household, computers, phones
|
IsArabicAliceMusicQuery
begemot_query_factors: 325
Music request for the Arabic Alice scenario
|
LongQuery
collections_production: 0
|
Bclm
collections_production: 4
|
TxtBm25Sy
collections_production: 5
|
DocLen
collections_production: 6
|
TitleTrigramsTitle
collections_production: 9
|
TextBM25_Fm_W1
collections_production: 10
|
TxtBm25Ex
collections_production: 11
|
TxtHeadSy
collections_production: 14
|
YmwFull2
collections_production: 15
|
TxtHeadEx
collections_production: 16
|
TxtHead
collections_production: 17
|
TxtBreakEx
collections_production: 19
|
OriginalRequestAcceptDoc
geo_production: 588
|
WhatOnlyRequestAcceptDoc
geo_production: 589
|
Pessimization
geo_recommendations: 103
Information about pessimization is stored in this factor
|
RandomAddition
geo_recommendations: 104
A random additive to a faster is placed in this factor
|
PessimizeTrash
geo_recommendations: 106
Pessimization of an unwanted organization
|
Boosting
geo_recommendations: 107
Information about boosts is stored in this factor
|
BoostTripadvisorUrl
geo_recommendations: 108
Boost for the presence of a Tripadvisor URL
|
BoostTripadvisorAttractionsUrl
geo_recommendations: 109
Boost for a Tripadvisor URL in the attractions section
|
BoostTripadvisorRating
geo_recommendations: 110
Boost by Tripadvisor rating
|
BoostTripadvisorReviewsCount
geo_recommendations: 111
Boost by the number of reviews on Tripadvisor
|
PriceRate
geo_hotels: 25
Price rating for sorting by price (more expensive)
|
LongQuery
images_l1: 75
The sum of the IDF values of the words in the request. The name is misleading: for example, this factor is larger for the request 'Gadyach' than for the request 'Moscow Peter Yekaterinburg Samara'.
|
GruesomeCombined
images_l1: 93
Output of the aggregated shock-content classifier; used at the middle search level to detect shock-content requests
|
PopularQ
images_new_l1: 3
The popularity of the request
|
QDiversity
images_new_l1: 4
The degree of centralization of the locations from which the request is issued
|
Diversity2
images_new_l1: 5
Geographical distribution of the request
|
HourDiversity
images_new_l1: 6
How pronounced the issuing of the request is at different times of the day
|
NightQuery
images_new_l1: 9
The request is issued mainly at night
|
MorningQuery
images_new_l1: 10
The request is issued mainly in the morning
|
DayQuery
images_new_l1: 11
The request is issued mainly during the day
|
EveningQuery
images_new_l1: 12
The request is issued mainly in the evening
|
CountryPopularQ
images_new_l1: 13
The popularity of the request within the country
|
CountryQDiversity
images_new_l1: 14
The degree of centralization of the locations from which the request is issued (within the country)
|
CountryQDiversity2
images_new_l1: 15
Geographical distribution of the request within the country
|
MusicQ
images_new_l1: 18
How music-related the request is. Output of Anton Konygin's wizard.
|
QueryNonCommerciality
images_new_l1: 19
Commerciality of the request according to the dictionary of phrases from Direct: 0 is maximally commercial, 1 is minimally commercial.
|
QueryCommercialityMx
images_new_l1: 20
A measure of how commercial the request is. It is a composite factor formula computed by Matrixnet over the Direct purchase dictionary, user queries, and additional intent dictionaries. For requests with intent to buy, the factor tends to 1; for product requests, to 0.6; for requests with no purchase intent (reviews, etc.), to 0. ((http://wiki.yandex-team.ru/Faktorydljanovogokatorazaprosov Factors of the Classifier)) ((http://wiki.yandex-team.ru/jandekspoisk/antispam/antiseo/klassifikATORCHESSKIXZAPROSOV STUNITURE OF HIM))
|
IsNavMxQuery
images_new_l1: 21
Rank 'navigation'
|
NonCommercialQuery
images_new_l1: 22
Binary non-commercial request flag: QueryNonCommerciality > 0.965.
|
PornoQuery
images_new_l1: 23
Whether the request contains any words from Yweb/Pornofilter/Porno.query.
|
XPornoQuery
images_new_l1: 24
Porn request classifier; uses a different dictionary than PornoQuery
|
IsNavQuery
images_new_l1: 26
Whether the request is navigational, judged by clicks on the results
|
IsForeignQuery
images_new_l1: 27
Request is not in Russian
|
VideoQuery
images_new_l1: 28
Request about the video
|
GeoRegionalityUNew
images_new_l1: 32
Request factors - the result of the ((http://wiki.yandex-team.ru/PoiskovajaPlatforma/Lingvistika/ZaprosnyjeFactory/LocalizovannyjeZaprosy request geolocalization classifier)) - a new version of factors [328]-[330]: U - showing regional sites for the request is meaningless;
|
GeoRegionalityRNew
images_new_l1: 33
Request factors - the result of the ((http://wiki.yandex-team.ru/PoiskovajaPlatforma/Lingvistika/ZaprosnyjeFactory/LocalizovannyjeZaprosy request geolocalization classifier)) - a new version of factors [328]-[330]: R - geo-relevant - regional results in the search output could be useful, but nothing more;
|
GeoRegionalityVNew
images_new_l1: 34
Request factors - the result of the ((http://wiki.yandex-team.ru/PoiskovajaPlatforma/Lingvistika/ZaprosnyjeFactory/LocalizovannyjeZaprosy request geolocalization classifier)) - a new version of factors [328]-[330]: V - regional results are of fundamental importance.
|
QrTur
images_new_l1: 38
Predicted share of “good” (at least two different cities and frequency >= 10) references to the request with geography in Turkey
|
NearbyQuery
images_new_l1: 39
When answering the request, results in the immediate vicinity are important ([pharmacies], [children's clinic])
|
CityQuery
images_new_l1: 40
When answering a request, the results within the city are important (the bulk of localized queries)
|
AdmQuery
images_new_l1: 41
When answering the request, results from the user's oblast or region are important ([airport], [dairy])
|
IsOrg
images_new_l1: 44
The request is the name of an organization (example: Gazprom) ((http://wiki.yandex-team.ru/arsengadzhikurbanov/warees Description))
|
QClassDownload
images_new_l1: 51
= 1 - v. Download formula. Request class: download/watch online/play/photo/listen
|
QClassBrandnames
images_new_l1: 52
Result of the request classifier (the request contains words from the corresponding dictionary): brand
|
QClassDisease
images_new_l1: 53
Medication Dictionary
|
QClassKak
images_new_l1: 54
question
|
QClassMoscow
images_new_l1: 55
Specific request for Moscow
|
QClassOAO
images_new_l1: 56
organization
|
QClassPorno
images_new_l1: 57
porn
|
QClassTravel
images_new_l1: 58
trips
|
DiversityCategDownload
images_new_l1: 61
0 or 1 - whether the request is matured by the tickt
|
HasPornoQuery
images_new_l1: 79
The result of the wizard's Adult rules.
|
VwChildPorn
images_new_runtime_doc_features: 1
Value of the child-porn (DP) classifier; used for filtering at the middle search level
|
ChildPornProbability
images_new_runtime_doc_features: 4
Value of the child-porn (DP) classifier; used for filtering at the middle search level
|
GruesomeCombined
images_new_runtime_doc_features: 28
Output of the aggregated shock-content classifier; used at the middle search level to detect shock-content requests
|
ImageIsFace
images_new_runtime_doc_features: 36
|
VwPorno2
images_new_runtime_doc_features: 59
The result of the Vowpal Wabbit text porn classifier
|
ImagePorno4
images_new_runtime_doc_features: 85
Image porn classifier output
|
MinLinkAdultnessBetaLevel
images_new_runtime_doc_features: 170
Level of minimal AdultnessBeta values among selected links (0.1 = MinAdultnessBetaAboveFamilyThreshold, 0.3 = MinAdultnessBetaAboveGrayThreshold, 0.5 = MinAdultnessBetaAboveNormalThreshold)
|
VwChildPorn
images_production: 1
Value of the child-porn (DP) classifier; used for filtering at the middle search level
|
LongQuery
images_production: 8
The sum of the IDF values of the words in the request. The name is misleading: for example, this factor is larger for the request 'Gadyach' than for the request 'Moscow Peter Yekaterinburg Samara'.
|
ChildPornProbability
images_production: 10
Value of the child-porn (DP) classifier; used for filtering at the middle search level
|
GruesomeCombined
images_production: 58
Output of the aggregated shock-content classifier; used at the middle search level to detect shock-content requests
|
ImageIsFace
images_production: 72
|
NightQuery
images_production: 92
The request is issued mainly at a certain time of day. Examples of such requests: GDZ, good-night SMS, porn requests.
|
MorningQuery
images_production: 93
|
DayQuery
images_production: 94
|
EveningQuery
images_production: 95
|
QueryCommercialityMx
images_production: 110
The measure of 'commercial' request. It is a comprehensively calculated Matrixnet factor formula for the procurement vocabulary in direct + for user queries + add. Intensive dictionaries. Requests with intensity to buy a factor seeks to -> 1 commodity requests -> 0.6 with intensity cannot buy, reviews, etc. -> 0 ((http://wiki.yandex-team.ru/Faktorydljanovogokatorazaprosov Factors of the Classifier))) (HTTP : //wiki.yandex-team.ru/jandekspoisk/antispam/antiseo/klassifikATORCHESSKIXZAPROSOV STUNITURE OF HIM))
|
ImageIBocm
images_production: 153
|
MatrixNet
images_production: 160
Matrixnet applied to all factors (the formula)
|
NumWordsTRSy
images_production: 164
The percentage of request words found in the document (up to synonyms)
|
VwPorno2
images_production: 176
The result of the Vowpal Wabbit text porn classifier
|
AnnYaBclmPlain
images_production: 203
Plain BCLM based on queries from Yandex sessions
|
QueryUrlPCTR2
images_production: 207
The CTR of the URL given the query
|
ImagePorno4
images_production: 338
Image porn classifier output
|
MinLinkAdultnessBetaLevel
images_production: 694
Level of minimal AdultnessBeta values among selected links (0.1 = MinAdultnessBetaAboveFamilyThreshold, 0.3 = MinAdultnessBetaAboveGrayThreshold, 0.5 = MinAdultnessBetaAboveNormalThreshold)
|
VwChildPorn
images_recommendations: 1
Value of the child-porn (DP) classifier; used for filtering at the middle search level
|
ChildPornProbability
images_recommendations: 4
Value of the child-porn (DP) classifier; used for filtering at the middle search level
|
GruesomeCombined
images_recommendations: 29
Output of the aggregated shock-content classifier; used at the middle search level to detect shock-content requests
|
ImageIsFace
images_recommendations: 37
|
VwPorno2
images_recommendations: 70
The result of the Vowpal Wabbit text porn classifier
|
ImagePorno4
images_recommendations: 114
Image porn classifier output
|
MinLinkAdultnessBetaLevel
images_recommendations: 215
Level of minimal AdultnessBeta values among selected links (0.1 = MinAdultnessBetaAboveFamilyThreshold, 0.3 = MinAdultnessBetaAboveGrayThreshold, 0.5 = MinAdultnessBetaAboveNormalThreshold)
|
IsPorno
neural_network_over_dssm_factors: 0
Document from porn kitski
|
IsFake
neural_network_over_dssm_factors: 2
Fast document
|
IsEShop
neural_network_over_dssm_factors: 3
Commercial page (Classifier Savina)
|
IsForum
neural_network_over_dssm_factors: 4
URL matches the forum_detector regular expression
|
IsPornoAdvert
neural_network_over_dssm_factors: 11
There is porn advertising on the page
|
Poetry
neural_network_over_dssm_factors: 12
The poetry of the document
|
PoetryQuad
neural_network_over_dssm_factors: 13
The maximum poetry of the quatrain
|
SynFLremap1
neural_network_over_dssm_factors: 15
Shows how unnatural the text is from the standpoint of the Russian language. An estimate of how likely the document text was produced by a synonymizer or generated automatically. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAfermula/tekushhiekomponenty/antispam?v=1il#h58953-2 more))
|
SynFLremap2
neural_network_over_dssm_factors: 16
Shows how unnatural the text is from the standpoint of the Russian language. An estimate of how likely the document text was produced by a synonymizer or generated automatically. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAfermula/tekushhiekomponenty/antispam?v=1il#h58953-2 more))
|
UrlSessNormDurRate
neural_network_over_dssm_factors: 17
nd/i
|
SynNumBadWordPairs
neural_network_over_dssm_factors: 19
The proportion of bad pairs among all pairs found in the table: Z/(X+1), where Z is the number of bad pairs in the text, and X is the number of ((http://wiki.yandex-team.ru/evgenijgrechnikov/testsynonimizers of 2000-navigable)) pairs
|
PercentWordsInLinks
neural_network_over_dssm_factors: 25
The percentage of words inside <a>...</a> tags out of all words
|
LongestText
neural_network_over_dssm_factors: 37
The size of the largest text segment (from factor [18] PureText)
|
NumSlashes
neural_network_over_dssm_factors: 40
The number of slashes in the URL
|
WatchVideo
neural_network_over_dssm_factors: 41
The presence of a built-in video player on the page
|
News
neural_network_over_dssm_factors: 56
This is news (determined by the characteristic ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayaformula/tekushichiekomponenty/klassificacionnye?v=tkd#h45859-3 patterns in the URL))).
|
PureText
neural_network_over_dssm_factors: 58
Long text without links.
|
Root
neural_network_over_dssm_factors: 59
This is a site's main page (root).
|
AddTime
neural_network_over_dssm_factors: 67
The time the page was added; a larger value means an older document. The square root of the time is taken and mapped onto the interval [0, 1] so that 3+ years gives 1.
|
IsMainPage
neural_network_over_dssm_factors: 68
If this is the main page of the owner (most often a second-level domain, for example xxxx.ru), the factor is 1. For free hosting services, personal blogs, etc. (for example, LiveJournal, narod.ru, etc.), third-level domains (such as xxxxx.narod.ru) also get a factor of 1.
|
Hops
neural_network_over_dssm_factors: 69
The number of hops to the URL from the main page (the fewer hops, that is, the closer to the main page, the lower the value: 0 is the main page, 1 means the URL cannot be reached from the main page, and a value strictly between 0 and 1 means it can be reached). The normal value for the host root is 0.0039.
|
IsBlog
neural_network_over_dssm_factors: 71
Page from a blog hosting service
|
IsLivejournal
neural_network_over_dssm_factors: 72
Page from livejournal.com
|
AuraDocLogAuthor
neural_network_over_dssm_factors: 78
Logarithm of the number of shingles for which the owner of this document is recognized as the author
|
AuraDocLogOrigin
neural_network_over_dssm_factors: 79
Logarithm of the number of shingles in the document added by the owner of the site as original texts in ((http://wiki.yandex-team.ru/jandekspoisk/jekosistema/marketingPr/webmasters/plan/vtorcontect of originality plugin)). It does not participate in the formula, it is needed to disconnect the takes
|
TextFeatures
neural_network_over_dssm_factors: 119
The quality of the text. It is computed by a rather complex formula
|
TextLike
neural_network_over_dssm_factors: 120
Text quality (classifier Alekseev)
|
DocLen
neural_network_over_dssm_factors: 121
Document length in sentences
|
UrlLen
neural_network_over_dssm_factors: 122
The length of the URL, divided by 5
|
YabarUrlAvgTime
neural_network_over_dssm_factors: 126
The time a user spends on the page, averaged over users. It is computed as the difference between neighboring transitions.
|
UrlQueryVariety
neural_network_over_dssm_factors: 127
The degree of variety of the requests for which this URL is clicked
|
YabarUrlLcAc
neural_network_over_dssm_factors: 129
The number of sessions in which the URL was the last one, divided by the number of sessions in which the URL appeared
|
MetrikaUrlAvgTime
neural_network_over_dssm_factors: 131
Similar to YabarUrlAvgTime
|
EngLang
neural_network_over_dssm_factors: 136
Document language - English
|
CyrLang
neural_network_over_dssm_factors: 137
The language of the document is Cyrillic
|
IsWiki
neural_network_over_dssm_factors: 142
page from ru.wikipedia.org
|
OwnerNavQuota
neural_network_over_dssm_factors: 155
The share of clicks for navigation requests
|
OwnerSessNormDuration
neural_network_over_dssm_factors: 157
ND/K normalized time to click
|
AddTimeMP
neural_network_over_dssm_factors: 181
The time the main page of the owner (host?) was added. Remapped in the same way as AddTime.
|
YaBarCoreHost
neural_network_over_dssm_factors: 302
The core audience of the host according to Yandex.Bar
|
YabarHostInternalTraffic
neural_network_over_dssm_factors: 303
The share of visits to the site that do not come via links (typed by hand or opened from bookmarks)
|
YabarHostAvgTime
neural_network_over_dssm_factors: 304
The user's active continuous time (in seconds) on host pages, averaged over users
|
YabarHostAvgTime2
neural_network_over_dssm_factors: 305
The user's active continuous time (in seconds) on the pages of the host, averaged over users. Based on the internal counter of Yandex.Bar/Elements/Browser
|
MaxD30Long
personalization: 0
Max cosine similarity between document and user history with clicks dwelltime > 30sec, by realtime user_history
|
MaxD60Long
personalization: 1
Max cosine similarity between document and user history with clicks dwelltime > 60sec, by realtime user_history
|
MaxD120Long
personalization: 2
Max cosine similarity between document and user history with clicks dwelltime > 120sec, by realtime user_history
|
MaxD180Long
personalization: 3
Max cosine similarity between document and user history with clicks dwelltime > 180sec, by realtime user_history
|
MaxD360Long
personalization: 4
Max cosine similarity between document and user history with clicks dwelltime > 360sec, by realtime user_history
|
MaxD30Short
personalization: 5
Max cosine similarity between document and user history with clicks dwelltime <= 30sec, by realtime user_history
|
MaxD60Short
personalization: 6
Max cosine similarity between document and user history with clicks dwelltime <= 60sec, by realtime user_history
|
MaxD120Short
personalization: 7
Max cosine similarity between document and user history with clicks dwelltime <= 120sec, by realtime user_history
|
MaxD180Short
personalization: 8
Max cosine similarity between document and user history with clicks dwelltime <= 180sec, by realtime user_history
|
MaxD360Short
personalization: 9
Max cosine similarity between document and user history with clicks dwelltime <= 360sec, by realtime user_history
|
TopavgS5D30Long
personalization: 10
Avg by top-5 maximum cosine similarity between document and user history with clicks dwelltime > 30sec, by realtime user_history
|
TopavgS5D60Long
personalization: 11
Avg by top-5 maximum cosine similarity between document and user history with clicks dwelltime > 60sec, by realtime user_history
|
TopavgS5D120Long
personalization: 12
Avg by top-5 maximum cosine similarity between document and user history with clicks dwelltime > 120sec, by realtime user_history
|
TopavgS5D180Long
personalization: 13
Avg by top-5 maximum cosine similarity between document and user history with clicks dwelltime > 180sec, by realtime user_history
|
TopavgS5D360Long
personalization: 14
Avg by top-5 maximum cosine similarity between document and user history with clicks dwelltime > 360sec, by realtime user_history
|
TopavgS10D30Long
personalization: 15
Avg by top-10 maximum cosine similarity between document and user history with clicks dwelltime > 30sec, by realtime user_history
|
TopavgS10D60Long
personalization: 16
Avg by top-10 maximum cosine similarity between document and user history with clicks dwelltime > 60sec, by realtime user_history
|
TopavgS10D120Long
personalization: 17
Avg by top-10 maximum cosine similarity between document and user history with clicks dwelltime > 120sec, by realtime user_history
|
TopavgS10D180Long
personalization: 18
Avg by top-10 maximum cosine similarity between document and user history with clicks dwelltime > 180sec, by realtime user_history
|
TopavgS10D360Long
personalization: 19
Avg by top-10 maximum cosine similarity between document and user history with clicks dwelltime > 360sec, by realtime user_history
|
TopavgS15D30Long
personalization: 20
Avg by top-15 maximum cosine similarity between document and user history with clicks dwelltime > 30sec, by realtime user_history
|
TopavgS15D60Long
personalization: 21
Avg by top-15 maximum cosine similarity between document and user history with clicks dwelltime > 60sec, by realtime user_history
|
TopavgS15D120Long
personalization: 22
Avg by top-15 maximum cosine similarity between document and user history with clicks dwelltime > 120sec, by realtime user_history
|
TopavgS15D180Long
personalization: 23
Avg by top-15 maximum cosine similarity between document and user history with clicks dwelltime > 180sec, by realtime user_history
|
TopavgS15D360Long
personalization: 24
Avg by top-15 maximum cosine similarity between document and user history with clicks dwelltime > 360sec, by realtime user_history
|
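The MaxD* and TopavgS* factors above can be read as a maximum, or a top-k average, of cosine similarities between the document and the user's clicked history, filtered by dwell time. The following is a minimal sketch under stated assumptions: that documents and history items are embedding vectors, that an empty filtered history yields 0.0, and that the names `HistoryClick`, `max_similarity` and `topavg_similarity` are hypothetical. None of these details come from the source.

```python
import math
from dataclasses import dataclass
from typing import Sequence


@dataclass
class HistoryClick:
    """Hypothetical record of one clicked document in the user's history."""
    embedding: Sequence[float]
    dwelltime_sec: float


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def _similarities(doc, history, threshold_sec, long_clicks):
    """Similarities to clicks with dwelltime > threshold (long) or <= threshold (short)."""
    return [
        cosine(doc, click.embedding)
        for click in history
        if (click.dwelltime_sec > threshold_sec) == long_clicks
    ]


def max_similarity(doc, history, threshold_sec, long_clicks=True):
    """Analogue of MaxD<threshold>Long / MaxD<threshold>Short."""
    return max(_similarities(doc, history, threshold_sec, long_clicks), default=0.0)


def topavg_similarity(doc, history, threshold_sec, k, long_clicks=True):
    """Analogue of TopavgS<k>D<threshold>Long: mean of the k largest similarities."""
    top = sorted(_similarities(doc, history, threshold_sec, long_clicks), reverse=True)[:k]
    return sum(top) / len(top) if top else 0.0
```

For example, `max_similarity(doc, history, 30)` would correspond to MaxD30Long and `topavg_similarity(doc, history, 30, k=5)` to TopavgS5D30Long under these assumptions.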
UserClicksLong
personalization: 32
Saturated TotalClicks by long user_profiles
|
UserClicksRt
personalization: 33
Saturated TotalClicks by realtime user_profiles
|
ClicksForFirstQRealtime
personalization: 34
Saturated clicks on first query in history, by realtime user_actions
|
ClicksForPrevQRealtime
personalization: 35
Saturated clicks on previous query in history, by realtime user_actions
|
SessionClicksCountRealtime
personalization: 42
Saturated number of clicks in session by realtime user_actions
|
SessionRequestsCountRealtime
personalization: 43
Saturated number of requests in session by realtime user_actions
|
UserIntermediateClicksLong
personalization: 44
Saturated number of long clicks which are not last in session by long user_profile
|
UserIntermediateClicksRt
personalization: 45
Saturated number of long clicks which are not last in session by realtime user_profile
|
UserFinalClicksLong
personalization: 46
Saturated number of last clicks in session by long user_profile
|
InverseUserQueryTopicCountRt
personalization: 54
Inverted number of queries with the same topic by realtime user_profiles
|
InverseUserQueryClickedTopicCountRt
personalization: 56
Inverted number of clicked queries with the same topic by realtime user_profiles
|
HostClicksByUserLong
personalization: 59
Saturated number of user clicks on host by long user_profiles
|
HostClicksByUserRt
personalization: 61
Saturated number of user clicks on host by realtime user_profiles
|
HostClicksWithTimeDiscountLong
personalization: 64
Sum of time discounted user clicks on host by long user_profiles
|
HostClicksWithTimeDiscountRt
personalization: 66
Sum of time discounted user clicks on host by realtime user_profiles
|
HostClickedForFirstQRealtime
personalization: 67
Host was clicked on first query in session by realtime user_actions
|
HostShownForFirstQRealtime
personalization: 68
Host was shown on first query in session by realtime user_actions
|
InvertedHostPosForFirstQRealtime
personalization: 69
Inverted position of host on first query in session by realtime user_actions
|
HostClickedForPrevQRealtime
personalization: 70
Host was clicked on previous query in session by realtime user_actions
|
HostShownForPrevQRealtime
personalization: 71
Host was shown on previous query in session by realtime user_actions
|
InvertedHostPosForPrevQRealtime
personalization: 72
Inverted position of host on previous query in session by realtime user_actions
|
QIsSameAsPrevQRealtime
personalization: 74
Normalized query equals the previous one in session by realtime user_actions
|
UserHostCctrRt
personalization: 118
CCTR(clicks[host], shows[host]), rt user_profiles
|
UserHostCtrWithTimeDiscountRt
personalization: 121
CTR(clicks[host], shows[host]) with time discount (1 / (1 + dt) for every click/skip where dt = days between now and click/skip), rt user_profiles
|
HostIntermediateClicksLong
personalization: 123
Sat(clicks[host] without short clicks and last-in-session clicks), long user_profiles
|
HostIntermediateClicksRt
personalization: 126
Sat(clicks[host] without short clicks and last-in-session clicks), rt user_profiles
|
UserQhashHostCctrLong
personalization: 130
CCTR(clicks[query-host], shows[query-host]) by long user_profiles
|
UserQhashHostCctrRt
personalization: 132
CCTR(clicks[query-host], shows[query-host]) by rt user_profiles
|
UserQhashHostClicksRt
personalization: 135
Sat(clicks[query-host]) by rt user_profiles
|
HostFinalClicksLong
personalization: 139
Saturated final host clicks by long user_profiles
|
HostFinalClicksRt
personalization: 142
Saturated final host clicks by rt user_profiles
|
HostDwelltimeAtPrevQFixedRealtime
personalization: 149
hostDwelltime / (hostDwelltime + 60), realtime user_actions
|
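The formula given for HostDwelltimeAtPrevQFixedRealtime, hostDwelltime / (hostDwelltime + 60), maps an unbounded dwell time into the interval [0, 1). A minimal sketch follows; it assumes the dwell time is measured in seconds (the source does not state the unit), and the function name and the rejection of negative input are illustrative choices, not taken from the source.

```python
def host_dwelltime_at_prev_q(host_dwelltime: float, offset: float = 60.0) -> float:
    """Compute hostDwelltime / (hostDwelltime + 60).

    `host_dwelltime` is assumed to be in seconds; `offset` is the constant 60
    from the factor description. The value is 0 for no dwell time, 0.5 when the
    dwell time equals the offset, and approaches 1 for very long dwell times.
    """
    if host_dwelltime < 0:
        raise ValueError("dwell time must be non-negative")
    return host_dwelltime / (host_dwelltime + offset)
```

Under this reading, a dwell time of 60 gives 0.5 and a dwell time of 180 gives 0.75, so additional time spent on the host matters less and less.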
HostIntermediateClicksWithTimeDiscountLong
personalization: 151
Sat(clicks[host] without short clicks and last-in-session clicks) with time discount (1 / (1 + dt) where dt = days between now and click), long user_profiles
|
HostIntermediateClicksWithTimeDiscountRt
personalization: 154
Sat(clicks[host] without short clicks and last-in-session clicks) with time discount (1 / (1 + dt) where dt = days between now and click), rt user_profiles
|
HostFinalClicksWithTimeDiscountLong
personalization: 156
Saturated host final clicks with time discount (1 / (1 + dt) where dt = days between now and click) by long user_profiles
|
HostFinalClicksWithTimeDiscountRt
personalization: 159
Saturated host final clicks with time discount (1 / (1 + dt) where dt = days between now and click) by rt user_profiles
|
HostLongClicksLong
personalization: 161
Sat(long clicks[host]), long user_profiles
|
HostLongClicksRt
personalization: 164
Sat(long clicks[host]), rt user_profiles
|
HostLongClicksWithTimeDiscountLong
personalization: 166
Sat(long clicks[host]) with time discount (1 / (1 + dt) where dt = days between now and click), long user_profiles
|
HostLongClicksWithTimeDiscountRt
personalization: 169
Sat(long clicks[host]) with time discount (1 / (1 + dt) where dt = days between now and click), rt user_profiles
|
UserHostCctr2Rt
personalization: 172
CCTR2(clicks[host], shows[host]), rt user_profiles
|
UserQhashHostCctr2Long
personalization: 174
CCTR2(clicks[query-host], shows[query-host]) by long user_profiles
|
UserQhashHostCctr2Rt
personalization: 176
CCTR2(clicks[query-host], shows[query-host]) by rt user_profiles
|
UserDocposCctr2Long
personalization: 178
CCTR2(clicks[pos], skips[pos]) by long user_profiles
|
UserDocposCctr2Rt
personalization: 179
CCTR2(clicks[pos], skips[pos]) by realtime user_profiles
|
UserHostTopicShowsFractionRt
personalization: 182
HostTopicShows to HostShows fraction by realtime user_profiles
|
UserHostTopicClicksFractionRt
personalization: 184
HostTopicClicks to HostClicks fraction by realtime user_profiles
|
UserHostTopicCtrRt
personalization: 186
HostTopicClicks to HostTopicShows fraction by realtime user_profiles
|
UserQhashHostFinalClicksLong
personalization: 188
Saturated final clicks for query-host pair by long user_profiles
|
UserQhashHostFinalClicksRt
personalization: 189
Saturated final clicks for query-host pair by realtime user_profiles
|
UserQhashHostCtrLong
personalization: 191
CTR(clicks[query-host], shows[query-host]) by long user_profiles
|
UserQhashHostCtrRt
personalization: 192
CTR(clicks[query-host], shows[query-host]) by realtime user_profiles
|
UserUrlCtrLong
personalization: 194
CTR(clicks[url], shows[url]) by long user_profiles
|
UserUrlCtrRt
personalization: 196
CTR(clicks[url], shows[url]) by realtime user_profiles
|
UserUrlCctrRt
personalization: 198
CCTR(clicks[url], shows[url]) by realtime user_profiles
|
UserUrlCctr2Long
personalization: 200
CCTR2(clicks[url], shows[url]) by long user_profiles
|
UserUrlCctr2Rt
personalization: 202
CCTR2(clicks[url], shows[url]) by realtime user_profiles
|
UserUrlFinalClicksLong
personalization: 204
Saturated final clicks for url by long user_profiles
|
UserUrlFinalClicksRt
personalization: 206
Saturated final clicks for url by realtime user_profiles
|
UserUrlClicksLong
personalization: 208
Saturated clicks for url by long user_profiles
|
UserUrlClicksRt
personalization: 210
Saturated clicks for url by realtime user_profiles
|
UserQurlCtrLong
personalization: 216
CTR(clicks[query-url], shows[query-url]) by long user_profiles
|
UserQurlCtrRt
personalization: 217
CTR(clicks[query-url], shows[query-url]) by realtime user_profiles
|
UserQurlCctrRt
personalization: 218
CCTR(clicks[query-url], shows[query-url]) by realtime user_profiles
|
UserQurlCctr2Long
personalization: 220
CCTR2(clicks[query-url], shows[query-url]) by long user_profiles
|
UserQurlCctr2Rt
personalization: 221
CCTR2(clicks[query-url], shows[query-url]) by realtime user_profiles
|
UserQurlFinalClicksLong
personalization: 223
Saturated final clicks for query-url by long user_profiles
|
UserQurlFinalClicksRt
personalization: 224
Saturated final clicks for query-url by realtime user_profiles
|
UserQurlClicksLong
personalization: 226
Saturated clicks for query-url by long user_profiles
|
UserQurlClicksRt
personalization: 227
Saturated clicks for query-url by realtime user_profiles
|
HostFullSatisfactionClicksLong
personalization: 232
Saturated number of single clicks per request for host by long user_profiles
|
HostFullSatisfactionClicksRt
personalization: 234
Saturated number of single clicks per request for host by realtime user_profiles
|
HostFullSatisfactionClicksWithTimeDiscountLong
personalization: 236
Sum of 1 / (1 + dt), where dt = days between now and single click per request for host by long user_profiles
|
HostFullSatisfactionClicksWithTimeDiscountRt
personalization: 238
Sum of 1 / (1 + dt), where dt = days between now and single click per request for host by realtime user_profiles
|
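The time-discounted sum used by the two WithTimeDiscount factors above can be sketched as follows. This is only an illustration: the function name, the input format, and the choice to measure dt in fractional days are assumptions, not something the factor list specifies.

```python
from datetime import datetime, timedelta


def time_discounted_clicks(click_times, now):
    """Sum of 1 / (1 + dt) over clicks, where dt is the click age in days."""
    total = 0.0
    for clicked_at in click_times:
        # Assumption: dt is measured in fractional days and is never negative.
        dt_days = max((now - clicked_at).total_seconds() / 86400.0, 0.0)
        total += 1.0 / (1.0 + dt_days)
    return total


now = datetime(2023, 1, 10)
clicks = [now, now - timedelta(days=1), now - timedelta(days=3)]
score = time_discounted_clicks(clicks, now)  # 1 + 1/2 + 1/4
```

A click made today contributes 1, and older clicks contribute progressively less, so recent satisfaction dominates the sum.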
TimeBetweenPrevAndCurQRealtime
personalization: 257
Normalized time between previous and current requests by realtime user_actions
|
SessionLengthRealtime
personalization: 258
Normalized time between first and current requests by realtime user_actions
|
WebTRp1
video_production: 2
Strict priority for TR is a text priority: all the words of the request occur somewhere in the document (and satisfy the request's contextual restrictions, for example, both words must be in one sentence).
|
WebPassageLegacyTR
video_production: 8
Text relevance (maxfreq is the frequency of the most frequent word, which serves as a measure of document length).
|
PornoQuery
video_production: 16
Porn request.
|
Duration
video_production: 47
The duration of the video. It is also used in Efir ranking.
|
IsFilmQuery
video_production: 51
The probable duration for the request is at least 72 minutes, the request is not about a series, and the WaresFilm intent corrects Setupids.
|
BestLRSyn
video_production: 69
Link rank of the best link. Takes thesaurus expansions into account.
|
OntoIdMatched
video_production: 81
The document's OntoId matches the object specified in the request, and the video is long enough for this object.
|
OntoIdMatchedWide
video_production: 82
The permissible boost coefficient for the document when the document's OntoId matches a request of type HUM, Food, Auto, Music, etc.
|
IsTrusted
video_production: 87
The video is hosted on a trusted hosting with authorized content (for example, a legal online cinema).
|
DssmL2WebReformulationsDt
video_production: 99
LogDwellTime predicted by the web DSSM model trained on reformulations. It is also used in Efir ranking.
|
IsBoostable
video_production: 104
The document is good enough to be boosted.
|
IsGoodSvod
video_production: 106
Whether the SVOD document is suitable by license date and for the request (the same film, series, or episode).
|
IsGoodAvod
video_production: 107
Whether the AVOD document is suitable by license date and for the request (the same film, etc.).
|
IsSvod
video_production: 110
SVOD document, the date of the license is suitable.
|
IsAvod
video_production: 114
AVOD document, the date of the license is suitable.
|
SerialNameMatched
video_production: 129
The name of the series object in the request matches the document.
|
SerialSeason
video_production: 130
Season of a series object.
|
SerialEpisode
video_production: 131
Episode of a series object.
|
SerialEpisodeMatched
video_production: 132
The episode of the series object in the request matches the document.
|
SerialSeasonMatched
video_production: 133
The season of the series object in the request matches the document.
|
WaresFilm
video_production: 139
Request about the film, the series, etc.
|
Bclm2
video_production: 155
A factor measuring the proximity of the request to the text of the document. It differs from BCLM in that the weights of all words are considered equal. It is also used in Efir ranking.
|
FirstHitSentenceBocmFull
video_production: 179
BOCM over glued link texts, calculated only on the first sentences with hits; all forms of occurrence are considered equivalent.
|
IsWideFilmRequest
video_production: 195
Whether the request is a wide film request. For example, 'Best Comedies of 2013' and 'Watch Online Films about Love' are wide film requests.
|
PlayerDepthHostMeanReg
video_production: 210
The average player viewing depth of videos for the host, aggregated over all URLs of the host for which there are player logs.
|
TitleTrigramsInQuery
video_production: 219
Coverage of the Title trigrams by the request trigrams. It is also used in Efir ranking.
|
DssmQueryEmbeddingWebCtrNoMinerPca0
video_production: 242
Principal components of the request embedding from the DSSMCTRNOMINER model. It is also used in Efir ranking.
|
DssmQueryEmbeddingWebCtrNoMinerPca1
video_production: 243
Principal components of the request embedding from the DSSMCTRNOMINER model. It is also used in Efir ranking.
|
DssmQueryEmbeddingWebCtrNoMinerPca2
video_production: 244
Principal components of the request embedding from the DSSMCTRNOMINER model. It is also used in Efir ranking.
|
DssmQueryEmbeddingWebCtrNoMinerPca3
video_production: 245
Principal components of the request embedding from the DSSMCTRNOMINER model. It is also used in Efir ranking.
|
DssmQueryEmbeddingWebCtrNoMinerPca4
video_production: 246
Principal components of the request embedding from the DSSMCTRNOMINER model. It is also used in Efir ranking.
|
DssmQueryEmbeddingWebCtrNoMinerPca5
video_production: 247
Principal components of the request embedding from the DSSMCTRNOMINER model. It is also used in Efir ranking.
|
AdditionRank
video_production: 256
Cloud additive
|
IsPornoDoc
video_production: 259
A document that we consider porn with a filed filter
|
TrailerDuration
video_production: 362
The video has a trailer duration
|
MarkUrlShowsNormal
video_production: 370
A linearly normalized number of URL shows for the request, with marker normalization.
|
MetaDuration
video_production: 376
The average value of the Duration factor.
|
PornoShowVideoQuery
video_production: 403
Porn request from manual list
|
IsYandexHosting
video_production: 519
Flag: whether the host is Yandex hosting.
|
DssmL3WebLogDwellTime
video_production: 559
LogDwellTime predicted by the web DSSM model. It is also used in Efir ranking.
|
DssmL3VideoDeepClickPlayerDepth
video_production: 596
DSSM with a PlayerDepth target on the video deep click pool. It is also used in Efir ranking.
|
BadSerialDoc
video_production: 597
Unsuitable episode (a different series, season, or episode).
|
DssmBoostingXfOneSeAmSsHardQueryMutationAddOnlineWordRenormedDistance
video_production: 697
Characterizes the request by how much it changes when a fixed word is added ('online' for Cyrillic); the DSSM model DSSMBOOSTINGXFONESEAMSHARD is used. It is also used in Efir ranking.
|
FreshRelevance
video_production: 712
Predict formula for the relevance of a fresh document.
|
SrcIsPornoDoc
video_related: 9
A document that we consider porn with a filed filter
|
IsSecondaryTag
video_hub: 111
Equals 1 when the request covers several tags and the document has already matched a tag
|
MedianWaitTime
web_discovery: 0
Median pause between messages over the most active interval
|
LeftIsPorno
web_itditp: 0
Document from porn kitski
|
IsPorno
web_itditp: 1
Document from porn kitski
|
LeftIsFake
web_itditp: 6
Fast document
|
IsFake
web_itditp: 7
Fast document
|
LeftIsEShop
web_itditp: 10
Commercial page (Classifier Savina)
|
IsEShop
web_itditp: 11
Commercial page (Classifier Savina)
|
LeftIsForum
web_itditp: 12
The URL matches the forum_detector regular expression
|
IsForum
web_itditp: 13
The URL matches the forum_detector regular expression
|
LeftIsPornoAdvert
web_itditp: 26
The page contains porn advertising
|
IsPornoAdvert
web_itditp: 27
The page contains porn advertising
|
LeftPoetry
web_itditp: 28
How poetic the document is
|
Poetry
web_itditp: 29
How poetic the document is
|
LeftPoetryQuad
web_itditp: 30
The maximum poetry score of a quatrain
|
PoetryQuad
web_itditp: 31
The maximum poetry score of a quatrain
|
LeftSynFLremap1
web_itditp: 34
Shows how unnatural the text is from the point of view of the Russian language. An assessment of how likely the text of the document was generated by a synonymizer or automatically. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAfermula/tekushhiekomponenty/antispam?v=1il#h58953-2 more))
|
SynFLremap1
web_itditp: 35
Shows how unnatural the text is from the point of view of the Russian language. An assessment of how likely the text of the document was generated by a synonymizer or automatically. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAfermula/tekushhiekomponenty/antispam?v=1il#h58953-2 more))
|
LeftSynFLremap2
web_itditp: 36
Shows how unnatural the text is from the point of view of the Russian language. An assessment of how likely the text of the document was generated by a synonymizer or automatically. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAfermula/tekushhiekomponenty/antispam?v=1il#h58953-2 more))
|
SynFLremap2
web_itditp: 37
Shows how unnatural the text is from the point of view of the Russian language. An assessment of how likely the text of the document was generated by a synonymizer or automatically. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAfermula/tekushhiekomponenty/antispam?v=1il#h58953-2 more))
|
LeftUrlSessNormDurRate
web_itditp: 38
nd/i
|
UrlSessNormDurRate
web_itditp: 39
nd/i
|
LeftSynNumBadWordPairs
web_itditp: 42
The proportion of bad pairs among all pairs found in the table: Z/(X+1), where Z is the number of bad pairs in the text and X is the number of ((http://wiki.yandex-team.ru/evgenijjjgrechnikov/testSynonimizers 2000-navigable)) pairs
|
SynNumBadWordPairs
web_itditp: 43
The proportion of bad pairs among all pairs found in the table: Z/(X+1), where Z is the number of bad pairs in the text and X is the number of ((http://wiki.yandex-team.ru/evgenijjjgrechnikov/testSynonimizers 2000-navigable)) pairs
|
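The Z/(X+1) ratio above is small enough to sketch directly. The function and parameter names are hypothetical, and Z and X are read here as the count of bad word pairs and the count of pairs found in the table; the source does not describe how either count is obtained.

```python
def bad_word_pair_share(num_bad_pairs, num_table_pairs):
    """Z / (X + 1): the +1 keeps the ratio defined when no pairs are found."""
    return num_bad_pairs / (num_table_pairs + 1)


share = bad_word_pair_share(3, 9)   # 3 / (9 + 1)
empty = bad_word_pair_share(0, 0)   # no pairs at all, no division by zero
```

Because of the +1 in the denominator, the value stays strictly below 1 even when every found pair is bad.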
LeftPercentWordsInLinks
web_itditp: 54
The percentage of words inside <a> .. </a> tags out of all words
|
PercentWordsInLinks
web_itditp: 55
The percentage of words inside <a> .. </a> tags out of all words
|
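A rough way to compute such a share of link words from raw HTML is sketched below with Python's standard `html.parser`. Whitespace tokenization, counting every text node (including script and style content), and the 0 to 100 scale are assumptions made for the illustration; the class and function names are hypothetical.

```python
from html.parser import HTMLParser


class LinkWordCounter(HTMLParser):
    """Counts whitespace-separated words overall and inside <a> elements."""

    def __init__(self):
        super().__init__()
        self.link_depth = 0
        self.words_in_links = 0
        self.words_total = 0

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self.link_depth += 1

    def handle_endtag(self, tag):
        if tag == "a" and self.link_depth > 0:
            self.link_depth -= 1

    def handle_data(self, data):
        count = len(data.split())
        self.words_total += count
        if self.link_depth > 0:
            self.words_in_links += count


def percent_words_in_links(html):
    counter = LinkWordCounter()
    counter.feed(html)
    counter.close()
    if counter.words_total == 0:
        return 0.0  # assumption: an empty page gets 0
    return 100.0 * counter.words_in_links / counter.words_total


page = '<p>one two <a href="#">three four</a> five six seven eight</p>'
value = percent_words_in_links(page)  # 2 of 8 words are inside the link
```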
LeftLongestText
web_itditp: 80
The size of the largest text segment (from the factor [18] puretext)
|
LongestText
web_itditp: 81
The size of the largest text segment (from the factor [18] puretext)
|
LeftNumSlashes
web_itditp: 86
The number of slashes in Url
|
NumSlashes
web_itditp: 87
The number of slashes in Url
|
LeftWatchVideo
web_itditp: 88
The presence of a built-in video player on the page
|
WatchVideo
web_itditp: 89
The presence of a built-in video player on the page
|
LeftYandexAdv
web_itditp: 122
On the site there is an advertisement for Yandex.
|
YandexAdv
web_itditp: 123
On the site there is an advertisement for Yandex.
|
LeftNoSpam
web_itditp: 124
The spam classifier for Picks from Antispam recognized the site as not (!) spam, i.e. 0 = spam, 1 = good.
|
NoSpam
web_itditp: 125
The spam classifier for Picks from Antispam recognized the site as not (!) spam, i.e. 0 = spam, 1 = good.
|
LeftIsWiki
web_itditp: 128
page from ru.wikipedia.org
|
IsWiki
web_itditp: 129
page from ru.wikipedia.org
|
LeftOwnerNavQuota
web_itditp: 158
The share of clicks for navigation requests
|
OwnerNavQuota
web_itditp: 159
The share of clicks for navigation requests
|
LeftOwnerSessNormDuration
web_itditp: 164
ND/K normalized time to click
|
OwnerSessNormDuration
web_itditp: 165
ND/K normalized time to click
|
LeftRegIsWiki
web_itditp: 250
A document from the language section of Wikipedia corresponding to the user region
|
RegIsWiki
web_itditp: 251
A document from the language section of Wikipedia corresponding to the user region
|
LeftNews
web_itditp: 278
This is news (determined by characteristic ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayaformula/tekushichiekomponenty/klassificacionnye?v=tkd#h45859-3 patterns in the URL))).
|
LeftLong
web_itditp: 281
Long document (the longer the document, the greater the value of the factor).
|
LeftPureText
web_itditp: 282
Long text without links.
|
LeftRoot
web_itditp: 283
This is the main (root) page of a site.
|
LeftAddTime
web_itditp: 285
The time the page was added; a larger value means an older document. The square root of the time is taken and mapped to the interval [0, 1] so that 3+ years gives 1.
|
LeftIsMainPage
web_itditp: 286
If this is the main page of the owner (most often a second-level domain, for example xxxx.ru), the factor is 1. For free hosting services, personal blog platforms, etc. (for example, LiveJournal, Narod.ru), third-level domains (such as xxxxx.narod.ru) also get a factor of 1.
|
LeftHops
web_itditp: 287
The number of hops to the URL (fewer hops means closer to the main page and a lower value: 0 is the main page, 1 means the page cannot be reached from the main page, and a value strictly between 0 and 1 means it can be reached). The normal value for the host root is 0.0039.
|
LeftIsBlog
web_itditp: 289
Page from a blog hosting platform
|
LeftIsLivejournal
web_itditp: 290
Page from Livejournal.com
|
LeftTextFeatures
web_itditp: 291
Text quality. It is computed by a rather complex formula
|
LeftTextLike
web_itditp: 292
Text quality (Alekseev's classifier)
|
LeftDocLen
web_itditp: 293
Document length in sentences
|
LeftUrlLen
web_itditp: 294
The length of the URL, divided by 5
|
LeftYabarUrlAvgTime
web_itditp: 299
The time a user spends on the page, averaged over users. It is computed as the difference between adjacent transitions.
|
LeftUrlQueryVariety
web_itditp: 300
The degree of variety of the requests for which this URL is clicked
|
LeftEngLang
web_itditp: 305
Document language - English
|
LeftCyrLang
web_itditp: 306
The language of the document is Cyrillic
|
LeftAuraDocLogAuthor
web_itditp: 309
Logarithm of the number of shingles on which this owner of the document is recognized as the author
|
LeftAuraDocLogOrigin
web_itditp: 312
Logarithm of the number of shingles in the document added by the owner of the site as original texts in ((http://wiki.yandex-team.ru/jandekspoisk/jekosistema/marketingPr/webmasters/plan/vtorcontect of originality plugin)). It does not participate in the formula, it is needed to disconnect the takes
|
LeftYabarUrlLcAc
web_itditp: 322
The number of sessions in which the URL was the last, relative to the number of sessions in which the URL appeared
|
LeftMetrikaUrlAvgTime
web_itditp: 324
Similar to YabarUrlAvgTime
|
News
web_itditp: 373
This is news (determined by characteristic ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayaformula/tekushichiekomponenty/klassificacionnye?v=tkd#h45859-3 patterns in the URL))).
|
Long
web_itditp: 376
Long document (the longer the document, the greater the value of the factor).
|
PureText
web_itditp: 377
Long text without links.
|
Root
web_itditp: 378
This is the main (root) page of a site.
|
AddTime
web_itditp: 380
The time the page was added; a larger value means an older document. The square root of the time is taken and mapped to the interval [0, 1] so that 3+ years gives 1.
|
IsMainPage
web_itditp: 381
If this is the main page of the owner (most often a second-level domain, for example xxxx.ru), the factor is 1. For free hosting services, personal blog platforms, etc. (for example, LiveJournal, Narod.ru), third-level domains (such as xxxxx.narod.ru) also get a factor of 1.
|
Hops
web_itditp: 382
The number of hops to the URL (fewer hops means closer to the main page and a lower value: 0 is the main page, 1 means the page cannot be reached from the main page, and a value strictly between 0 and 1 means it can be reached). The normal value for the host root is 0.0039.
|
IsBlog
web_itditp: 384
Page from a blog hosting platform
|
IsLivejournal
web_itditp: 385
Page from Livejournal.com
|
TextFeatures
web_itditp: 386
Text quality. It is computed by a rather complex formula
|
TextLike
web_itditp: 387
Text quality (Alekseev's classifier)
|
DocLen
web_itditp: 388
Document length in sentences
|
UrlLen
web_itditp: 389
The length of the URL, divided by 5
|
YabarUrlAvgTime
web_itditp: 394
The time a user spends on the page, averaged over users. It is computed as the difference between adjacent transitions.
|
UrlQueryVariety
web_itditp: 395
The degree of variety of the requests for which this URL is clicked
|
EngLang
web_itditp: 400
Document language - English
|
CyrLang
web_itditp: 401
The language of the document is Cyrillic
|
AuraDocLogAuthor
web_itditp: 404
Logarithm of the number of shingles on which this owner of the document is recognized as the author
|
AuraDocLogOrigin
web_itditp: 407
Logarithm of the number of shingles in the document added by the owner of the site as original texts in ((http://wiki.yandex-team.ru/jandekspoisk/jekosistema/marketingPr/webmasters/plan/vtorcontect of originality plugin)). It does not participate in the formula, it is needed to disconnect the takes
|
YabarUrlLcAc
web_itditp: 417
The number of sessions in which the URL was the last, relative to the number of sessions in which the URL appeared
|
MetrikaUrlAvgTime
web_itditp: 419
Similar to YabarUrlAvgTime
|
LeftAddTimeMP
web_itditp: 473
The time the main page of the owner (host?) was added; remapped in the same way as AddTime.
|
LeftYaBarCoreHost
web_itditp: 481
The core of the audience of the hosts according to Yandex.Mrazusing
|
LeftYabarHostInternalTraffic
web_itditp: 487
The share of visits to the site that are not via links (typed by hand or from bookmarks)
|
LeftYabarHostAvgTime
web_itditp: 488
The active continuous time (in seconds) a user spends on host pages, averaged over users
|
LeftYabarHostAvgTime2
web_itditp: 489
The active continuous time (in seconds) a user spends on the pages of the host, averaged over users. Based on the internal counter of Yandex.Bar/Elements/Browser
|
AddTimeMP
web_itditp: 542
The time the main page of the owner (host?) was added; remapped in the same way as AddTime.
|
YaBarCoreHost
web_itditp: 550
The core of the audience of the hosts according to Yandex.Mrazusing
|
YabarHostInternalTraffic
web_itditp: 556
The share of visits to the site that are not via links (typed by hand or from bookmarks)
|
YabarHostAvgTime
web_itditp: 557
The active continuous time (in seconds) a user spends on host pages, averaged over users
|
YabarHostAvgTime2
web_itditp: 558
The active continuous time (in seconds) a user spends on the pages of the host, averaged over users. Based on the internal counter of Yandex.Bar/Elements/Browser
|
DaterAddTime80Hours
web_itditp: 766
Computed as (80 - x), where x is the age of the document in hours (continuous). Uses the data of the Robotaddtime dater
|
DaterAddTime10Days
web_itditp: 767
Computed as (10 - x), where x is the age of the document in days (continuous). Uses the data of the Robotaddtime dater
|
DaterAddTime3Years
web_itditp: 768
Computed as (3 - x), where x is the age of the document in years (continuous). Uses the data of the Robotaddtime dater
|
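The three DaterAddTime factors above share one shape, a horizon minus a continuous value x, with x read here as the document's age. A minimal sketch under that reading follows. The function name, the timestamp inputs, and the 365.25-day year are assumptions; the source does not say whether negative results are clamped or how the value is normalized afterwards, so the raw difference is returned.

```python
from datetime import datetime

SECONDS_PER_HOUR = 3600.0
SECONDS_PER_DAY = 86400.0
SECONDS_PER_YEAR = 365.25 * SECONDS_PER_DAY  # assumption: average year length


def dater_add_time(added_at, now, horizon, unit_seconds):
    """Return horizon - x, where x is the continuous age in the given unit."""
    age = (now - added_at).total_seconds() / unit_seconds
    return horizon - age


now = datetime(2023, 1, 10, 12, 0)
added = datetime(2023, 1, 9, 12, 0)  # exactly 24 hours earlier
hours_80 = dater_add_time(added, now, 80, SECONDS_PER_HOUR)  # 80 - 24
days_10 = dater_add_time(added, now, 10, SECONDS_PER_DAY)    # 10 - 1
years_3 = dater_add_time(added, now, 3, SECONDS_PER_YEAR)    # just under 3
```

The three horizons give the same signal at different resolutions: the 80-hour variant separates documents that are hours old, while the 3-year variant barely moves for them.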
WordCount
web_l1: 10
min(number of words in the request / 10, 1.f)
|
InvWordCount
web_l1: 24
1 / number of words in the request
|
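WordCount and InvWordCount can be sketched in a few lines, reading the InvWordCount description as one over the number of words in the request. Splitting on whitespace and returning 0 for an empty request are assumptions; the real tokenization is not described in this list, and the function names are hypothetical.

```python
def word_count_factor(request):
    """min(number of words / 10, 1): saturates at ten words."""
    return min(len(request.split()) / 10, 1.0)


def inv_word_count_factor(request):
    """1 / number of words; 0.0 for an empty request (assumption)."""
    words = len(request.split())
    return 1.0 / words if words else 0.0


request = "Moscow Peter Yekaterinburg Samara"
wc = word_count_factor(request)       # 4 / 10
inv = inv_word_count_factor(request)  # 1 / 4
```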
LongQuery
web_l1: 25
The sum of the IDFs of the words of the request. The name does not reflect the essence: for example, for the request 'Gadyach' this factor will be larger than for the request 'Moscow Peter Yekaterinburg Samara'.
|
PopularQ
web_l1: 27
The popularity of the request
|
QDiversity
web_l1: 28
The degree of centralization of the points from which the request is set
|
CountryPopularQ
web_l1: 30
The popularity of the request within the country
|
QueryCommercialityMx
web_l1: 31
A measure of how 'commercial' the request is. It is a composite factor computed by a Matrixnet formula from the purchase vocabulary in Direct, user queries, and additional intent dictionaries. For requests with an intent to buy the factor tends to 1; for commodity requests, to 0.6; for requests with no intent to buy (reviews, etc.), to 0. ((http://wiki.yandex-team.ru/Faktorydljanovogokatorazaprosov Factors of the Classifier)) ((http://wiki.yandex-team.ru/jandekspoisk/antispam/antiseo/klassifikATORCHESSKIXZAPROSOV STUNITURE OF HIM))
|
IsNavMxQuery
web_l1: 32
Rank 'navigation'
|
VideoQuery
web_l1: 33
Request about the video
|
IsOrg
web_l1: 38
The request is the name of the organization (example: Gazprom, Gazprom) ((http://wiki.yandex-team.ru/arsengadzhikurbanov/warees Description))
|
QClassDownload
web_l1: 43
= 1 - v. Download formula. Class requests: download/watch online/play/photo/listen
|
QClassKak
web_l1: 44
question
|
CountryQDiversity
web_l1: 53
The degree of centralization of the points from which the request is set (inside the country)
|
MetaMetaNonzeroAddTimeMP
web_meta_itditp: 103
Meta:Nonzero metafactor on web_itditp:AddTimeMP(542)
|
MetaSDDFT_SUM_WF_NORM_SUM_WHops
web_meta_itditp: 140
SD:DFT_SUM_WF_NORM_SUM_W metafactor on web_itditp:Hops(382)
|
MetaMetaNonzeroYabarUrlAvgTime
web_meta_itditp: 141
Meta:Nonzero metafactor on web_itditp:YabarUrlAvgTime(394)
|
MetaAvgTRp1
web_meta: 0
Avg metafactor on TRp1(4)
|
MetaAvgNews
web_meta: 1
Avg metafactor on News(11)
|
MetaAvgAddTime
web_meta: 2
Avg metafactor on AddTime(41)
|
MetaMaxOwnerNavQuota
web_meta: 14
Max metafactor on OwnerNavQuota(506)
|
MetaRmsTRp1
web_meta: 22
Rms metafactor on TRp1(4)
|
MetaRmsAddTime
web_meta: 23
Rms metafactor on AddTime(41)
|
MetaRmsQueryDOwnerClicksPCTR
web_meta: 24
Rms metafactor on QueryDOwnerClicksPCTR(77)
|
BoostOwnerByCgi
web_meta: 26
set feature to 1 when url owner in &rearr/scheme_Local/SetBoostOwners/Owners=‘host1.ru,host2.ru’
|
MetaMaxQueryURLClicksPCTR
web_meta: 105
Max metafactor on QueryURLClicksPCTR(max 45)
|
MetaRmseUrlQueryVariety
web_meta: 187
Rmse metafactor on Production:UrlQueryVariety(261)
|
MetaNonzeroDaterAge
web_meta: 189
Nonzero metafactor on Production:DaterAge(380)
|
MetaAvgLongestText
web_meta: 191
Avg metafactor on Production:LongestText(410)
|
MetaFractQUrlStatPower
web_meta: 193
Fract metafactor on Production:QUrlStatPower(433)
|
MetaDFT_SUM_WF_NORM_SUM_WHasNoQueryURLShows
web_meta: 249
DFT_SUM_WF_NORM_SUM_W metafactor on Production:HasNoQueryURLShows(63)
|
MetaMetaResidDocLen
web_meta: 337
Meta:Resid metafactor on web_production:DocLen(110)
|
MetaMetaResidSynFLremap1
web_meta: 338
Meta:Resid metafactor on web_production:SynFLremap1(335)
|
IsReservedOwner
web_meta: 502
From reseved_owner namespace in SaaS-KV
|
IsReservedUrl
web_meta: 503
From reseved_url namespace in SaaS-KV
|
IsReservedOwnerFast
web_meta: 504
From reseved_owner namespace in SaaS-KV for fast reaction
|
IsReservedUrlFast
web_meta: 505
From reseved_url namespace in SaaS-KV for fast reaction
|
PopularQ
web_new_l1: 3
The popularity of the request
|
QDiversity
web_new_l1: 4
The degree of centralization of the points from which the request is set
|
Diversity2
web_new_l1: 5
Geographical distribution of the request
|
HourDiversity
web_new_l1: 6
How strongly the issuing of the request varies across times of the day
|
NightQuery
web_new_l1: 9
The request is issued mainly at night
|
MorningQuery
web_new_l1: 10
The request is issued mainly in the morning
|
DayQuery
web_new_l1: 11
The request is issued mainly in the afternoon
|
EveningQuery
web_new_l1: 12
The request is issued mainly in the evening
|
CountryPopularQ
web_new_l1: 13
The popularity of the request within the country
|
CountryQDiversity
web_new_l1: 14
The degree of centralization of the points from which the request is set (inside the country)
|
CountryQDiversity2
web_new_l1: 15
Geographical distribution of the request within the country
|
MusicQ
web_new_l1: 18
The musicality of the request. The result of Anton Konygin's wizard.
|
QueryNonCommerciality
web_new_l1: 19
How commercial the request is according to the dictionary of phrases from Direct: 0 is maximally commercial, 1 is minimally commercial.
|
QueryCommercialityMx
web_new_l1: 20
A measure of how 'commercial' the request is. It is a composite factor computed by a Matrixnet formula from the purchase vocabulary in Direct, user queries, and additional intent dictionaries. For requests with an intent to buy the factor tends to 1; for commodity requests, to 0.6; for requests with no intent to buy (reviews, etc.), to 0. ((http://wiki.yandex-team.ru/Faktorydljanovogokatorazaprosov Factors of the Classifier)) ((http://wiki.yandex-team.ru/jandekspoisk/antispam/antiseo/klassifikATORCHESSKIXZAPROSOV STUNITURE OF HIM))
|
IsNavMxQuery
web_new_l1: 21
Rank 'navigation'
|
NonCommercialQuery
web_new_l1: 22
Binary non-commercial request flag: QueryNonCommerciality > 0.965.
|
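The binarization above is a strict threshold on the QueryNonCommerciality value. A minimal sketch follows, with a hypothetical function name and the flag assumed to be encoded as 0 or 1:

```python
NON_COMMERCIAL_THRESHOLD = 0.965  # threshold quoted in the factor description


def non_commercial_query(query_non_commerciality):
    """Return 1 when QueryNonCommerciality strictly exceeds the threshold."""
    return 1 if query_non_commerciality > NON_COMMERCIAL_THRESHOLD else 0


flag_high = non_commercial_query(0.97)   # above the threshold
flag_edge = non_commercial_query(0.965)  # the comparison is strict
```

Since 1 is the minimally commercial end of QueryNonCommerciality, only requests very close to that end are flagged.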
PornoQuery
web_new_l1: 23
Whether the request contains any words from Yweb/Pornofilter/Porno.query.
|
XPornoQuery
web_new_l1: 24
Classifier of porn requests; uses a different dictionary than PornoQuery
|
IsNavQuery
web_new_l1: 26
Whether the request is navigational, judged by clicks on the results
|
IsForeignQuery
web_new_l1: 27
Request is not in Russian
|
VideoQuery
web_new_l1: 28
Request about the video
|
GeoRegionalityUNew
web_new_l1: 32
Request factors: the result of the ((http://wiki.yandex-team.ru/PoiskovajaPlatforma/Lingvistika/ZaprosnyjeFactory/LocalizovannyjeZaprosy request geolocalization classifier)), a new version of factors [328]-[330]: U: localizing the request is meaningless;
|
GeoRegionalityRNew
web_new_l1: 33
Request factors: the result of the ((http://wiki.yandex-team.ru/PoiskovajaPlatforma/Lingvistika/ZaprosnyjeFactory/LocalizovannyjeZaprosy request geolocalization classifier)), a new version of factors [328]-[330]: R, georelevant: regional results in the output could be useful, but nothing more;
|
GeoRegionalityVNew
web_new_l1: 34
Request factors: the result of the ((http://wiki.yandex-team.ru/PoiskovajaPlatforma/Lingvistika/ZaprosnyjeFactory/LocalizovannyjeZaprosy request geolocalization classifier)), a new version of factors [328]-[330]: V: regional results are of fundamental importance.
|
QrTur
web_new_l1: 38
The predicted share of “good” (at least two different cities and frequency >= 10) references to the request with geography in Turkey
|
NearbyQuery
web_new_l1: 39
When answering the request, results in the immediate vicinity are important ([pharmacies], [children's clinic])
|
CityQuery
web_new_l1: 40
When answering a request, the results within the city are important (the bulk of localized queries)
|
AdmQuery
web_new_l1: 41
When answering the request, results from the user's administrative region or district are important ([airport], [dairy])
|
IsOrg
web_new_l1: 44
The request is the name of the organization (example: Gazprom, Gazprom) ((http://wiki.yandex-team.ru/arsengadzhikurbanov/warees Description))
|
QClassDownload
web_new_l1: 51
= 1 - v. Download formula. Class requests: download/watch online/play/photo/listen
|
QClassBrandnames
web_new_l1: 52
The result of the request classifier: the request contains words from the corresponding dictionary. Brands
|
QClassDisease
web_new_l1: 53
Medication Dictionary
|
QClassKak
web_new_l1: 54
question
|
QClassMoscow
web_new_l1: 55
Specific request for Moscow
|
QClassOAO
web_new_l1: 56
organization
|
QClassPorno
web_new_l1: 57
porn
|
QClassTravel
web_new_l1: 58
trips
|
DiversityCategDownload
web_new_l1: 61
0 or 1 - whether the request is matured by the tickt
|
HasPornoQuery
web_new_l1: 79
The result of the Adult rules for the wizard.
|
NewsAgenQuality
unknown: 7
|