Tag: TG_LEFT_DOC
(308 ranking factors)
Factors |
---|
LeftIsPorno
web_itditp: 0
The document is from a porn site.
|
LeftWikiInfobox
web_itditp: 2
There is a link to this URL from infoboxes on Wikipedia.
|
LeftIsComm
web_itditp: 4
Document from a commercial site. Not used (deprecated).
|
LeftIsFake
web_itditp: 6
Fast document
|
LeftIsSEO
web_itditp: 8
The page title contains commercial vocabulary. Not used (deprecated).
|
LeftIsEShop
web_itditp: 10
Commercial page (Savin's classifier)
|
LeftIsForum
web_itditp: 12
The URL matches the forum_detector regular expression
|
LeftIsObsolete
web_itditp: 14
The URL contains an old date; used to recognize outdated news. The factor is 1 if the URL contains a year <= 2007.
|
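The year rule for LeftIsObsolete can be sketched as follows. This is only an illustration under assumptions: the source does not say how the year is extracted, so the regular expression `YEAR_RE` and the helper name `is_obsolete` are hypothetical.

```python
import re

# Hypothetical pattern: a standalone four-digit year (19xx or 20xx)
# that is not part of a longer run of digits.
YEAR_RE = re.compile(r"(?<!\d)(19\d{2}|20\d{2})(?!\d)")


def is_obsolete(url: str, cutoff_year: int = 2007) -> int:
    """Return 1 if the URL contains a year <= cutoff_year, else 0."""
    years = [int(y) for y in YEAR_RE.findall(url)]
    return int(any(y <= cutoff_year for y in years))
```

For instance, a URL with `/2006/` in its path would get 1, while a six-digit item number that merely contains "2006" would not.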
LeftHasPayments
web_itditp: 16
The page mentions payment by SMS.
|
LeftClickedWithAnotherSEClicks
web_itditp: 18
Clicks on URLs shown in search results for queries after which users went on to search in other search engines
|
LeftShowsWithAnotherSEClicks
web_itditp: 20
Shows of URLs in search results for queries after which users went on to search in other search engines
|
LeftEshopValue
web_itditp: 22
Degree to which the page looks like an online shop
|
LeftPornoValue
web_itditp: 24
Degree of pornographic content on the page
|
LeftIsPornoAdvert
web_itditp: 26
The page contains porn advertising
|
LeftPoetry
web_itditp: 28
Degree to which the document is poetry
|
LeftPoetryQuad
web_itditp: 30
Maximum poetry score over the document's quatrains
|
LeftSynS1
web_itditp: 32
Shows how unnatural the text is from the standpoint of the Russian language: an estimate of how likely the document text was produced by a synonymizer or other automatic generator. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAfermula/tekushhiekomponenty/antispam?v=1il#h58953-2 more))
|
LeftSynFLremap1
web_itditp: 34
Shows how unnatural the text is from the standpoint of the Russian language: an estimate of how likely the document text was produced by a synonymizer or other automatic generator. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAfermula/tekushhiekomponenty/antispam?v=1il#h58953-2 more))
|
LeftSynFLremap2
web_itditp: 36
Shows how unnatural the text is from the standpoint of the Russian language: an estimate of how likely the document text was produced by a synonymizer or other automatic generator. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAfermula/tekushhiekomponenty/antispam?v=1il#h58953-2 more))
|
LeftUrlSessNormDurRate
web_itditp: 38
nd/i
|
LeftSynPercentBadWordPairs
web_itditp: 40
An indicator of how unnatural the text is from the standpoint of the Russian language. The number of bad word pairs in the text, mapped to the interval [0, 1] by the formula z/(z+10)
|
LeftSynNumBadWordPairs
web_itditp: 42
The proportion of bad pairs among all pairs found in the table: z/(x+1), where z is the number of bad pairs in the text and x is the number of ((http://wiki.yandex-team.ru/evgenijjjgrechnikov/testSynonimizers 2000-navigable)) pairs
|
LeftNumLatinLetters
web_itditp: 44
The number of Latin letters in the text (not counting markup), mapped to [0, 1] by the formula n/(n+100)
|
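Several factors in this table squeeze an unbounded count into [0, 1] with a formula of the form x/(x+a), for example z/(z+10) and n/(n+100). A minimal sketch of that remapping follows; the function names are hypothetical, and treating "Latin letters" as ASCII alphabetic characters is an assumption.

```python
def saturate(x: float, a: float) -> float:
    """Map a non-negative value x into [0, 1) via x / (x + a)."""
    if x < 0 or a <= 0:
        raise ValueError("x must be >= 0 and a must be > 0")
    return x / (x + a)


def count_latin_letters(text: str) -> int:
    """Count ASCII alphabetic characters (assumed meaning of 'Latin letters')."""
    return sum(1 for ch in text if ch.isascii() and ch.isalpha())


def num_latin_letters_factor(text: str) -> float:
    """n / (n + 100), as stated for LeftNumLatinLetters."""
    return saturate(count_latin_letters(text), 100)
```

The constant a sets the half-way point: the result is 0.5 when x equals a.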
LeftHasBigPicture
web_itditp: 46
The page has a big picture
|
LeftRusWordsInText
web_itditp: 48
The number of words in the text (a word is whatever the lemmatizer extracted), mapped to [0, 1] by the formula x/(x+a)
|
LeftRusWordsInTitle
web_itditp: 50
The number of Russian words in the title
|
LeftMeanWordLength
web_itditp: 52
The average length of the word
|
LeftPercentWordsInLinks
web_itditp: 54
Words inside <a>...</a> tags as a percentage of all words
|
LeftPercentVisibleContent
web_itditp: 56
Words outside tags (outside the angle brackets <>) as a percentage of all words
|
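As a rough illustration of LeftPercentWordsInLinks, the sketch below counts whitespace-separated words inside and outside `<a>` elements using Python's standard `html.parser`. The class and function names are hypothetical, and splitting on whitespace is an assumption; the source does not describe the real tokenizer.

```python
from html.parser import HTMLParser


class LinkWordCounter(HTMLParser):
    """Counts all words in text nodes and those nested inside <a> tags."""

    def __init__(self) -> None:
        super().__init__()
        self.link_depth = 0
        self.total_words = 0
        self.link_words = 0

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self.link_depth += 1

    def handle_endtag(self, tag):
        if tag == "a" and self.link_depth > 0:
            self.link_depth -= 1

    def handle_data(self, data):
        n = len(data.split())
        self.total_words += n
        if self.link_depth > 0:
            self.link_words += n


def percent_words_in_links(html: str) -> float:
    """Return words inside <a>...</a> as a percentage of all words."""
    parser = LinkWordCounter()
    parser.feed(html)
    parser.close()
    if parser.total_words == 0:
        return 0.0
    return 100.0 * parser.link_words / parser.total_words
```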
LeftPercentFreqWords
web_itditp: 58
Words that are among the 200 most frequent words of the language, as a percentage of all words in the text
|
LeftPercentUsedFreqWords
web_itditp: 60
The number of the language's 500 most frequent words that are used in the text, divided by 500
|
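The two frequency-word factors above differ in what they count: the first is a share of tokens, the second a share of the dictionary. A small sketch, assuming the frequent-word list is supplied by the caller (the real 200- and 500-word lists are not given in the source) and returning fractions rather than percentages:

```python
from typing import Iterable, Set


def freq_words_token_share(words: Iterable[str], top_words: Set[str]) -> float:
    """Fraction of tokens in the text that belong to top_words."""
    words = list(words)
    if not words:
        return 0.0
    return sum(1 for w in words if w in top_words) / len(words)


def used_freq_words_share(words: Iterable[str], top_words: Set[str]) -> float:
    """Fraction of top_words that occur at least once in the text."""
    if not top_words:
        return 0.0
    return len(set(words) & top_words) / len(top_words)
```

With a four-word dictionary and the text "the cat and the dog", three of five tokens are frequent words, but only two of the four dictionary words are used.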
LeftTrigramsProb
web_itditp: 62
Logarithm of the geometric mean of the trigram probabilities in the text (the probability of a trigram is the number of its occurrences in the text divided by the total number of trigrams). Mapped to [0, 1] by the formula -x/(x+a)
|
LeftTrigramsCondProb
web_itditp: 64
Logarithm of the geometric mean of the conditional trigram probabilities. The conditional probability of a trigram is its probability divided by the probability of the bigram formed by its first two words
|
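The two trigram factors can be sketched directly from their definitions, using the fact that the logarithm of a geometric mean is the arithmetic mean of the logarithms. This is an illustration only: the function names are hypothetical, tokenization is left to the caller, and the final remapping to [0, 1] is omitted. Estimating the bigram probability as its count divided by the number of bigrams in the same text is an assumption.

```python
import math
from collections import Counter
from typing import Sequence


def trigram_log_geomean(words: Sequence[str]) -> float:
    """Mean of log P(trigram), with P = occurrences / total trigrams."""
    trigrams = list(zip(words, words[1:], words[2:]))
    if not trigrams:
        return 0.0
    counts = Counter(trigrams)
    total = len(trigrams)
    return sum(math.log(counts[t] / total) for t in trigrams) / total


def trigram_cond_log_geomean(words: Sequence[str]) -> float:
    """Mean of log(P(trigram) / P(bigram of its first two words))."""
    trigrams = list(zip(words, words[1:], words[2:]))
    bigrams = list(zip(words, words[1:]))
    if not trigrams:
        return 0.0
    tri_counts, bi_counts = Counter(trigrams), Counter(bigrams)
    n_tri, n_bi = len(trigrams), len(bigrams)
    logs = [
        math.log((tri_counts[t] / n_tri) / (bi_counts[t[:2]] / n_bi))
        for t in trigrams
    ]
    return sum(logs) / n_tri
```

Because trigrams and bigrams are normalized by different totals here, the ratio is not guaranteed to stay below 1 on short texts.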
LeftNumeralsPortion
web_itditp: 66
Shares of different parts of speech in the text. The share of numerals (among all words whose part of speech could be determined)
|
LeftParticlesPortion
web_itditp: 68
The share of particles
|
LeftAdjPronounsPortion
web_itditp: 70
The share of pronominal adjectives
|
LeftAdvPronounsPortion
web_itditp: 72
The share of pronominal adverbs
|
LeftVerbsPortion
web_itditp: 74
The share of verbs
|
LeftFemAndMasNounsPortion
web_itditp: 76
The share of words that can be both masculine and feminine nouns, but not neuter, among all nouns (examples: 'Hummingbird', a noun of indeterminate gender that can be resolved either way; 'Alexander', a case of homonymy).
|
LeftVideoRating
web_itditp: 78
The popularity of the video clip; supplied by the video service
|
LeftLongestText
web_itditp: 80
The size of the largest text segment (from the factor [18] puretext)
|
LeftHasLiRuCounter
web_itditp: 82
The presence of a LiveInternet counter
|
LeftUrlTrigrams
web_itditp: 84
A model trained per trigram on positive ('+') and negative ('-') URLs. It does not depend on the query.
|
LeftNumSlashes
web_itditp: 86
The number of slashes in Url
|
LeftWatchVideo
web_itditp: 88
The presence of an embedded video player on the page
|
LeftDownloadVideo
web_itditp: 90
The page offers video for download
|
LeftGskUrlModel
web_itditp: 92
The factor is computed from the URL text using the sequence classifier Quality/Seq/GSK
|
LeftSegmentAuxAlphasInText
web_itditp: 94
Number of letters in the AUX segment
|
LeftSegmentAuxSpacesInText
web_itditp: 96
The number of spaces in the AUX segment
|
LeftSegmentContentCommasInText
web_itditp: 98
The number of commas in the Content segment
|
LeftIsShop
web_itditp: 100
The page is a shop. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAformula/tekushhiekomponenty/opisanijafaktorov#SSHOP Description)). Not used (deprecated).
|
LeftUrlNGramsModel
web_itditp: 102
UrlNGramsModel ranking factor in ERF
|
LeftStaticTitleComm
web_itditp: 104
Degree of commerciality of the page title. Not used (deprecated).
|
LeftStaticTitleBM25Ex
web_itditp: 106
BM25 of the page title computed against the page text
|
LeftTrashAdv
web_itditp: 108
Degree to which the page is cluttered with trash advertising
|
LeftCommRus
web_itditp: 110
The document's weight against a single-word dictionary of commercial vocabulary
|
LeftURLClicksMaxGeoCityFRCWeight
web_itditp: 112
Normalized corrected clicks count by query with user's city(gc=) mentioned
|
LeftIsMobileBeauty
web_itditp: 114
Binary factor: whether the document is mobile-friendly. Taken from ERF.
|
LeftEmbedVideoBroken
web_itditp: 116
The page has a broken embedded video.
|
LeftSumFlashArea
web_itditp: 118
The ratio of the total area of all Flash blocks to the screen area
|
LeftAdv
web_itditp: 120
There is advertising on the site.
|
LeftYandexAdv
web_itditp: 122
The site carries Yandex advertising.
|
LeftNoSpam
web_itditp: 124
The spam classifier from Antispam recognized the site as not (!) spam, i.e. 0 = spam, 1 = good.
|
LeftIsWiki
web_itditp: 128
page from ru.wikipedia.org
|
LeftAdvAspam
web_itditp: 130
|
LeftIsLinkPessimised
web_itditp: 132
Antispam has penalized the site: all dynamic link factors are zeroed. Zerolnk.flt
|
LeftCommLinksSEOHosts
web_itditp: 138
The share of incoming paid links. An algorithm for recognizing commercial links is implemented. The factor is remapped to [0, 1] if the share of such links is 50%, otherwise 0. ((http://wiki.yandex-team.ru/svetlanashorina/topseolinks selection of wound sites))
|
LeftCountersSearchTraffic1
web_itditp: 144
Search traffic - transitions from search engines to the site (2nd formula)
|
LeftCountersSearchTraffic2
web_itditp: 146
Search traffic - transitions from search engines to the site (2nd formula)
|
LeftYabarHostVisitors
web_itditp: 148
The number of unique visitors, remapped exponentially
|
LeftYabarHostSearchTraffic
web_itditp: 150
The share of traffic from search engines
|
LeftOwnerSDiffClickEntropy
web_itditp: 152
Entropy of the click distribution
|
LeftOwnerSDiffShowEntropy
web_itditp: 154
Entropy of the show distribution
|
LeftOwnerSDiffCSRatioEntropy
web_itditp: 156
Entropy of the clicks/shows ratio distribution.
|
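The three entropy factors above all reduce a distribution to one number. A minimal sketch, assuming Shannon entropy with the natural logarithm over a list of non-negative counts; the source does not say over what the clicks or shows are distributed, nor which logarithm base or normalization is used, so `entropy` is a hypothetical helper.

```python
import math
from typing import Iterable


def entropy(counts: Iterable[float]) -> float:
    """Shannon entropy (natural log) of the distribution given by counts."""
    counts = [c for c in counts if c > 0]
    total = sum(counts)
    if total <= 0:
        return 0.0
    return -sum((c / total) * math.log(c / total) for c in counts)
```

An owner whose clicks all land in one bucket gets 0, while clicks spread evenly over many buckets give a high value.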
LeftOwnerNavQuota
web_itditp: 158
The share of clicks that come from navigational queries
|
LeftOwnerSatisfied4Rate
web_itditp: 160
The factor equals s4_r/(k_r+10), where s4_r is the number of clicks longer than 180 sec and k_r is the total number of clicks. Computed taking query reformulations into account.
|
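The formula s4_r/(k_r+10) is a long-click rate with additive smoothing in the denominator, so owners with very few clicks stay close to 0. A sketch under assumptions: the input is a plain list of click durations in seconds, the function name is hypothetical, and the handling of query reformulations mentioned in the description is not modeled.

```python
from typing import Iterable


def satisfied4_rate(
    click_durations_sec: Iterable[float],
    long_click_sec: float = 180.0,
    smoothing: float = 10.0,
) -> float:
    """Return s4_r / (k_r + smoothing) for one owner's clicks."""
    durations = list(click_durations_sec)
    k_r = len(durations)  # total number of clicks
    s4_r = sum(1 for d in durations if d > long_click_sec)  # long clicks
    return s4_r / (k_r + smoothing)
```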
LeftIsCom
web_itditp: 162
Domain in the .com zone
|
LeftOwnerSessNormDuration
web_itditp: 164
ND/K normalized time to click
|
LeftOwnerReqsPopularity
web_itditp: 170
The popularity of the owner in queries
|
LeftHostReliability
web_itditp: 176
The share of URLs that respond without errors
|
LeftOwnerIsCommercial
web_itditp: 178
|
LeftYabarHostSurfTrDpNdLeafLn
web_itditp: 182
Leaf length in Depth-Nodes space, computed for hosts
|
LeftYabarHostSurfTrNdTmGrDsp
web_itditp: 184
Dispersion of the angle in the space of Nodes Time, calculated for hosts
|
LeftYabarHostSurfTrNdTmLeafLn90
web_itditp: 186
0.9 quantile of the leaf length in Nodes-Time space, computed for hosts
|
LeftYabarHostSurfTrNdHgGr
web_itditp: 188
The average sung of inclination in the plane of the top
|
LeftBrowserHostDownloadProbability
web_itditp: 190
The probability of a download from the host after a click (based on Bar logs).
|
LeftLogCtrMean
web_itditp: 192
Weighted mean of log(query_clicks)/log(query_shows) for given host. Weights are proportional to log(query_shows) + 0.2.
|
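The weighted mean described for LeftLogCtrMean can be written out as below. This is a sketch, not the production computation: the function name and the `(clicks, shows)` input format are hypothetical, and skipping queries with fewer than one click or fewer than two shows (where the logarithm ratio is undefined) is an assumption.

```python
import math
from typing import Iterable, Tuple


def log_ctr_mean(query_stats: Iterable[Tuple[int, int]]) -> float:
    """Weighted mean of log(clicks)/log(shows); weight = log(shows) + 0.2."""
    weighted_sum = 0.0
    weight_total = 0.0
    for clicks, shows in query_stats:
        if clicks < 1 or shows < 2:
            continue  # ratio undefined: log(0) or division by log(1) = 0
        weight = math.log(shows) + 0.2
        weighted_sum += weight * math.log(clicks) / math.log(shows)
        weight_total += weight
    return weighted_sum / weight_total if weight_total else 0.0
```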
LeftUrlQueryVariety_Reg
web_itditp: 194
The degree of variety of the queries for which this URL is clicked, computed per region
|
LeftUrlSessNormDurRate_Reg
web_itditp: 196
nd/i
|
LeftYabarUrlVisits_Reg
web_itditp: 198
Regional visit count of the URL according to Yandex.Bar
|
LeftUrlShowsWithNextPageClicksP1
web_itditp: 200
|
LeftUrlShowsWithNextPageClicksP10
web_itditp: 202
The factor is used in Selectionrank. TG_UNUSED: should not be included in the formulas to avoid feedback
|
LeftUrlQueryTrigramsStatic
web_itditp: 204
Static trigram intersection of the URL and the queries by which users visited the URL.
|
LeftNHopChainsCountFrc
web_itditp: 206
The number of chains for the query / (the number of chains in which the URL participated + the number of chains for the query).
|
LeftNHopIsFinal
web_itditp: 208
The number of chains in which the URL was last, normalized by the total number of chains in which this URL appeared.
|
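Both NHop factors are simple ratios over navigation chains. The sketch below assumes a chain is an ordered list of URLs; that representation and both function names are hypothetical, since the source does not define how chains are stored.

```python
from typing import Sequence


def nhop_chains_count_frc(query_chains: int, url_chains: int) -> float:
    """query_chains / (url_chains + query_chains)."""
    denom = url_chains + query_chains
    return query_chains / denom if denom else 0.0


def nhop_is_final(chains: Sequence[Sequence[str]], url: str) -> float:
    """Share of the chains containing url in which url is the last element."""
    containing = [c for c in chains if url in c]
    if not containing:
        return 0.0
    final = sum(1 for c in containing if c[-1] == url)
    return final / len(containing)
```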
LeftVisitsFromWiki
web_itditp: 210
Number of transitions to URL from Wikipedia
|
LeftRegBrowserUserHub
web_itditp: 212
Indicator of the page as a hub (how many pages Bar users navigate to from it).
|
LeftUSLongPeriodUrlCtrReg
web_itditp: 214
Static URL factor in search sessions in 1600 days. Ordinary CTR. Localization to the level of countries.
|
LeftUSLongPeriodUrlDt3600AvgReg
web_itditp: 216
Static URL factor in search sessions in 1600 days. Average dwelltime, and DwellTime is cut from the session if more than 3600 seconds. Localization to the level of countries.
|
LeftUSLongPeriodUrlLongClickProbReg
web_itditp: 218
Static URL factor in search sessions in 1600 days. The probability that the URL click will be more than 120 seconds. Localization to the level of countries.
|
LeftUSLongPeriodUrlPositionAvgReg
web_itditp: 220
Static URL factor in search sessions in 1600 days. The average position of the URL for all requests. Localization to the level of countries.
|
LeftUSLongPeriodUrlShowsReg
web_itditp: 222
Static URL factor in search sessions in 1600 days. Logarithm of the number of shows. Localization to the level of countries.
|
LeftUSLongPeriodUrlMobileDt3600AvgReg
web_itditp: 224
Static URL factor for search sessions for 1600 days calculated on mobile sessions. Average dwelltime, and DwellTime is cut from the session if more than 3600 seconds. Localization to the level of countries.
|
LeftUSLongPeriodUrlMobileDt180AvgReg
web_itditp: 226
Static URL factor for search sessions for 1600 days calculated on mobile sessions. Average dwelltime, and DwellTime is cut from the session if more than 180 seconds. Localization to the level of countries.
|
LeftUBLongPeriodSearchPercentEndReg
web_itditp: 228
Static URL factor in browser logs for the maximum period. For the formula, see the wiki. Localization to the level of countries.
|
LeftUBLongPeriodLeavesCntReg
web_itditp: 230
Static URL factor in browser logs for the maximum period. The number of leaves in the URL's subtree. Here a leaf is a page from which there were no further transitions. Localization to the level of countries.
|
LeftUBLongPeriodDtUrlHChildrenCut600Reg
web_itditp: 232
Static URL factor in browser logs for the maximum period. The average time spent on the page and in all descendants of the page (URLS to which they switched) from the host. Cut if the total DT is more than 10 minutes. Localization to the level of countries.
|
LeftBeastUrlMeanPos
web_itditp: 234
The average position of the URL over all queries
|
LeftBeastUrlNumQueries
web_itditp: 236
Number of queries for the URL
|
LeftBrowserUrlDwellTimeRegionFrc
web_itditp: 238
The ratio of dwell time on the page in this region to dwell time on the page in all regions
|
LeftRegHostRank
web_itditp: 248
Computed the same way as the HostRank factor, but on a subgraph of the owner graph consisting of the owners from this region rather than on the whole graph. Region membership is determined by TLD, or by the presence in the index of pages from this owner that the geo or geoa classifier assigns to this region. Mapped the same way as the HostRank factor, from 0 to 1 with 256 gradations
|
LeftRegIsWiki
web_itditp: 250
A document from the language section of Wikipedia corresponding to the user region
|
LeftOwnerClicksPCTR_Reg
web_itditp: 252
The owner's query-independent clickability, computed separately per region
|
LeftOwnerSDiffClickEntropy_Reg
web_itditp: 254
Entropy of the click distribution. Regionalized
|
LeftOwnerSDiffShowEntropy_Reg
web_itditp: 256
Entropy of the show distribution. Regionalized
|
LeftOwnerSDiffCSRatioEntropy_Reg
web_itditp: 258
Entropy of the clicks/shows ratio distribution. Regionalized
|
LeftOwnerSessNormDuration_Reg
web_itditp: 260
ND/K normalized time to click
|
LeftOwnerSatisfied4Rate_Reg
web_itditp: 262
The factor equals s4_r/(k_r+10), where s4_r is the number of clicks longer than 180 sec and k_r is the total number of clicks. Computed taking query reformulations into account. Localized version
|
LeftOwnerCTRWithNextPageClicksP10
web_itditp: 264
|
LeftBrowserHostCntDwellTimeLog
web_itditp: 266
Mean logarithm of user dwell time on the host, localized by country; computed from Yabar logs
|
LeftCommercialOwnerRank_Reg
web_itditp: 268
Classifier of the commerciality of the site
|
LeftBeastHostMeanPos
web_itditp: 270
The average position of the host over all queries
|
LeftBeastHostNumQueries
web_itditp: 272
Number of queries for the host
|
LeftYabarHostBrowseRank_Reg
web_itditp: 274
Implementation of the algorithm described in the article http://research.microsoft.com/en-us/people/tyliu/fp032-liu.pdf, computed over large regions (tube)
|
LeftBrowserHostDwellTimeRegionFrc
web_itditp: 276
The ratio of dwell time on the host in this region to dwell time on the host in all regions
|
LeftNews
web_itditp: 278
This is news (detected by characteristic ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayaformula/tekushichiekomponenty/klassificacionnye?v=tkd#h45859-3 patterns in the URL))).
|
LeftShop
web_itditp: 279
This is a shop offer page (detected by characteristic ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayafformula/tekushichiekomponenty/klassificacionnye?v=tkd#h45859-4 patterns in the URL))). Not used (deprecated).
|
LeftCat
web_itditp: 280
This is a catalog (determined by the characteristic ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayafformula/tekushhiekomponenty/klassificacionnye? .
|
LeftLong
web_itditp: 281
Long document (the longer the document, the greater the value of the factor).
|
LeftPureText
web_itditp: 282
Long text without links.
|
LeftRoot
web_itditp: 283
This is the site's main (root) page.
|
LeftRusLang
web_itditp: 284
The language of the document is Russian.
|
LeftAddTime
web_itditp: 285
The time since the page was added; larger means an older document. The square root of the time is taken and mapped to the interval [0, 1] so that 3+ years gives 1.
|
LeftIsMainPage
web_itditp: 286
The factor is 1 if this is the owner's main page (most often a second-level domain, for example xxxx.ru). For free hosting, personal blogs, etc. (for example, LiveJournal, narod.ru), third-level domains (such as xxxxx.narod.ru) also get a factor of 1.
|
LeftHops
web_itditp: 287
The number of hops from the main page to the URL (the closer to the main page, the lower the value: 0 is the main page, 1 means unreachable from the main page, and values strictly between 0 and 1 mean reachable from the main page). The usual value for a host root is 0.0039.
|
LeftUkrainian
web_itditp: 288
Equal to 1 if the site is geo-attributed to Ukraine (i.e. 1 means a Ukrainian site)
|
LeftIsBlog
web_itditp: 289
Page from a blog hosting platform
|
LeftIsLivejournal
web_itditp: 290
Page from Livejournal.com
|
LeftTextFeatures
web_itditp: 291
Text quality. Computed by a rather complex formula
|
LeftTextLike
web_itditp: 292
Text quality (Alekseev's classifier)
|
LeftDocLen
web_itditp: 293
Document length in sentences
|
LeftUrlLen
web_itditp: 294
The length of the URL, divided by 5
|
LeftIsHTML
web_itditp: 295
Document type - HTML
|
LeftIsUnreachable
web_itditp: 296
The page is unreachable via links from the main page.
|
LeftYabarUrlVisits
web_itditp: 297
Visit count of the URL according to Yandex.Bar
|
LeftYabarUrlVisitors
web_itditp: 298
The number of unique visitors to the URL
|
LeftYabarUrlAvgTime
web_itditp: 299
The time a user spends on the page, averaged over users. Computed as the time difference between consecutive transitions.
|
LeftUrlQueryVariety
web_itditp: 300
The degree of variety of the queries for which this URL is clicked
|
LeftIsCommByKeywords
web_itditp: 301
Commercial page by keywords. Not used (deprecated).
|
LeftAdultness
web_itditp: 302
equals 2 * NastyContent
|
LeftHostAdultness
web_itditp: 303
equals 2 * NastyContent
|
LeftKCHostAdultness
web_itditp: 304
always zero
|
LeftEngLang
web_itditp: 305
Document language - English
|
LeftCyrLang
web_itditp: 306
The document is in a Cyrillic-script language
|
LeftUrlHasNoDigits
web_itditp: 307
The URL contains no digits
|
LeftAuraDocLogShared
web_itditp: 308
Logarithm of the number of shingles on which this document is not unique
|
LeftAuraDocLogAuthor
web_itditp: 309
Logarithm of the number of shingles on which this owner of the document is recognized as the author
|
LeftAuraDocMeanSharedWeight
web_itditp: 310
The average weight of the non-unique shingles of this document
|
LeftSoft404
web_itditp: 311
The page is a '404' page (share of '404' tokens relative to the total number of tokens on the page)
|
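The token-share idea behind LeftSoft404 can be sketched as follows. Tokenizing on `\w+` is an assumption made for the example; the source does not describe the actual tokenizer, and `soft404_share` is a hypothetical name.

```python
import re


def soft404_share(page_text: str) -> float:
    """Share of tokens equal to '404' among all tokens on the page."""
    tokens = re.findall(r"\w+", page_text)
    if not tokens:
        return 0.0
    return tokens.count("404") / len(tokens)
```

A short "Error 404 page not found" stub scores far higher than a long article that happens to mention the number once.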
LeftAuraDocLogOrigin
web_itditp: 312
Logarithm of the number of shingles in the document that the site owner added as original texts in ((http://wiki.yandex-team.ru/jandekspoisk/jekosistema/marketingPr/webmasters/plan/vtorcontect of originality plugin)). Not used in the formula; needed for unmerging duplicates
|
LeftAuraDocMeanFltAuthorSource
web_itditp: 313
The average filtered number of authorship sources of the document. Not used in the formula; needed for unmerging duplicates
|
LeftLanguagePopularity
web_itditp: 314
The popularity of the document's language. A number from 0 to 1. (http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayaformula/tekushhiekomponenty/languaguaguagepopalarity)
|
LeftNumNonLettersInUrl
web_itditp: 315
The number of non-letter characters in the URL
|
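Several of the URL-shape factors in this table (LeftUrlLen, LeftNumSlashes, LeftUrlHasNoDigits, LeftNumNonLettersInUrl) can be read off the URL string directly. The sketch below computes raw values on the string exactly as passed in; whether the real factors include the scheme and host, and how the raw values are normalized, is not stated in the source, so those choices and the dictionary keys are assumptions.

```python
from typing import Dict


def url_shape_features(url: str) -> Dict[str, float]:
    """Raw URL-shape values computed on the string as given."""
    return {
        "url_len_div5": len(url) / 5,  # length divided by 5
        "num_slashes": url.count("/"),
        "has_no_digits": int(not any(ch.isdigit() for ch in url)),
        "num_non_letters": sum(1 for ch in url if not ch.isalpha()),
    }
```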
LeftUrlLen2
web_itditp: 316
The length of the URL, exact to the character. Disabled in production.
|
LeftIsHub
web_itditp: 317
Hub page
|
LeftUSLongPeriodUrlMobileDt180Avg
web_itditp: 318
Static URL factor for search sessions for 1600 days calculated on mobile sessions. Average dwelltime, and DwellTime is cut from the session if more than 180 seconds
|
LeftUSLongPeriodUrlMobileLongClickProb
web_itditp: 319
Static URL factor for search sessions for 1600 days calculated on mobile sessions. The probability that the URL click will be more than 120 seconds
|
LeftUSLongPeriodUrlMobileLossesProb
web_itditp: 320
Static URL factor for search sessions for 1600 days calculated on mobile sessions. The probability that the URL is not clicked given that at least one URL below it is clicked.
|
LeftMetrikaUrlVisits
web_itditp: 321
Similar to YabarUrlVisits
|
LeftYabarUrlLcAc
web_itditp: 322
The number of sessions in which the URL was last, divided by the number of sessions in which the URL appeared
|
LeftMetrikaUrlVisitors
web_itditp: 323
Similar to YabarUrlVisitors
|
LeftMetrikaUrlAvgTime
web_itditp: 324
Similar to YabarUrlAvgTime
|
LeftMetrikaUrlCoreAudience
web_itditp: 325
The core audience of the page, for pages that have a Metrika counter
|
LeftMetrikaUrlHostVisitTime
web_itditp: 326
The average time a user stays on the host after an external entry (from another, non-search site) via a specific URL
|
LeftMetrikaUrlHostVisitDepth
web_itditp: 327
The average 'depth' (the number of transitions within the host) of a user's stay on the host after an external entry (from another, non-search site) via a specific URL
|
LeftYabarUrlDownloads
web_itditp: 328
Estimate of the probability of downloads from the document
|
LeftIsIndexPage
web_itditp: 329
This is an index.(html/php/aspx?/...) page, without CGI parameters. Computed over all duplicates.
|
LeftIsIndexPageSoft
web_itditp: 330
This is an index.(html/php/aspx?/...) page, possibly with CGI parameters. Computed over all duplicates.
|
LeftIsOwner
web_itditp: 331
Whether the host is the owner, roughly host == Owner(host).
|
LeftMinPathLen
web_itditp: 332
The minimum PathAndQuery length over all near-duplicates.
|
LeftHasDownloadLinkOnFile
web_itditp: 333
The document has a direct link to the file
|
LeftHasDownloadLinkOnFileHosting
web_itditp: 334
The document has a link to filehosting
|
LeftHasUserReviews
web_itditp: 335
The document contains user review/comment
|
LeftDocCreateMonth
web_itditp: 336
Document creation time, to month precision: 1.0 is the current month, 0 is 10 years ago or older. Temporarily disabled
|
LeftDocUpdateMonth
web_itditp: 337
Document update time, to month precision: 1.0 is the current month, 0 is 10 years ago or older. Temporarily disabled
|
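The creation and update month factors only fix two points of their scale: 1.0 for the current month and 0 for 10 years or more. One way to fill in the values between them is linear interpolation, shown below. The linear shape is purely an assumption made for illustration (the source does not give the curve), and `month_recency` is a hypothetical name.

```python
def month_recency(age_months: float, horizon_months: float = 120.0) -> float:
    """1.0 for the current month, falling linearly to 0.0 at horizon_months."""
    if age_months < 0:
        raise ValueError("age_months must be non-negative")
    return max(0.0, 1.0 - age_months / horizon_months)
```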
LeftDaterStatsYearNormLikelihood
web_itditp: 338
Likelihood function of the distribution of years in the document. Temporarily disabled
|
LeftDaterStatsAverageSourceSegment
web_itditp: 339
The arithmetic mean position of dates in the document. Temporarily disabled
|
LeftSegmentWordPortionFromMainContent
web_itditp: 341
The share of the words of the document from the segments with Score> 2.
|
LeftTotalDups
web_itditp: 342
|
LeftWikiLinkCount
web_itditp: 343
|
LeftNastyContent
web_itditp: 344
Content ugliness factor.
|
LeftYabarUrlRevisits
web_itditp: 345
User returns to the URL
|
LeftBrowserBookmarksUrl
web_itditp: 346
The more users add to bookmarks a url, the more factor value it has
|
LeftIsNotCgi
web_itditp: 347
Factor for the presence of the '?' symbol in the URL. It is zero if the URL has CGI parameters (more precisely, if all duplicates have a '?' in the URL).
|
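The "all duplicates" rule for LeftIsNotCgi can be made concrete with a short sketch. It assumes the duplicate group is available as a list of URL strings, which the source does not specify, and the function name is hypothetical.

```python
from typing import Sequence


def is_not_cgi(duplicate_urls: Sequence[str]) -> int:
    """Return 0 only if every duplicate URL contains '?', otherwise 1."""
    if not duplicate_urls:
        raise ValueError("expected at least one URL in the duplicate group")
    return int(not all("?" in url for url in duplicate_urls))
```

A single clean URL in the duplicate group is enough for the factor to be 1.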
LeftPageHasMapsApi
web_itditp: 348
Equal to 1 if the page loads the JS API of any geodata provider
|
LeftUSLongPeriodUrlCtr
web_itditp: 349
Static URL factor in search sessions in 1600 days. Ordinary CTR.
|
LeftUSLongPeriodUrlDt3600Avg
web_itditp: 350
Static URL factor in search sessions in 1600 days. Average dwelltime, and DwellTime is cut from the session if more than 3600 seconds
|
LeftUSLongPeriodUrlDt180Avg
web_itditp: 351
Static URL factor in search sessions in 1600 days. Average dwelltime, and DwellTime is cut from the session if more than 180 seconds
|
LeftUSLongPeriodUrlLongClickProb
web_itditp: 352
Static URL factor in search sessions in 1600 days. The probability that the URL click will be more than 120 seconds
|
LeftUSLongPeriodUrlShows
web_itditp: 353
Static URL factor in search sessions in 1600 days. Logarithm of the number of shows.
|
LeftUSLongPeriodUrlWinsProb
web_itditp: 354
Static URL factor in search sessions in 1600 days. The probability that the URL is clicked given that at least one URL above it is not clicked.
|
LeftUSLongPeriodUrlLossesProb
web_itditp: 355
Static URL factor in search sessions in 1600 days. The probability that the URL is not clicked given that at least one URL below it is clicked.
|
LeftUBLongPeriodVisitsSNProb
web_itditp: 356
Static URL factor in browser logs for the maximum period. The percentage of traffic from social networks in all traffic from other hosts and search.
|
LeftUBLongPeriodDirectHChildren90CntFromExtHost
web_itditp: 357
Static URL factor in browser logs for the maximum period. The average number of direct descendants on the host on which users spent more than 90 seconds. A descendant is direct only if there is a link from our page to it and the user followed that link.
|
LeftUUBLongPeriodDepthFromExtHost
web_itditp: 358
Static URL factor in browser logs for the maximum period. The average maximum depth of the tree rooted at the current URL, when the URL is visited from other hosts.
|
LeftUBLongPeriodBrowseFrc
web_itditp: 359
Static URL factor in browser logs for the maximum period. The ratio of the number of times the page was the first one visited to the total number of pages visited from the SERP. The closer to 1, the more often the page was the only one opened in the session.
|
LeftUBLongPeriodAvgSearchDuration600
web_itditp: 360
Static URL factor in browser logs for the maximum period. The average duration of search sessions in which users came to the page from the SERP
|
LeftUBLongPeriodSearchPercentEnd
web_itditp: 361
Static URL factor in browser logs for the maximum period. For the formula, see the wiki.
|
LeftUBLongPeriodSearchPercentMiddle30
web_itditp: 362
Static URL factor in browser logs for the maximum period. For the formula, see the wiki.
|
LeftUBLongPeriodVisit120Prob
web_itditp: 363
Static URL factor in browser logs for the maximum period. The probability that the user will spend on the page> 120 seconds.
|
LeftUBLongPeriodLeavesCnt
web_itditp: 364
Static URL factor in browser logs for the maximum period. The number of leaves in the URL's subtree. Here a leaf is a page from which there were no further transitions.
|
LeftUBLongPeriodDtUrlHChildrenCut600
web_itditp: 365
Static URL factor in browser logs for the maximum period. The average time spent on the page and in all descendants of the page (URLS to which they switched) from the host. Cut off if the total DT is more than 10 minutes
|
LeftUBLongPeriodMinTimeWhenPageShow
web_itditp: 366
Static URL factor in browser logs for the maximum period. The minimum Unix Time when the page appeared in the logs for the first time.
|
LeftUBLongPeriodDeltaAvgMinTimeWhenPageShow
web_itditp: 367
Static URL factor in browser logs for the maximum period. The difference between the mean and minimum Unix time at which the page appeared in the logs.
|
LeftUBLongPeriodLatitude
web_itditp: 368
Static URL factor in browser logs for the maximum period. Current latitude from which the page was viewed.
|
LeftUBLongPeriodLongitude
web_itditp: 369
Static URL factor in browser logs for the maximum period. Current longitude from which the page was viewed.
|
LeftUBLongPeriodDownloadsProb
web_itditp: 370
Static URL factor in browser logs for the maximum period. The probability of downloads from the page
|
LeftUBLongPeriodDownloadsImageProb
web_itditp: 371
Static URL factor in browser logs for the maximum period. The probability of image downloads from the page
|
LeftUBLongPeriodDownloadsTorrentProb
web_itditp: 372
Static URL factor in browser logs for the maximum period. The probability of a torrent file download from the page
|
LeftYaBar
web_itditp: 469
Visit count from the Bar: ((http://wiki.yandex-team.ru/andrejjkostjagin/yabarlog/hoststat data description)). The factor is remapped.
|
LeftAddTimeMP
web_itditp: 473
The time the owner's (host's?) main page was added. Remapped like AddTime.
|
LeftOwnerClicksPCTR
web_itditp: 475
The owner's query-independent clickability
|
LeftSpam2
web_itditp: 477
Alekseev's automatic spam classifier: the probability that the site is spam (0 = not spam, 1 = spam)
|
LeftYaBarCoreOwner
web_itditp: 480
The core of the audience of owners according to Yandex.Mrazusing
|
LeftYaBarCoreHost
web_itditp: 481
The core of the audience of the hosts according to Yandex.Mrazusing
|
LeftHasYaBarCore
web_itditp: 482
Whether the host has a core audience
|
LeftHostSize
web_itditp: 483
Host size (Raskovalov's) in documents, without de-duplication (each duplicate counts as a separate document)
|
LeftNevasca1
web_itditp: 485
Content borrowing. Not used. 'Goodness' of the host (from 0 to 1), computed from how many and which hosts borrow content from it.
|
LeftNevasca2
web_itditp: 486
Content borrowing. Not used. 'Poorness' of the host (from 0 to 1): proportional to the amount of secondary content on the host.
|
LeftYabarHostInternalTraffic
web_itditp: 487
The share of visits to the site that are not via links (typed by hand or from bookmarks)
|
LeftYabarHostAvgTime
web_itditp: 488
The active continuous time (in sec) a user spends on the host's pages, averaged over users
|
LeftYabarHostAvgTime2
web_itditp: 489
The active continuous time (in sec) a user spends on the host's pages, averaged over users. Based on the internal Yandex.Bar/Elements/Browser counter
|
LeftYabarHostAvgActions
web_itditp: 490
The number of active actions (clicks, presses) during a user's continuous stay on the host's pages, averaged over users.
|
LeftYabarHostBrowseRank
web_itditp: 491
Implementation of the algorithm described in the article http://research.microsoft.com/en-us/people/tyliu/fp032-liu.pdf
|
LeftIsUa
web_itditp: 492
Domain in the .ua zone
|
LeftIsNotRu
web_itditp: 493
Domain is not in the .ru zone
|
LeftSeoInPayLinks
web_itditp: 500
The number of paid SEO links between hosts
|
LeftRankComGoodness
web_itditp: 501
Classifier for assessing commercial sites
|
LeftRankComGoodnessBar
web_itditp: 502
Classifier that approximates the quality of commercial sites based on user behavior data
|
LeftRankBoostGoodness
web_itditp: 503
The rank of site quality used for boosts of the Moscow commercial formula
|
LeftQueriesAvgCM2
web_itditp: 504
Average query commerciality
|
LeftMore90SecVisitsShare
web_itditp: 506
The share of visits for which the time spent during the day on the host is more than 90 seconds
|
LeftMore160SecVisitsShare
web_itditp: 507
The share of visits for which the time spent during the day on the host is more than 160 seconds
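
A minimal sketch of the two share factors above (LeftMore90SecVisitsShare, LeftMore160SecVisitsShare), assuming each visit is already reduced to the total time in seconds spent on the host during the day. The function name and the sample data are hypothetical.

```python
def long_visits_share(visit_seconds, threshold):
    """Share of visits whose time on the host exceeds `threshold` seconds."""
    visits = list(visit_seconds)
    if not visits:
        return 0.0  # assumption: a host with no visits gets 0
    return sum(1 for s in visits if s > threshold) / len(visits)


# Hypothetical per-visit durations for one host, in seconds.
durations = [12, 95, 200, 45, 161]
share_90 = long_visits_share(durations, 90)    # 3 of 5 visits
share_160 = long_visits_share(durations, 160)  # 2 of 5 visits
```

The same helper covers both factors; only the threshold differs.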
|
LeftRankHackedNovaPhp
web_itditp: 508
Rank of hacked sites
|
LeftRankAgs4
web_itditp: 509
AGS4 rank
|
LeftMaxQsDocClassQsRankPthQuerySpam
web_itditp: 510
Maximum QSRank for the owner
|
LeftAvgQsRankOnNotSubdomainDocs
web_itditp: 511
Average QSRank on the main domain
|
LeftVisitorsReturnMonthShare
web_itditp: 512
The share of users who returned within a month
|
LeftVisitorsReturnMonthNumber
web_itditp: 513
The number of users returning within a month
|
LeftRankXitDoor
web_itditp: 514
Rank of doorway sites
|
LeftAvgTitleCapitalLettersRatio
web_itditp: 515
Share of capital letters in the title
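
A minimal sketch of the capital-letter share, averaged over a host's titles as the `Avg` in the factor name suggests. The listing does not say whether the denominator counts all characters or only letters; this sketch assumes letters only, and both function names are hypothetical.

```python
def capital_letters_ratio(title: str) -> float:
    """Share of uppercase letters among all letters in one title."""
    letters = [ch for ch in title if ch.isalpha()]
    if not letters:
        return 0.0  # assumption: titles without letters contribute 0
    return sum(ch.isupper() for ch in letters) / len(letters)


def avg_title_capital_ratio(titles) -> float:
    """Average of the per-title ratio over all titles of a host."""
    titles = list(titles)
    if not titles:
        return 0.0
    return sum(capital_letters_ratio(t) for t in titles) / len(titles)
```

Because `str.isupper` and `str.isalpha` are Unicode-aware, the sketch also works for Cyrillic titles.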
|
LeftFromSearchShareNormalized
web_itditp: 516
The share of incoming traffic from search engines among all incoming traffic
|
LeftGreenTrafficShareNormalized
web_itditp: 517
The share of direct visits among all incoming traffic
|
LeftAvgQsFWnd500TOKEN
web_itditp: 518
Average QSRank in a sliding window
|
LeftMinOwnerQsRank
web_itditp: 519
Minimum QSRank for the owner
|
LeftAvgNumhops
web_itditp: 520
Average number of hops
|
LeftRankArtroz
web_itditp: 521
Rank of text quality on the host. The higher the value, the greater the likelihood that the host is full of rewritten articles: poor copies of content ordered on content exchanges. Fires more strongly than before aggregation.
|
LeftRandomLogHostHasPaymentsAvg
web_itditp: 522
AVG aggregation of HasPayments web factor using random log
|
LeftRandomLogHostIsVideoQueryAvg
web_itditp: 523
AVG aggregation of VideoQuery web factor using random log
|
LeftRandomLogHostSyntQualityAvg
web_itditp: 524
AVG aggregation of SyntQuality web factor using random log
|
LeftRandomLogHostGeoRegionalityVNewPerc90
web_itditp: 525
PERCENTILE_90 aggregation of GeoRegionalityVNew web factor using random log
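
A minimal sketch of what the AVG and percentile aggregations in the RandomLogHost entries could look like: values of one web factor are collected per host from a sample of log rows and then reduced. The row layout, the function names, and the nearest-rank percentile definition are assumptions; the listing does not specify how the random log is sampled or which percentile method is used.

```python
import math
from collections import defaultdict
from statistics import mean


def percentile(values, q):
    """Nearest-rank percentile of `values`, with q in (0, 100]."""
    ordered = sorted(values)
    rank = max(1, math.ceil(q / 100 * len(ordered)))
    return ordered[rank - 1]


def aggregate_by_host(log_rows, factor):
    """Group one factor's values by host and compute AVG, 25th and 90th percentile."""
    per_host = defaultdict(list)
    for row in log_rows:
        per_host[row["host"]].append(row[factor])
    return {
        host: {
            "avg": mean(vals),
            "perc25": percentile(vals, 25),
            "perc90": percentile(vals, 90),
        }
        for host, vals in per_host.items()
    }


# Hypothetical sampled log rows: one row per (query, document) impression.
rows = [
    {"host": "a.example", "IsMusic": 0},
    {"host": "a.example", "IsMusic": 1},
    {"host": "a.example", "IsMusic": 1},
    {"host": "a.example", "IsMusic": 0},
    {"host": "b.example", "IsMusic": 1},
]
stats = aggregate_by_host(rows, "IsMusic")
```

Each RandomLogHost entry then corresponds to one (factor, reduction) pair, for example `stats[host]["avg"]` for an AVG aggregation.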
|
LeftRandomLogHostQClassDownloadAvg
web_itditp: 526
AVG aggregation of QClassDownload web factor using random log
|
LeftRandomLogHostIsMusicAvg
web_itditp: 527
AVG aggregation of IsMusic web factor using random log
|
LeftRandomLogHostQueryThEncyclopedicPerc25
web_itditp: 528
PERCENTILE_25 aggregation of QueryThEncyclopedic web factor using random log
|
LeftRandomLogHostCommercialOwnerRankRegAvg
web_itditp: 529
AVG aggregation of CommercialOwnerRank_Reg web factor using random log
|
LeftRandomLogHostYabarWordDNGIPerc25
web_itditp: 530
PERCENTILE_25 aggregation of YabarWordDepthNodesGradientMin web factor using random log
|
LeftRandomLogHostPopularSEFRCBrowserAvg
web_itditp: 531
AVG aggregation of PopularSEFRCBrowser web factor using random log
|
LeftRandomLogHostURLClicksMaxGeoRegionFRCRatioAvg
web_itditp: 532
AVG aggregation of URLClicksMaxGeoRegionFRCRatio web factor using random log
|
LeftRandomLogHostUBLongPeriodDirectHChildren90CntPerc90
web_itditp: 533
PERCENTILE_90 aggregation of UBLongPeriodDirectHChildren90CntFromExtHost web factor using random log
|
LeftRandomLogHostUBLongPeriodDtUrlHChildrenPerc90
web_itditp: 534
PERCENTILE_90 aggregation of UBLongPeriodDtUrlHChildrenCut600Reg web factor using random log
|
LeftRandomLogHostIsPictureAvg
web_itditp: 535
AVG aggregation of IsPicture web factor using random log
|
LeftRandomLogHostErratumLogQueryProbabilityAvg
web_itditp: 536
AVG aggregation of ErratumLogQueryProbability web factor using random log
|
LeftKubrLang
web_itditp: 609
|
LeftIsNational
web_itditp: 612
|
LeftIsRu
web_itditp: 613
|
LeftIsKubr
web_itditp: 614
|
LeftRandomLogHostVisitsFromWikiAvg
web_itditp: 664
AVG aggregation of VisitsFromWiki web factor using random log
|
LeftRandomLogHostNavLinearPerc25
web_itditp: 668
PERCENTILE_25 aggregation of NavLinear web factor using random log
|
LeftRandomLogHostFoundPerc90
web_itditp: 670
PERCENTILE_90 aggregation of Found web factor using random log
|
LeftRandomLogHostSubqueryThMatchAvg
web_itditp: 672
AVG aggregation of SubqueryThMatch web factor using random log
|
LeftRandomLogHostSegmentWordPortionFromMainContentAvg
web_itditp: 676
AVG aggregation of SegmentWordPortionFromMainContent web factor using random log
|
LeftRandomLogHostXfDtShowAllMaxFFieldSet2Bm15FLogK0001Avg
web_itditp: 678
AVG aggregation of XfDtShowAllMaxFFieldSet2Bm15FLogK0001 web factor using random log
|
LeftRandomLogHostQueryRegionSizeAvg
web_itditp: 680
AVG aggregation of QueryRegionSize web factor using random log
|
LeftRandomLogHostIsRelevLocaleUAAvg
web_itditp: 684
AVG aggregation of IsRelevLocaleUA web factor using random log
|
LeftRandomLogHostQfufAllSumWFSumWFieldSet3BclmWeightedFLogW0K0001Perc90
web_itditp: 686
PERCENTILE_90 aggregation of QfufAllSumWFSumWFieldSet3BclmWeightedFLogW0K0001 web factor using random log
|
LeftRandomLogHostDssmBoostingCtrQuerySelfSimilarityPerc90
web_itditp: 688
PERCENTILE_90 aggregation of DssmBoostingCtrQuerySelfSimilarity web factor using random log
|
LeftRandomLogHostQueryToDocAllSumFCountTextBocm11Norm256Avg
web_itditp: 690
AVG aggregation of QueryToDocAllSumFCountTextBocm11Norm256 web factor using random log. NOTE: QueryToDocAllSumFCountTextBocm11Norm256 has been removed.
|
LeftRandomLogHostIsNavMxQueryPerc90
web_itditp: 692
PERCENTILE_90 aggregation of IsNavMxQuery web factor using random log
|
LeftRandomLogHostDBM15Wares2Avg
web_itditp: 696
AVG aggregation of DBM15Wares2 web factor using random log
|
LeftRandomLogHostUrlNGramsModelPerc90
web_itditp: 698
PERCENTILE_90 aggregation of UrlNGramsModel web factor using random log
|
LeftRandomLogHostDssmBoostingCtrKMeans1ScoreScaledSumWeightedQEPerc25
web_itditp: 704
PERCENTILE_25 aggregation of DssmBoostingCtrKMeans1ScoreScaledSumWeightedQE web factor using random log
|
LeftRandomLogHostLongClickMobileAllWcmWeightedValuePerc90
web_itditp: 706
PERCENTILE_90 aggregation of LongClickMobileAllWcmWeightedValue web factor using random log
|
LeftRandomLogHostDssmVkPopularityPerc25
web_itditp: 708
PERCENTILE_25 aggregation of DssmVkPopularity web factor using random log
|
LeftRandomLogHostUBLongPeriodVisitsSNProbAvg
web_itditp: 710
AVG aggregation of UBLongPeriodVisitsSNProb web factor using random log
|
LeftRandomLogHostCountryQueryRegionalityPerc90
web_itditp: 712
PERCENTILE_90 aggregation of CountryQueryRegionality web factor using random log
|
LeftRandomLogHostTRhitwPerc90
web_itditp: 714
PERCENTILE_90 aggregation of TRhitw web factor using random log
|
LeftRandomLogHostUBLongPeriodAvgSearchDuration600Perc90
web_itditp: 716
PERCENTILE_90 aggregation of UBLongPeriodAvgSearchDuration600 web factor using random log
|
LeftRandomLogHostRequestIsFromIOSAvg
web_itditp: 718
AVG aggregation of RequestIsFromIOS web factor using random log
|
LeftRandomLogHostDssmQueryEmbeddingCtrNoMinerPca4Perc90
web_itditp: 720
PERCENTILE_90 aggregation of DssmQueryEmbeddingCtrNoMinerPca4 web factor using random log
|
LeftRandomLogHostXfDtShowAllMaxFFieldSetUTBm15FLogW0Avg
web_itditp: 722
AVG aggregation of XfDtShowAllMaxFFieldSetUTBm15FLogW0 web factor using random log
|
LeftRandomLogHostUrlTrigramsPerc25
web_itditp: 724
PERCENTILE_25 aggregation of UrlTrigrams web factor using random log
|
LeftRandomLogHostDssmQueryEmbeddingCtrNoMinerPca1Perc90
web_itditp: 726
PERCENTILE_90 aggregation of DssmQueryEmbeddingCtrNoMinerPca1 web factor using random log
|
LeftRandomLogHostIsRelevLocaleKZAvg
web_itditp: 728
AVG aggregation of IsRelevLocaleKZ web factor using random log
|
LeftRandomLogHostTextFeaturesPerc90
web_itditp: 730
PERCENTILE_90 aggregation of TextFeatures web factor using random log
|
LeftNewsAgencyRating
web_itditp: 732
Rating of news agency from agencies.json (Yandex.News resource)
|
LeftHasJsFromMarketgidCom
web_itditp: 734
1 if the host includes JS from marketgid.com
|
LeftHasJsFromRfityCom
web_itditp: 736
1 if the host includes JS from rfity.com
|
LeftHasJsFromFacebookNet
web_itditp: 742
1 if the host includes JS from facebook.net
|