Tag: TG_RIGHT_DOC
(308 ranking factors)
Factors |
---|
IsPorno
web_itditp: 1
Document from porn kitski
|
WikiInfobox
web_itditp: 3
On danny url is a link from inFobox-ov to Wikipedia.
|
IsComm
web_itditp: 5
A document from a commercial clay. Not used (depreded)
|
IsFake
web_itditp: 7
Fast document
|
IsSEO
web_itditp: 9
The page title contains commercial vocabulary. Not used (depreded)
|
IsEShop
web_itditp: 11
Commercial page (Classifier Savina)
|
IsForum
web_itditp: 13
URL satisfies forum_detector regularly
|
IsObsolete
web_itditp: 15
The URL has an ancient date. Ancient news are recognized. Factor 1 if there is a year in Url <= 2007.
|
HasPayments
web_itditp: 17
On the page there is about 'Payment SMS '.
|
ClickedWithAnotherSEClicks
web_itditp: 19
Clicks on the urlahs shown in the issuance for requests, by which they went to look for other search engines
|
ShowsWithAnotherSEClicks
web_itditp: 21
Urlov shows in the issuance for requests, by which they went to look for other search engines
|
EshopValue
web_itditp: 23
Stage of the page
|
PornoValue
web_itditp: 25
Pornography of the page
|
IsPornoAdvert
web_itditp: 27
On the Porn Advertising page
|
Poetry
web_itditp: 29
The poetry of the document
|
PoetryQuad
web_itditp: 31
The maximum poetry of the quatrain
|
SynS1
web_itditp: 33
Show how much the text is unnatural from the point of view of the Russian language. Assessment of how much the text of the document can be considered as a generated synonymizer or automatic. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAfermula/tekushhiekomponenty/antispam?v=1il#h58953-2 more))
|
SynFLremap1
web_itditp: 35
Show how much the text is unnatural from the point of view of the Russian language. Assessment of how much the text of the document can be considered as a generated synonymizer or automatic. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAfermula/tekushhiekomponenty/antispam?v=1il#h58953-2 more))
|
SynFLremap2
web_itditp: 37
Show how much the text is unnatural from the point of view of the Russian language. Assessment of how much the text of the document can be considered as a generated synonymizer or automatic. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAfermula/tekushhiekomponenty/antispam?v=1il#h58953-2 more))
|
UrlSessNormDurRate
web_itditp: 39
nd/i
|
SynPercentBadWordPairs
web_itditp: 41
An indicator of the unnaturalness of the text from the point of view of the Russian language. The number of bad pairs of words in the text, transferred to the segment [0.1] according to the Z/(Z+10) formula
|
SynNumBadWordPairs
web_itditp: 43
The proportion of bad steam among all found in the table: Z/(x+1), where Z 342 200 223 The number of bad couples in the text, and X 342 200 223 number ((http: //wiki.yandex- Team.ru/evgenijjjgrechnikov/testSynonimizers 2000-navigable)) steam
|
NumLatinLetters
web_itditp: 45
The number of Latin letters in the text (not counting the markings) driven into [0.1] formula n/(n+100)
|
HasBigPicture
web_itditp: 47
The page has a big picture
|
RusWordsInText
web_itditp: 49
The number of words in the text (the word is what the lemmeter selected) is displayed in [0.1] according to the formula x/(x+a)
|
RusWordsInTitle
web_itditp: 51
The number of words of the Russian language in the title
|
MeanWordLength
web_itditp: 53
The average length of the word
|
PercentWordsInLinks
web_itditp: 55
The percentage of the number of words inside the tag <a> .. </a> from the number of all words
|
PercentVisibleContent
web_itditp: 57
The percentage of the number of words outside the tags (outside the brackets <>) from the number of all words
|
PercentFreqWords
web_itditp: 59
The percentage of the number of words, which are 200 the most frequent words of the language, from the number of all words of the text
|
PercentUsedFreqWords
web_itditp: 61
The number used in the text 500 of the most popular words of the language, divided by 500
|
TrigramsProb
web_itditp: 63
Logarithm of average geometric probabilities of trigrams in the text. (the probability of a trigram - the number of its meetings in the text, divided by the number of all trigrams) is displayed in [0.1] according to the formula -x (x+a)
|
TrigramsCondProb
web_itditp: 65
Logarithm of the average geometric conditional probabilities of trigrams. The conditional probability of a trigram is its probability, divided by the probability of a bigram from the first two words
|
NumeralsPortion
web_itditp: 67
The share of different parts of speech in the text. The share of numerals (among all words that managed to recognize part of the speech)
|
ParticlesPortion
web_itditp: 69
The share of particles
|
AdjPronounsPortion
web_itditp: 71
The share of pronoun adjectives
|
AdvPronounsPortion
web_itditp: 73
The proportion of pronoun nouns
|
VerbsPortion
web_itditp: 75
The share of verbs
|
FemAndMasNounsPortion
web_itditp: 77
The share of words that can be both masculine nouns and nouns of the feminine, but not of the middle kind, among all nouns (examples: 'Hummingbird ' - an example of an indefinite kind that can be determined in two ways, 'Alexander ' - homonymy).
|
VideoRating
web_itditp: 79
The popularity of the video roller comes from the video
|
LongestText
web_itditp: 81
The size of the largest text segment (from the factor [18] puretext)
|
HasLiRuCounter
web_itditp: 83
The presence of a LiveInternet meter
|
UrlTrigrams
web_itditp: 85
Model with the training of each trigram on '+' and '-' urlah. It does not depend on the request.
|
NumSlashes
web_itditp: 87
The number of slashes in Url
|
WatchVideo
web_itditp: 89
The presence of a built -in video player on the page
|
DownloadVideo
web_itditp: 91
Video for downloading
|
GskUrlModel
web_itditp: 93
The factor is calculated from the text of Url using the classifier of sequences Quality/Seq/GSK
|
SegmentAuxAlphasInText
web_itditp: 95
Number of letters in the AUX segment
|
SegmentAuxSpacesInText
web_itditp: 97
The number of spaces in the AUX segment
|
SegmentContentCommasInText
web_itditp: 99
The number of commas in the Content segment
|
IsShop
web_itditp: 101
Page 342 200 224 Shop. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAformula/tekushhiekomponenty/opisanijafaktorov#SSHOP Description)). Not used (depreded)
|
UrlNGramsModel
web_itditp: 103
Urlngramsmodel ranking factor in ERF
|
StaticTitleComm
web_itditp: 105
The degree of commerce page title. Not used (depreded)
|
StaticTitleBM25Ex
web_itditp: 107
BM25 page title by its text
|
TrashAdv
web_itditp: 109
The greasy of the page
|
CommRus
web_itditp: 111
The weight of the document on a monosyllabic dictionary of commercial vocabulary
|
URLClicksMaxGeoCityFRCWeight
web_itditp: 113
Normalized corrected clicks count by query with user's city(gc=) mentioned
|
IsMobileBeauty
web_itditp: 115
The binary factor about the mobile adaptability of the document. It is taken from ERF
|
EmbedVideoBroken
web_itditp: 117
A broken built -in video on the page.
|
SumFlashArea
web_itditp: 119
the ratio of the total area of ​​all Flash blocks to the screen area
|
Adv
web_itditp: 121
There is advertising on the site.
|
YandexAdv
web_itditp: 123
On the site there is an advertisement for Yandex.
|
NoSpam
web_itditp: 125
The Classifier of Spam for Picks from Antispam recognized the site not (!) Spam. Those. 0 = spam, 1 = good.
|
IsWiki
web_itditp: 129
page from ru.wikipedia.org
|
AdvAspam
web_itditp: 131
|
IsLinkPessimised
web_itditp: 133
Antispamers pessimized the site - all dynamic linseed factors are reset. Zerolnk.flt
|
CommLinksSEOHosts
web_itditp: 139
The share of incoming corrupt links. The algorithm for recognition of commercial links is implemented. The factor will be remarked to [0.1] if the share of such links is 50%, otherwise 0. ((http://wiki.yandex-team.ru/svetlanashorina/topseolinks selection of wound sites))))))
|
CountersSearchTraffic1
web_itditp: 145
Search traffic - transitions from search engines to the site (2nd formula)
|
CountersSearchTraffic2
web_itditp: 147
Search traffic - transitions from search engines to the site (2nd formula)
|
YabarHostVisitors
web_itditp: 149
The number of unique visitors, remarks exponentially
|
YabarHostSearchTraffic
web_itditp: 151
The share of traffic from search engines
|
OwnerSDiffClickEntropy
web_itditp: 153
Entropy - distribution of clicks
|
OwnerSDiffShowEntropy
web_itditp: 155
Entropy - distribution of shows
|
OwnerSDiffCSRatioEntropy
web_itditp: 157
Entropy - Distribution of clique/shows.
|
OwnerNavQuota
web_itditp: 159
The share of clicks for navigation requests
|
OwnerSatisfied4Rate
web_itditp: 161
This is the SEA factor = s4_r/ (k_r+10) where S4_R is the number of clicks> 180 sec, k_r - the total number of clicks. It is considered taking into account reformalization.
|
IsCom
web_itditp: 163
Domna in Zone .com
|
OwnerSessNormDuration
web_itditp: 165
ND/K normalized time to click
|
OwnerReqsPopularity
web_itditp: 171
The popularity of Owner 'And in the requests
|
HostReliability
web_itditp: 177
The share of the Urlov that respond without errors
|
OwnerIsCommercial
web_itditp: 179
|
YabarHostSurfTrDpNdLeafLn
web_itditp: 183
The length of the Depth Nodes petal counted for hosts
|
YabarHostSurfTrNdTmGrDsp
web_itditp: 185
Dispersion of the angle in the space of Nodes Time, calculated for hosts
|
YabarHostSurfTrNdTmLeafLn90
web_itditp: 187
0.9-quarter of the length of the petal in the space of Nodes Time, calculated for hosts
|
YabarHostSurfTrNdHgGr
web_itditp: 189
The average sung of inclination in the plane of the top
|
BrowserHostDownloadProbability
web_itditp: 191
The likelihood of a racing from a host after click (on the logs of the bar).
|
LogCtrMean
web_itditp: 193
Weighted mean of log(query_clicks)/log(query_shows) for given host. Weights are proportional to log(query_shows) + 0.2.
|
UrlQueryVariety_Reg
web_itditp: 195
The degree of variety of requests for which this Urla click is read by regions
|
UrlSessNormDurRate_Reg
web_itditp: 197
nd/i
|
YabarUrlVisits_Reg
web_itditp: 199
Regional attendance of Urla according to the I-Bara
|
UrlShowsWithNextPageClicksP1
web_itditp: 201
|
UrlShowsWithNextPageClicksP10
web_itditp: 203
The factor is used in Selectionrank. TG_UNUSED: should not be included in the formulas to avoid feedback
|
UrlQueryTrigramsStatic
web_itditp: 205
Static trigrams intercection of url and queries by which users visited the url.
|
NHopChainsCountFrc
web_itditp: 207
The number of chains on request / (the number of chains in which URL + the number of chains on request participated).
|
NHopIsFinal
web_itditp: 209
The number of chains in which Url was the last normalized for the total number of chains in which this URL was.
|
VisitsFromWiki
web_itditp: 211
Number of transitions to URL from Wikipedia
|
RegBrowserUserHub
web_itditp: 213
The page indicator is like a hub (how many pages are the bar users pass from it).
|
USLongPeriodUrlCtrReg
web_itditp: 215
Static URL factor in search sessions in 1600 days. Ordinary CTR. Localization to the level of countries.
|
USLongPeriodUrlDt3600AvgReg
web_itditp: 217
Static URL factor in search sessions in 1600 days. Average dwelltime, and DwellTime is cut from the session if more than 3600 seconds. Localization to the level of countries.
|
USLongPeriodUrlLongClickProbReg
web_itditp: 219
Static URL factor in search sessions in 1600 days. The probability that the URL click will be more than 120 seconds. Localization to the level of countries.
|
USLongPeriodUrlPositionAvgReg
web_itditp: 221
Static URL factor in search sessions in 1600 days. The average position of the URL for all requests. Localization to the level of countries.
|
USLongPeriodUrlShowsReg
web_itditp: 223
Static URL factor in search sessions in 1600 days. Logarithm of the number of shows. Localization to the level of countries.
|
USLongPeriodUrlMobileDt3600AvgReg
web_itditp: 225
Static URL factor for search sessions for 1600 days calculated on mobile sessions. Average dwelltime, and DwellTime is cut from the session if more than 3600 seconds. Localization to the level of countries.
|
USLongPeriodUrlMobileDt180AvgReg
web_itditp: 227
Static URL factor for search sessions for 1600 days calculated on mobile sessions. Average dwelltime, and DwellTime is cut from the session if more than 180 seconds. Localization to the level of countries.
|
UBLongPeriodSearchPercentEndReg
web_itditp: 229
Static URL factor in browser logs for the maximum period. The formula for calculating the factor we look at Wiki. Localization to the level of countries.
|
UBLongPeriodLeavesCntReg
web_itditp: 231
Static URL factor in browser logs for the maximum period. The number of leaves in URLA support. In this case, the leaves are a page from which there were no transitions. Localization to the level of countries.
|
UBLongPeriodDtUrlHChildrenCut600Reg
web_itditp: 233
Static URL factor in browser logs for the maximum period. The average time spent on the page and in all descendants of the page (URLS to which they switched) from the host. Cut if the total DT is more than 10 minutes. Localization to the level of countries.
|
BeastUrlMeanPos
web_itditp: 235
The average position of Urla for all requests
|
BeastUrlNumQueries
web_itditp: 237
Number of requests for URL
|
BrowserUrlDwellTimeRegionFrc
web_itditp: 239
The attitude of Dwell Time on the page in this region to Dwell Time on a page in all regions
|
RegHostRank
web_itditp: 249
It reads in the same way as the Hostrank factor, but not on all the Owner graph, but on its subrack, consisting of Owner 'OV of this region. Belonging to the region is determined by TLD, or by the presence of pages in the index from this Owner 'A, about which Geo or Geoa, the classifier says that they are from this region. Mapped in the same way as the Hostrank factor, from 0 to 1 with 256 gradations
|
RegIsWiki
web_itditp: 251
A document from the language section of Wikipedia corresponding to the user region
|
OwnerClicksPCTR_Reg
web_itditp: 253
The owner's clickness regardless of the request, separately in the regions
|
OwnerSDiffClickEntropy_Reg
web_itditp: 255
Entropy is the distribution of clicks. Regionalized
|
OwnerSDiffShowEntropy_Reg
web_itditp: 257
Entropy is the distribution of shows. Regionalized
|
OwnerSDiffCSRatioEntropy_Reg
web_itditp: 259
Entropy - distribution of clique/shows. Regionalized
|
OwnerSessNormDuration_Reg
web_itditp: 261
ND/K normalized time to click
|
OwnerSatisfied4Rate_Reg
web_itditp: 263
This is the SEA factor = s4_r/ (k_r+10) where S4_R is the number of clicks> 180 sec, k_r - the total number of clicks. It is considered taking into account reformalization. Localized version
|
OwnerCTRWithNextPageClicksP10
web_itditp: 265
|
BrowserHostCntDwellTimeLog
web_itditp: 267
Middle Logarithm of the user on the host with localization in the country; It is considered according to Yabar logs
|
CommercialOwnerRank_Reg
web_itditp: 269
Classifier of the commerciality of the site
|
BeastHostMeanPos
web_itditp: 271
The average position of the host for all requests
|
BeastHostNumQueries
web_itditp: 273
Number of requests for host
|
YabarHostBrowseRank_Reg
web_itditp: 275
Implementation of the algorithm described in the article ((http://wiki.yandex-team.ru//h.yandex.net/?http%3A%2F%2FreseRosoft.microsoft.com%2Fen-US%2FPEOPLIULIUUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUUP032-LIUUUU .pdf http://research.microsoft.com/en-us/people/tyliu/fp032-liu.pdf)) by large regions (tube)
|
BrowserHostDwellTimeRegionFrc
web_itditp: 277
The attitude of Dwell Time on a host in this region to Dwell Time on a host in all regions
|
News
web_itditp: 373
This is the news (determined by the characteristic ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayaformula/tekushichiekomponenty/klassificacionnye?v=tkd#h45859-3 Patterns in URL $))))).
|
Shop
web_itditp: 374
This is a proposal store (determined by the characteristic ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayafformula/tekushichiekomponenty/klassificacionnye?v=tkd#h45859-4 Patterns in Url '))))))))). Not used (depreded)
|
Cat
web_itditp: 375
This is a catalog (determined by the characteristic ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayafformula/tekushhiekomponenty/klassificacionnye? .
|
Long
web_itditp: 376
Long document (the longer the document, the greater the value of the factor).
|
PureText
web_itditp: 377
Long text without links.
|
Root
web_itditp: 378
This is a muzzle.
|
RusLang
web_itditp: 379
The language of the document is Russian.
|
AddTime
web_itditp: 380
The time of adding a page, more - a more old document; The root is placed from time displayed at the interval [0.1] so that 3+ years gives 1.
|
IsMainPage
web_itditp: 381
If the main page of the owner (most often a second -level domain, for example xxxx.ru), then the factor is 1. For bums, hosting, personal blogs, etc. (for example, Lifejornal, People.ru, etc.) - domains of the third level (such as xxxxx.narod.ru) will also have an equal factor 1.
|
Hops
web_itditp: 382
The number of hops of Url inpans (such as less - closer to the muzzle, the lower the value (0 - the muzzle, 1 - from the muzzle cannot be reached, 0 <can get from the muzzle <1). Normal value for the root of the nosta 0.0039).
|
Ukrainian
web_itditp: 383
It is equal to one if the site has a Ukrainian geoist (i.e. 1 - Ukrainian site)
|
IsBlog
web_itditp: 384
Page from the blogochosting
|
IsLivejournal
web_itditp: 385
Page with Livejournal.com
|
TextFeatures
web_itditp: 386
The quality of the text. It is considered a rather complex formula
|
TextLike
web_itditp: 387
Text quality (classifier Alekseev)
|
DocLen
web_itditp: 388
Document length in sentences
|
UrlLen
web_itditp: 389
The length of the URL 'A, divided by 5
|
IsHTML
web_itditp: 390
Document type - HTML
|
IsUnreachable
web_itditp: 391
The page is unattainable by the links from the muzzle.
|
YabarUrlVisits
web_itditp: 392
Varla's attendance according to I-Bara
|
YabarUrlVisitors
web_itditp: 393
The number of unique visitors to Urla
|
YabarUrlAvgTime
web_itditp: 394
The average for users time is the user on the page. It is read as the difference between neighboring transitions.
|
UrlQueryVariety
web_itditp: 395
The degree of variety of requests for which this Urla click
|
IsCommByKeywords
web_itditp: 396
Commercial page by keywords. Not used (depreded)
|
Adultness
web_itditp: 397
equals 2 * NastyContent
|
HostAdultness
web_itditp: 398
equals 2 * NastyContent
|
KCHostAdultness
web_itditp: 399
always zero
|
EngLang
web_itditp: 400
Document language - English
|
CyrLang
web_itditp: 401
The language of the document is Cyrillic
|
UrlHasNoDigits
web_itditp: 402
There are no numbers in Urla
|
AuraDocLogShared
web_itditp: 403
Logarithm of the number of shingles on which this document is not unique
|
AuraDocLogAuthor
web_itditp: 404
Logarithm of the number of shingles on which this owner of the document is recognized as the author
|
AuraDocMeanSharedWeight
web_itditp: 405
The average weight of non-ugly shingles of this document
|
Soft404
web_itditp: 406
Page 342 200 224 '404 ' (share of tokens '404 ' in relation to the total number of tokens on the page)
|
AuraDocLogOrigin
web_itditp: 407
Logarithm of the number of shingles in the document added by the owner of the site as original texts in ((http://wiki.yandex-team.ru/jandekspoisk/jekosistema/marketingPr/webmasters/plan/vtorcontect of originality plugin)). It does not participate in the formula, it is needed to disconnect the takes
|
AuraDocMeanFltAuthorSource
web_itditp: 408
The average filtered number of sources of authorship of the document. It does not participate in the formula, it is needed to disconnect the takes
|
LanguagePopularity
web_itditp: 409
The popularity of the language of the document. Number from 0 to 1. (http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayaformula/tekushhiekomponenty/languaguaguagepopalarity)))))))
|
NumNonLettersInUrl
web_itditp: 410
Number 'Nebukv ' in Url
|
UrlLen2
web_itditp: 411
The length of the URL 'and with an accuracy to the symbol. Disconnected in production.
|
IsHub
web_itditp: 412
Habi page
|
USLongPeriodUrlMobileDt180Avg
web_itditp: 413
Static URL factor for search sessions for 1600 days calculated on mobile sessions. Average dwelltime, and DwellTime is cut from the session if more than 180 seconds
|
USLongPeriodUrlMobileLongClickProb
web_itditp: 414
Static URL factor for search sessions for 1600 days calculated on mobile sessions. The probability that the URL click will be more than 120 seconds
|
USLongPeriodUrlMobileLossesProb
web_itditp: 415
Static URL factor for search sessions for 1600 days calculated on mobile sessions. The probability that URL is not clicks if they click at least one URL below.
|
MetrikaUrlVisits
web_itditp: 416
Similar to Yabarurlvisits
|
YabarUrlLcAc
web_itditp: 417
The number of sessions in which Url was the last, classified as the sessions in which Url appeared
|
MetrikaUrlVisitors
web_itditp: 418
Similar to Yabarurlvisitors
|
MetrikaUrlAvgTime
web_itditp: 419
Similar to Yabarurlavgtime
|
MetrikaUrlCoreAudience
web_itditp: 420
The core of the page of the pages on which there is a metric counter
|
MetrikaUrlHostVisitTime
web_itditp: 421
The average time of the user stay on the host with an external (from another non-search site) entry from a specific URL
|
MetrikaUrlHostVisitDepth
web_itditp: 422
Average 'depth ' (the number of transitions within the host) of the user stay on the host with an external (from another non-search site) entry from a particular URL
|
YabarUrlDownloads
web_itditp: 423
Assessment of the probability of leaps from the document
|
IsIndexPage
web_itditp: 424
This is Index. (HTML/PHP/ASPX?/...), without CGI parameters. It is considered to be for all takes.
|
IsIndexPageSoft
web_itditp: 425
This is Index. (HTML/PHP/ASPX?/...), possibly with CGI parameters. It is considered to be for all takes.
|
IsOwner
web_itditp: 426
Whether the host is the owner, conditionally host == Owner (Host).
|
MinPathLen
web_itditp: 427
The minimum length of Pathandquery for all half -shoes.
|
HasDownloadLinkOnFile
web_itditp: 428
The document has a direct link to the file
|
HasDownloadLinkOnFileHosting
web_itditp: 429
The document has a link to filehosting
|
HasUserReviews
web_itditp: 430
The document contains user review/comment
|
DocCreateMonth
web_itditp: 431
The time of creating a document with an accuracy of up to a month 1.0 is the current month, 0 342 200 224- 10 years ago and older. Temporarily disconnected
|
DocUpdateMonth
web_itditp: 432
The time for updating the document with an accuracy of up to a month 1.0 is the current month, 0 342 200 224- 10 years ago and older. Temporarily disconnected
|
DaterStatsYearNormLikelihood
web_itditp: 433
The function of the credibility of the distribution of years in the document. Temporarily disconnected
|
DaterStatsAverageSourceSegment
web_itditp: 434
The arithmetic mean position of dates in the document. Temporarily disconnected
|
SegmentWordPortionFromMainContent
web_itditp: 436
The share of the words of the document from the segments with Score> 2.
|
TotalDups
web_itditp: 437
|
WikiLinkCount
web_itditp: 438
|
NastyContent
web_itditp: 439
Content ugliness factor.
|
YabarUrlRevisits
web_itditp: 440
User return on URL
|
BrowserBookmarksUrl
web_itditp: 441
The more users add to bookmarks a url, the more factor value it has
|
IsNotCgi
web_itditp: 442
The factor about the presence of a symbol '? ' In Url. It is zero if the Url has CGI parameters (more precisely: all duplicate have a symbol '? ' In Url).
|
PageHasMapsApi
web_itditp: 443
Equal to one if the page connects JS-API of any geo-data supplier
|
USLongPeriodUrlCtr
web_itditp: 444
Static URL factor in search sessions in 1600 days. Ordinary CTR.
|
USLongPeriodUrlDt3600Avg
web_itditp: 445
Static URL factor in search sessions in 1600 days. Average dwelltime, and DwellTime is cut from the session if more than 3600 seconds
|
USLongPeriodUrlDt180Avg
web_itditp: 446
Static URL factor in search sessions in 1600 days. Average dwelltime, and DwellTime is cut from the session if more than 180 seconds
|
USLongPeriodUrlLongClickProb
web_itditp: 447
Static URL factor in search sessions in 1600 days. The probability that the URL click will be more than 120 seconds
|
USLongPeriodUrlShows
web_itditp: 448
Static URL factor in search sessions in 1600 days. Logarithm of the number of shows.
|
USLongPeriodUrlWinsProb
web_itditp: 449
Static URL factor in search sessions in 1600 days. The probability that URL is clicking if they do not click on at least one URL higher.
|
USLongPeriodUrlLossesProb
web_itditp: 450
Static URL factor in search sessions in 1600 days. The probability that URL is not clicks if they click at least one URL below.
|
UBLongPeriodVisitsSNProb
web_itditp: 451
Static URL factor in browser logs for the maximum period. The percentage of traffic from social networks in all traffic from other hosts and search.
|
UBLongPeriodDirectHChildren90CntFromExtHost
web_itditp: 452
Static URL factor in browser logs for the maximum period. The average number of direct descendants from the host on which they spent more than 90 seconds. The descendant is straight, only if there is a link from our page to the descendant and crossed it.
|
UUBLongPeriodDepthFromExtHost
web_itditp: 453
Static URL factor in browser logs for the maximum period. The average maximum depth of wood with the root in the current URL is when the URL is visited from other hosts.
|
UBLongPeriodBrowseFrc
web_itditp: 454
Static URL factor in browser logs for the maximum period. The number of times when the feather was transferred to the page to the total number of pages to which they switched from a sickle. The closer to 1, the more often the page was opened the only one in the session.
|
UBLongPeriodAvgSearchDuration600
web_itditp: 455
Static URL factor in browser logs for the maximum period. The average length of search sessions, when they switched to the page from a sickle
|
UBLongPeriodSearchPercentEnd
web_itditp: 456
Static URL factor in browser logs for the maximum period. The formula for calculating the factor we look at Wiki.
|
UBLongPeriodSearchPercentMiddle30
web_itditp: 457
Static URL factor in browser logs for the maximum period. The formula for calculating the factor we look at Wiki.
|
UBLongPeriodVisit120Prob
web_itditp: 458
Static URL factor in browser logs for the maximum period. The probability that the user will spend on the page> 120 seconds.
|
UBLongPeriodLeavesCnt
web_itditp: 459
Static URL factor in browser logs for the maximum period. The number of leaves in URLA support. In this case, the leaves are a page from which there were no transitions.
|
UBLongPeriodDtUrlHChildrenCut600
web_itditp: 460
Static URL factor in browser logs for the maximum period. The average time spent on the page and in all descendants of the page (URLS to which they switched) from the host. Cut off if the total DT is more than 10 minutes
|
UBLongPeriodMinTimeWhenPageShow
web_itditp: 461
Static URL factor in browser logs for the maximum period. The minimum Unix Time when the page appeared in the logs for the first time.
|
UBLongPeriodDeltaAvgMinTimeWhenPageShow
web_itditp: 462
Static URL factor in browser logs for the maximum period. The difference between the middle and minimum Unix Time when the page appeared in the logs.
|
UBLongPeriodLatitude
web_itditp: 463
Static URL factor in browser logs for the maximum period. Current breadth where the page was viewed from.
|
UBLongPeriodLongitude
web_itditp: 464
Static URL factor in browser logs for the maximum period. Current longitude where the page was viewed from.
|
UBLongPeriodDownloadsProb
web_itditp: 465
Static URL factor in browser logs for the maximum period. The likelihood of leaps from the page
|
UBLongPeriodDownloadsImageProb
web_itditp: 466
Static URL factor in browser logs for the maximum period. The likelihood of image jumps from the page
|
UBLongPeriodDownloadsTorrentProb
web_itditp: 467
Static URL factor in browser logs for the maximum period. The probability of leap torrent file from the page
|
YaBar
web_itditp: 538
Attendance from the bar - ((http://wiki.yandex-team.ru/andrejjkostjagin/yabarlog/hoststat data description)). The factor will be remarked.
|
AddTimeMP
web_itditp: 542
The time for adding the main page of the owner (host?) Will be remaped like Addtime.
|
OwnerClicksPCTR
web_itditp: 544
The owner's clickness regardless of the request
|
Spam2
web_itditp: 546
Automatic classifier spam named after Alekseeva, the likelihood that the website spam (0 is not spam, 1- spam)
|
YaBarCoreOwner
web_itditp: 549
The core of the audience of owners according to Yandex.Mrazusing
|
YaBarCoreHost
web_itditp: 550
The core of the audience of the hosts according to Yandex.Mrazusing
|
HasYaBarCore
web_itditp: 551
Does the host have a host
|
HostSize
web_itditp: 552
The size of the Host named after Raskovalov in the documents without taking into account the takes (each double is taken into account in the factor by an independent document)
|
Nevasca1
web_itditp: 554
The content of content is not used. 'Good ' host (from 0 to 1), calculated on the basis of how many and what kind of hosts the content is borrowed from this.
|
Nevasca2
web_itditp: 555
The content of content is not used. 'Poorness ' host (from 0 to 1) 342 200 223 is proportional to the number of secondary content on the host. 'Poorness ' host (from 0 to 1) 342 200 223 is proportional to the number of secondary content on the host.
|
YabarHostInternalTraffic
web_itditp: 556
The share of suits to the site is not by links (set with hands or from bookmarks)
|
YabarHostAvgTime
web_itditp: 557
average for users Active continuous time for user finding (in sec) on host pages
|
YabarHostAvgTime2
web_itditp: 558
The average for users Active continuous time of the user (in second) on the pages of the host. In the inside of the Yandex. Bara/elements/browser counter
|
YabarHostAvgActions
web_itditp: 559
The average for users is the number of active actions (clicks, clicks) with the continuous finding of the user (in second) on the pages of the host.
|
YabarHostBrowseRank
web_itditp: 560
Implementation of the algorithm described in the article ((http://wiki.yandex-team.ru//h.yandex.net/?http%3A%2F%2FreseRosoft.microsoft.com%2Fen-US%2FPEOPLIULIUUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUULYUUP032-LIUUUU .pdf http://research.microsoft.com/en-us/people/tyliu/fp032-liu.pdf))
|
IsUa
web_itditp: 561
Domain in the .ua zone
|
IsNotRu
web_itditp: 562
Domain is not in the .ru zone
|
SeoInPayLinks
web_itditp: 569
The number of COO-Thrilling links between hosts
|
RankComGoodness
web_itditp: 570
Classifier for estimates of commercial sites
|
RankComGoodnessBar
web_itditp: 571
Classifier that approximate the quality of commercial sites based on user behavior data
|
RankBoostGoodness
web_itditp: 572
The rank of site quality used for boosts of the Moscow commercial formula
|
QueriesAvgCM2
web_itditp: 573
Average query commerciality
|
More90SecVisitsShare
web_itditp: 575
The share of visits for which the time spent during the day on the host is more than 90 seconds
|
More160SecVisitsShare
web_itditp: 576
The share of visits for which the time spent during the day on the host is more than 160 seconds
|
RankHackedNovaPhp
web_itditp: 577
Rank of hacked sites
|
RankAgs4
web_itditp: 578
Rank AGS4
|
MaxQsDocClassQsRankPthQuerySpam
web_itditp: 579
Maximum QSRANK on the owner
|
AvgQsRankOnNotSubdomainDocs
web_itditp: 580
Average QSRANK on the main domain
|
VisitorsReturnMonthShare
web_itditp: 581
The share of users who returned within a month
|
VisitorsReturnMonthNumber
web_itditp: 582
The number of users returning within a month
|
RankXitDoor
web_itditp: 583
Rank Dorweev
|
AvgTitleCapitalLettersRatio
web_itditp: 584
Share of the capital letters in Title
|
FromSearchShareNormalized
web_itditp: 585
The share of incoming traffic from search engines among all incoming traffic
|
GreenTrafficShareNormalized
web_itditp: 586
The share of direct visits among all incoming traffic
|
AvgQsFWnd500TOKEN
web_itditp: 587
Middle QSRank in a sliding window
|
MinOwnerQsRank
web_itditp: 588
Minimum QSRANK
|
AvgNumhops
web_itditp: 589
Average HOPS
|
RankArtroz
web_itditp: 590
Rank of the quality of texts on the host. The higher 342 200 224 the greater the likelihood that the host is full of articles - a rewriting, a bad copy of the content ordered on the exchanges of content. Burning stronger as the before the aggregation.
|
RandomLogHostHasPaymentsAvg
web_itditp: 591
AVG aggregation of HasPayments web factor using random log
|
RandomLogHostIsVideoQueryAvg
web_itditp: 592
AVG aggregation of VideoQuery web factor using random log
|
RandomLogHostSyntQualityAvg
web_itditp: 593
AVG aggregation of SyntQuality web factor using random log
|
RandomLogHostGeoRegionalityVNewPerc90
web_itditp: 594
PERCENTALE_90 aggregation of GeoRegionalityVNew web factor using random log
|
RandomLogHostQClassDownloadAvg
web_itditp: 595
AVG aggregation of QClassDownload web factor using random log
|
RandomLogHostIsMusicAvg
web_itditp: 596
AVG aggregation of IsMusic web factor using random log
|
RandomLogHostQueryThEncyclopedicPerc25
web_itditp: 597
PERCENTALE_25 aggregation of QueryThEncyclopedic web factor using random log
|
RandomLogHostCommercialOwnerRankRegAvg
web_itditp: 598
AVG aggregation of CommercialOwnerRank_Reg web factor using random log
|
RandomLogHostYabarWordDNGIPerc25
web_itditp: 599
PERCENTALE_25 aggregation of YabarWordDepthNodesGradientMin web factor using random log
|
RandomLogHostPopularSEFRCBrowserAvg
web_itditp: 600
AVG aggregation of PopularSEFRCBrowser web factor using random log
|
RandomLogHostURLClicksMaxGeoRegionFRCRatioAvg
web_itditp: 601
AVG aggregation of URLClicksMaxGeoRegionFRCRatio web factor using random log
|
RandomLogHostUBLongPeriodDirectHChildren90CntPerc90
web_itditp: 602
PERCENTALE_90 aggregation of UBLongPeriodDirectHChildren90CntFromExtHost web factor using random log
|
RandomLogHostUBLongPeriodDtUrlHChildrenPerc90
web_itditp: 603
PERCENTALE_90 aggregation of UBLongPeriodDtUrlHChildrenCut600Reg web factor using random log
|
RandomLogHostIsPictureAvg
web_itditp: 604
AVG aggregation of IsPicture web factor using random log
|
RandomLogHostErratumLogQueryProbabilityAvg
web_itditp: 605
AVG aggregation of ErratumLogQueryProbability web factor using random log
|
KubrLang
web_itditp: 610
|
IsNational
web_itditp: 615
|
IsRu
web_itditp: 616
|
IsKubr
web_itditp: 617
|
RandomLogHostVisitsFromWikiAvg
web_itditp: 665
AVG aggregation of VisitsFromWiki web factor using random log
|
RandomLogHostNavLinearPerc25
web_itditp: 669
PERCENTALE_25 aggregation of NavLinear web factor using random log
|
RandomLogHostFoundPerc90
web_itditp: 671
PERCENTALE_90 aggregation of Found web factor using random log
|
RandomLogHostSubqueryThMatchAvg
web_itditp: 673
AVG aggregation of SubqueryThMatch web factor using random log
|
RandomLogHostSegmentWordPortionFromMainContentAvg
web_itditp: 677
AVG aggregation of SegmentWordPortionFromMainContent web factor using random log
|
RandomLogHostXfDtShowAllMaxFFieldSet2Bm15FLogK0001Avg
web_itditp: 679
AVG aggregation of XfDtShowAllMaxFFieldSet2Bm15FLogK0001 web factor using random log
|
RandomLogHostQueryRegionSizeAvg
web_itditp: 681
AVG aggregation of QueryRegionSize web factor using random log
|
RandomLogHostIsRelevLocaleUAAvg
web_itditp: 685
AVG aggregation of IsRelevLocaleUA web factor using random log
|
RandomLogHostQfufAllSumWFSumWFieldSet3BclmWeightedFLogW0K0001Perc90
web_itditp: 687
PERCENTALE_90 aggregation of QfufAllSumWFSumWFieldSet3BclmWeightedFLogW0K0001 web factor using random log
|
RandomLogHostDssmBoostingCtrQuerySelfSimilarityPerc90
web_itditp: 689
PERCENTALE_90 aggregation of DssmBoostingCtrQuerySelfSimilarity web factor using random log
|
RandomLogHostQueryToDocAllSumFCountTextBocm11Norm256Avg
web_itditp: 691
AVG aggregation of QueryToDocAllSumFCountTextBocm11Norm256 web factor using random log. NOTE: QueryToDocAllSumFCountTextBocm11Norm256 has been removed.
|
RandomLogHostIsNavMxQueryPerc90
web_itditp: 693
PERCENTALE_90 aggregation of IsNavMxQuery web factor using random log
|
RandomLogHostDBM15Wares2Avg
web_itditp: 697
AVG aggregation of DBM15Wares2 web factor using random log
|
RandomLogHostUrlNGramsModelPerc90
web_itditp: 699
PERCENTALE_90 aggregation of UrlNGramsModel web factor using random log
|
RandomLogHostDssmBoostingCtrKMeans1ScoreScaledSumWeightedQEPerc25
web_itditp: 705
PERCENTALE_25 aggregation of DssmBoostingCtrKMeans1ScoreScaledSumWeightedQE web factor using random log
|
RandomLogHostLongClickMobileAllWcmWeightedValuePerc90
web_itditp: 707
PERCENTALE_90 aggregation of LongClickMobileAllWcmWeightedValue web factor using random log
|
RandomLogHostDssmVkPopularityPerc25
web_itditp: 709
PERCENTALE_25 aggregation of DssmVkPopularity web factor using random log
|
RandomLogHostUBLongPeriodVisitsSNProbAvg
web_itditp: 711
AVG aggregation of UBLongPeriodVisitsSNProb web factor using random log
|
RandomLogHostCountryQueryRegionalityPerc90
web_itditp: 713
PERCENTALE_90 aggregation of CountryQueryRegionality web factor using random log
|
RandomLogHostTRhitwPerc90
web_itditp: 715
PERCENTALE_90 aggregation of TRhitw web factor using random log
|
RandomLogHostUBLongPeriodAvgSearchDuration600Perc90
web_itditp: 717
PERCENTALE_90 aggregation of UBLongPeriodAvgSearchDuration600 web factor using random log
|
RandomLogHostRequestIsFromIOSAvg
web_itditp: 719
AVG aggregation of RequestIsFromIOS web factor using random log
|
RandomLogHostDssmQueryEmbeddingCtrNoMinerPca4Perc90
web_itditp: 721
PERCENTALE_90 aggregation of DssmQueryEmbeddingCtrNoMinerPca4 web factor using random log
|
RandomLogHostXfDtShowAllMaxFFieldSetUTBm15FLogW0Avg
web_itditp: 723
AVG aggregation of XfDtShowAllMaxFFieldSetUTBm15FLogW0 web factor using random log
|
RandomLogHostUrlTrigramsPerc25
web_itditp: 725
PERCENTALE_25 aggregation of UrlTrigrams web factor using random log
|
RandomLogHostDssmQueryEmbeddingCtrNoMinerPca1Perc90
web_itditp: 727
PERCENTALE_90 aggregation of DssmQueryEmbeddingCtrNoMinerPca1 web factor using random log
|
RandomLogHostIsRelevLocaleKZAvg
web_itditp: 729
AVG aggregation of IsRelevLocaleKZ web factor using random log
|
RandomLogHostTextFeaturesPerc90
web_itditp: 731
PERCENTALE_90 aggregation of TextFeatures web factor using random log
|
NewsAgencyRating
web_itditp: 733
Rating of news agency from agencies.json (Yandex.News resource)
|
HasJsFromMarketgidCom
web_itditp: 735
1 if host include js from marketgid.com
|
HasJsFromRfityCom
web_itditp: 737
1 if host include js from rfity.com
|
HasJsFromFacebookNet
web_itditp: 743
1 if host include js from facebook.net
|