Tag: TG_DATA_FROM_SAASKV
(130 ranking factors)
Factors |
---|
IsUrlForClickDeboost
web_production: 577
It is known about URL that it is shown too often with very low relevance (according to Bert and/or BM25)
|
IsFeedListing
web_production: 593
OffersBase feature for ecoboost.
|
IsFeedMain
web_production: 594
OffersBase feature for ecoboost.
|
IsFeedStratocaster
web_production: 595
OffersBase feature for ecoboost.
|
IsFeedAny
web_production: 596
OffersBase feature for ecoboost.
|
HostHasFeedUrls
web_production: 630
OffersBase feature for ecoboost.
|
IsFeedOffer
web_production: 631
OffersBase feature for ecoboost.
|
HostEcomKernel1
web_production: 632
Business kernel.
|
HostEcomKernel2
web_production: 633
Business kernel.
|
HostEcomKernel3
web_production: 634
Business kernel.
|
HostUserLeakage
web_production: 669
User outflow coefficient from the search after a visit to the site
|
HasTurboEcom
web_production: 884
|
DomainHasMetrika
web_production: 1463
Does owner have metrika or not
|
HasSideblock
web_production: 1464
The document has a turbo page for Mobile platform.
|
YellownessMax
web_production: 1473
Maximum value of domain yellowness (based on Toloka)
|
YellownessMean
web_production: 1474
Mean value of domain yellowness (based on Toloka)
|
YellownessMedian
web_production: 1475
Median of domain yellowness (based on Toloka)
|
YellownessMin
web_production: 1476
Minimum value of domain yellowness (based on Toloka)
|
OwnerWebsiteAttention
web_production: 1675
Site owner pays attention to site details (at least once in quarter)
|
ChatScore
web_production: 1677
Chat info. positive / events or zero
|
HostPlayerViewDepth
web_production: 1678
Host player info. Relation between view time and video duration
|
HasTurbo
web_production: 1711
The document has a turbo page. Depends on the platform
|
HasTurboApp
web_production: 1793
The document has a turbo page for Desktop platforms. Updates on top of the base are delivered through SaAS.
|
IsTasIx
web_production: 1840
The site is located on the TAS-IX network (relevant for Uzbekistan)
|
HasTurboMobile
web_production: 1904
The document has a turbo page for Mobile platform. Updates on top of the base are delivered through SaAS.
|
HasTurboDesktop
web_production: 1905
The document has a turbo page for Desktop platforms. Updates on top of the base are delivered through SaAS.
|
RandomCommercial
web_production: 1908
'Random' factor for commercial sites.
|
QueryTokenMaxFreqMusic
alice_query_tokens_factors: 0
Max tokens frequency in music scenarios
|
QueryTokenMaxFreqVideo
alice_query_tokens_factors: 1
Max tokens frequency in video scenarios
|
QueryTokenAvgFreqMusic
alice_query_tokens_factors: 2
Average tokens frequency in music scenarios
|
QueryTokenAvgFreqVideo
alice_query_tokens_factors: 3
Average tokens frequency in video scenarios
|
QueryTokenMaxFreqMusicWHF
alice_query_tokens_factors: 4
Max tokens frequency in music scenarios (without high frequency tokens)
|
QueryTokenMaxFreqVideoWHF
alice_query_tokens_factors: 5
Max tokens frequency in video scenarios (without high frequency tokens)
|
QueryTokenAvgFreqMusicWHF
alice_query_tokens_factors: 6
Average tokens frequency in music scenarios (without high frequency tokens)
|
QueryTokenAvgFreqVideoWHF
alice_query_tokens_factors: 7
Average tokens frequency in video scenarios (without high frequency tokens)
|
QueryTokenMaxFreqVideoSelectFromGallery
alice_query_tokens_factors: 8
Max tokens frequency in video select from gallery scenario
|
QueryTokenAvgFreqVideoSelectFromGallery
alice_query_tokens_factors: 9
Avg tokens frequency in video select from gallery scenario
|
QueryTokenMaxFreqGC
alice_query_tokens_factors: 10
Max tokens frequency in general conversation scenarios
|
QueryTokenAvgFreqGC
alice_query_tokens_factors: 11
Avg tokens frequency in general conversation scenarios
|
QueryTokenMaxFreqSearch
alice_query_tokens_factors: 12
Max tokens frequency in search scenarios
|
QueryTokenAvgFreqSearch
alice_query_tokens_factors: 13
Avg tokens frequency in search scenarios
|
QueryTokenMaxFreqOther
alice_query_tokens_factors: 14
Max tokens frequency in other scenarios
|
QueryTokenAvgFreqOther
alice_query_tokens_factors: 15
Avg tokens frequency in other scenarios
|
QueryTokenBigramsMaxFreqMusic
alice_query_tokens_factors: 16
Max tokens bigrams frequency in music scenarios
|
QueryTokenBigramsMaxFreqVideo
alice_query_tokens_factors: 17
Max tokens bigrams frequency in video scenarios
|
QueryTokenBigramsAvgFreqMusic
alice_query_tokens_factors: 18
Average tokens bigrams frequency in music scenarios
|
QueryTokenBigramsAvgFreqVideo
alice_query_tokens_factors: 19
Average tokens bigrams frequency in video scenarios
|
QueryTokenBigramsMaxFreqMusicWHF
alice_query_tokens_factors: 20
Max tokens bigrams frequency in music scenarios (without high frequency tokens)
|
QueryTokenBigramsMaxFreqVideoWHF
alice_query_tokens_factors: 21
Max tokens bigrams frequency in video scenarios (without high frequency tokens)
|
QueryTokenBigramsAvgFreqMusicWHF
alice_query_tokens_factors: 22
Average tokens bigrams frequency in music scenarios (without high frequency tokens)
|
QueryTokenBigramsAvgFreqVideoWHF
alice_query_tokens_factors: 23
Average tokens bigrams frequency in video scenarios (without high frequency tokens)
|
QueryTokenBigramsMaxFreqVideoSelectFromGallery
alice_query_tokens_factors: 24
Max tokens bigrams frequency in video select from gallery scenario
|
QueryTokenBigramsAvgFreqVideoSelectFromGallery
alice_query_tokens_factors: 25
Avg tokens bigrams frequency in video select from gallery scenario
|
QueryTokenBigramsMaxFreqGC
alice_query_tokens_factors: 26
Max tokens bigrams frequency in general conversation scenarios
|
QueryTokenBigramsAvgFreqGC
alice_query_tokens_factors: 27
Avg tokens bigrams frequency in general conversation scenarios
|
QueryTokenBigramsMaxFreqSearch
alice_query_tokens_factors: 28
Max tokens bigrams frequency in search scenarios
|
QueryTokenBigramsAvgFreqSearch
alice_query_tokens_factors: 29
Avg tokens bigrams frequency in search scenarios
|
QueryTokenBigramsMaxFreqOther
alice_query_tokens_factors: 30
Max tokens bigrams frequency in other scenarios
|
QueryTokenBigramsAvgFreqOther
alice_query_tokens_factors: 31
Avg tokens bigrams frequency in other scenarios
|
ClusteredEmbLogDwelltimeBigramsDocDwt120LessTopavg1
web_meta_pers: 16
Top1 weighted average cosine similarity between document and clustered embedding of documents from user_history (model=LogDwellTimeBigrams, dwelltime more than 120sec)
|
ClusteredEmbLogDwelltimeBigramsDocDwt120MoreTopavg1
web_meta_pers: 17
Top1 weighted average cosine similarity between document and clustered embedding of documents from user_history (model=LogDwellTimeBigrams, dwelltime less than 120sec)
|
ClusteredEmbLogDwelltimeBigramsDocDwt120LessTopavg2
web_meta_pers: 18
Top2 weighted average cosine similarity between document and clustered embedding of documents from user_history (model=LogDwellTimeBigrams, dwelltime more than 120sec)
|
ClusteredEmbLogDwelltimeBigramsDocDwt120MoreTopavg2
web_meta_pers: 19
Top2 weighted average cosine similarity between document and clustered embedding of documents from user_history (model=LogDwellTimeBigrams, dwelltime less than 120sec)
|
ClusteredEmbLogDwelltimeBigramsDocDwt120LessCounter30
web_meta_pers: 20
cnt / (1 + cnt), cnt = sum of weights where cosine between document and clustered embedding of documents from user_history > 0.30 (model=LogDwellTimeBigrams, dwelltime more than 120sec)
|
ClusteredEmbLogDwelltimeBigramsDocDwt120MoreCounter30
web_meta_pers: 21
cnt / (1 + cnt), cnt = sum of weights where cosine between document and clustered embedding of documents from user_history > 0.30 (model=LogDwellTimeBigrams, dwelltime less than 120sec)
|
ClusteredEmbLogDwelltimeBigramsDocDwt120LessCounter60
web_meta_pers: 22
cnt / (1 + cnt), cnt = sum of weights where cosine between document and clustered embedding of documents from user_history > 0.60 (model=LogDwellTimeBigrams, dwelltime more than 120sec)
|
ClusteredEmbLogDwelltimeBigramsDocDwt120MoreCounter60
web_meta_pers: 23
cnt / (1 + cnt), cnt = sum of weights where cosine between document and clustered embedding of documents from user_history > 0.60 (model=LogDwellTimeBigrams, dwelltime less than 120sec)
|
ClusteredEmbLogDwelltimeBigramsDocDwt120LessCounter90
web_meta_pers: 24
cnt / (1 + cnt), cnt = sum of weights where cosine between document and clustered embedding of documents from user_history > 0.90 (model=LogDwellTimeBigrams, dwelltime more than 120sec)
|
ClusteredEmbLogDwelltimeBigramsDocDwt120MoreCounter90
web_meta_pers: 25
cnt / (1 + cnt), cnt = sum of weights where cosine between document and clustered embedding of documents from user_history > 0.90 (model=LogDwellTimeBigrams, dwelltime less than 120sec)
|
ClusteredEmbLogDwelltimeBigramsDocXQueryDwt120LessTopavg1
web_meta_pers: 26
Top1 weighted average cosine similarity between query embedding and clustered embedding of documents from user_history (model=LogDwellTimeBigrams, dwelltime more than 120sec)
|
ClusteredEmbLogDwelltimeBigramsDocXQueryDwt120MoreTopavg1
web_meta_pers: 27
Top1 weighted average cosine similarity between query embedding and clustered embedding of documents from user_history (model=LogDwellTimeBigrams, dwelltime less than 120sec)
|
ClusteredEmbLogDwelltimeBigramsDocXQueryDwt120LessTopavg2
web_meta_pers: 28
Top2 weighted average cosine similarity between query embedding and clustered embedding of documents from user_history (model=LogDwellTimeBigrams, dwelltime more than 120sec)
|
ClusteredEmbLogDwelltimeBigramsDocXQueryDwt120MoreTopavg2
web_meta_pers: 29
Top2 weighted average cosine similarity between query embedding and clustered embedding of documents from user_history (model=LogDwellTimeBigrams, dwelltime less than 120sec)
|
ClusteredEmbLogDwelltimeBigramsDocXQueryDwt120LessCounter30
web_meta_pers: 30
cnt / (1 + cnt), cnt = sum of weights where cosine between query embedding and clustered embedding of documents from user_history > 0.30 (model=LogDwellTimeBigrams, dwelltime more than 120sec)
|
ClusteredEmbLogDwelltimeBigramsDocXQueryDwt120MoreCounter30
web_meta_pers: 31
cnt / (1 + cnt), cnt = sum of weights where cosine between query embedding and clustered embedding of documents from user_history > 0.30 (model=LogDwellTimeBigrams, dwelltime less than 120sec)
|
ClusteredEmbLogDwelltimeBigramsDocXQueryDwt120LessCounter60
web_meta_pers: 32
cnt / (1 + cnt), cnt = sum of weights where cosine between query embedding and clustered embedding of documents from user_history > 0.60 (model=LogDwellTimeBigrams, dwelltime more than 120sec)
|
ClusteredEmbLogDwelltimeBigramsDocXQueryDwt120MoreCounter60
web_meta_pers: 33
cnt / (1 + cnt), cnt = sum of weights where cosine between query embedding and clustered embedding of documents from user_history > 0.60 (model=LogDwellTimeBigrams, dwelltime less than 120sec)
|
ClustEmbLogDtBigramsDocXQueryDwt120LessWeights5LessMinscore
web_meta_pers: 88
Take clustered embeddings (short clicks, centroids with weights less than 5), Calc dot products with query embedding, f = Min(DotProducts)
|
ClustEmbLogDtBigramsDocDwt120LessWeights0AllScoreThreshold95pSatweightsum
web_meta_pers: 89
Take clustered embeddings (short clicks), Calc dot products with doc embedding, Take dot product more than 0.95, f = 1 / (1 + sum of weights)
|
ClustEmbLogDtBigramsDocXQueryDwt120MoreWeights5LessMintop10pScore
web_meta_pers: 90
Take clustered embeddings (long clicks, centroids with weights less than 5), Calc dot products with query embedding, sort descending, f = DotProducts[0.10 * DotProduct.size()]
|
ClustEmbLogDtBigramsDocXQueryDwt120MoreWeights5LessMintop90pScore
web_meta_pers: 91
Take clustered embeddings (long clicks, centroids with weights less than 5), Calc dot products with query embedding, sort descending, f = DotProducts[0.90 * DotProduct.size()]
|
ClustEmbLogDtBigramsDocXQueryDwt120MoreWeights5LessMinscore
web_meta_pers: 92
Take clustered embeddings (long clicks, centroids with weights less than 5), Calc dot products with query embedding, f = Min(DotProducts)
|
ClustEmbLogDtBigramsDocXQueryDwt120MoreWeights5LessScoreThreshold30pSatweightsum
web_meta_pers: 93
Take clustered embeddings (long clicks, centroids with weights less than 5), Calc dot products with query embedding, Take dot product more than 0.30, f = 1 / (1 + sum of weights)
|
ClustEmbLogDtBigramsDocXQueryDwt120MoreWeights5LessScoreThreshold40pCount
web_meta_pers: 94
Take clustered embeddings (long clicks, centroids with weights less than 5), Calc dot products with query embedding, Take dot product more than 0.40, f = count / (count + 10)
|
ClustEmbLogDtBigramsDocXQueryDwt120MoreWeights5LessScoreThreshold40pSatweightsum
web_meta_pers: 95
Take clustered embeddings (long clicks, centroids with weights less than 5), Calc dot products with query embedding, Take dot product more than 0.40, f = 1 / (1 + sum of weights)
|
ClustEmbLogDtBigramsDocXQueryDwt120MoreWeights10LessAvgtop5Scorexweight
web_meta_pers: 96
Take clustered embeddings (long clicks, centroids with weights less than 10), Calc dot products with query embedding, sort descending, take top5, f = AVG(DotProduct[i] * Weight[i])
|
ClustEmbLogDtBigramsDocDwt120MoreWeights0AllMaxscore
web_meta_pers: 97
Take clustered embeddings (long clicks), Calc dot products with doc embedding, f = Max(DotProduct)
|
ClustEmbLogDtBigramsDocDwt120MoreWeights0AllMaxScorexweight
web_meta_pers: 98
Take clustered embeddings (long clicks), Calc dot products with doc embedding, f = AVG(DotProduct[i] * Weight[i])
|
ClustEmbLogDtBigramsDocDwt120MoreWeights0AllScoreThreshold95pSatweightsum
web_meta_pers: 99
Take clustered embeddings (long clicks), Calc dot products with doc embedding, Take dot product more than 0.95, f = 1 / (1 + sum of weights)
|
ClustEmbLogDtBigramsDocDwt120MoreWeights5LessMaxscore
web_meta_pers: 100
Take clustered embeddings (long clicks, centroids with weights less than 5), Calc dot products with doc embedding, f = Max(DotProduct)
|
ClustEmbLogDtBigramsDocDwt120MoreWeights5LessMintop5pScore
web_meta_pers: 101
Take clustered embeddings (long clicks, centroids with weights less than 5), Calc dot products with doc embedding, sort descending, f = DotProducts[0.05 * DotProduct.size()]
|
ClustEmbLogDtBigramsDocDwt120MoreWeights5LessMintop30pScore
web_meta_pers: 102
Take clustered embeddings (long clicks, centroids with weights less than 5), Calc dot products with doc embedding, sort descending, f = DotProducts[0.30 * DotProduct.size()]
|
ClustEmbLogDtBigramsDocDwt120MoreWeights5LessMinscore
web_meta_pers: 103
Take clustered embeddings (long clicks, centroids with weights less than 5), Calc dot products with doc embedding, f = Min(DotProducts)
|
ClustEmbLogDtBigramsDocDwt120MoreWeights5LessAvgtop5pScorexweight
web_meta_pers: 104
Take clustered embeddings (long clicks, centroids with weights less than 5), Calc dot products with doc embedding, sort descending, take top5, f = AVG(DotProduct[i] * Weight[i])
|
ClustEmbLogDtBigramsDocDwt120MoreWeights5LessScoreThreshold30pSatweightsum
web_meta_pers: 105
Take clustered embeddings (long clicks, centroids with weights less than 5), Calc dot products with doc embedding, Take dot product more than 0.30, f = 1 / (1 + sum of weights)
|
ClustEmbLogDtBigramsDocDwt120MoreWeights5MoreMintop10pScore
web_meta_pers: 106
Take clustered embeddings (long clicks, centroids with weights more than 5), Calc dot products with doc embedding, sort descending, f = DotProducts[0.10 * DotProduct.size()]
|
ClustEmbLogDtBigramsDocDwt120MoreWeights10LessMaxscore
web_meta_pers: 107
Take clustered embeddings (long clicks, centroids with weights less than 10), Calc dot products with doc embedding, f = Max(DotProduct)
|
ClustEmbLogDtBigramsDocDwt120MoreWeights10LessMintop10pScore
web_meta_pers: 108
Take clustered embeddings (long clicks, centroids with weights less than 10), Calc dot products with doc embedding, sort descending, f = DotProducts[0.10 * DotProduct.size()]
|
ClustEmbLogDtBigramsDocDwt120MoreWeights20LessMaxScorexweight
web_meta_pers: 109
Take clustered embeddings (long clicks, centroids with weights less than 20), Calc dot products with doc embedding, f = AVG(DotProduct[i] * Weight[i])
|
ClustEmbLogDtBigramsDocDwt120MoreWeights20LessScoreThreshold70pSatweightsum
web_meta_pers: 110
Take clustered embeddings (long clicks, centroids with weights less than 20), Calc dot products with doc embedding, Take dot product more than 0.70, f = 1 / (1 + sum of weights)
|
ClustEmbLogDtBigramsDocDwt120MoreWeights30LessMaxscore
web_meta_pers: 111
Take clustered embeddings (long clicks, centroids with weights less than 30), Calc dot products with doc embedding, f = Max(DotProduct)
|
ClustEmbLogDtBigramsDocDwt120MoreWeights30LessAvgtop20pScorexweight
web_meta_pers: 112
Take clustered embeddings (long clicks, centroids with weights less than 30), Calc dot products with doc embedding, sort descending, take top20, f = AVG(DotProduct[i] * Weight[i])
|
ClustEmbLogDtBigramsQueryAllWeightsAvgtop4ScoreXWeight
web_meta_pers: 121
Take clustered embeddings (query embeddings, all weights), Calc dot product with query embedding, Take top4, f = AVG(Score[i] * Weight[i])
|
ClustEmbLogDtBigramsQueryAllWeightsScoreThreshold75pSatweightsum
web_meta_pers: 122
Take clustered embeddings (query embeddings, all weights), Calc dot product with query embedding, Take dot product with score > 0.75, f = 1 / (1 + sum of weights)
|
ClustEmbLogDtBigramsQueryWeights3LessMintop95pScore
web_meta_pers: 123
Take clustered embeddings (query embeddings, weights less than 3), Calc dot product with query embedding, Sort descending, f = DotProduct[0.95 * length(DotProducts)]
|
ClustEmbLogDtBigramsQueryWeights5LessScoreThreshold55pSatweightsum
web_meta_pers: 124
Take clustered embeddings (query embeddings, weights less than 5), Calc dot product with query embedding, Take dot product with score > 0.55, f = 1 / (1 + sum of weights)
|
ClustEmbLogDtBigramsQueryWeights5LessMinscore
web_meta_pers: 125
Take clustered embeddings (query embeddings, weights less than 5), Calc dot product with query embedding, f = MIN(DotProducts)
|
ClustEmbLogDtBigramsQueryWeights10LessMaxscore
web_meta_pers: 126
Take clustered embeddings (query embeddings, weights less than 10), Calc dot product with query embedding, f = MAX(DotProducts)
|
ClustEmbLogDtBigramsQueryWeights15LessMaxScoreXWeight
web_meta_pers: 127
Take clustered embeddings (query embeddings, weights less than 15), Calc dot product with query embedding, f = MAX(Score[i] * Weights[i])
|
ClustEmbLogDtBigramsQueryWeights20LessScoreThreshold70pSatweightsum
web_meta_pers: 128
Take clustered embeddings (query embeddings, weights less than 20), Calc dot product with query embedding, Take dot product with score > 0.70, f = 1 / (1 + sum of weights)
|
ClustEmbLogDtBigramsQueryAllWeightsMaxScoreXWeight
web_meta_pers: 129
Take clustered embeddings (query embeddings, all weights), Calc dot product with query embedding, f = MAX(Score[i] * Weights[i])
|
ClustEmbLogDtBigramsQueryXDocWeights3LessMintop5pScore
web_meta_pers: 130
Take clustered embeddings (query embeddings, weights less than 3), Calc dot product with doc embedding, Sort descending, f = DotProduct[0.05 * length(DotProducts)]
|
ClustEmbLogDtBigramsQueryXDocWeights3LessMinscore
web_meta_pers: 131
Take clustered embeddings (query embeddings, weights less than 3), Calc dot product with doc embedding, f = MIN(DotProducts)
|
ClustEmbLogDtBigramsQueryXDocWeights5LessMinscore
web_meta_pers: 132
Take clustered embeddings (query embeddings, weights less than 5), Calc dot product with doc embedding, f = MIN(DotProducts)
|
ClustEmbLogDtBigramsQueryXDocWeights15LessScoreThreshold45pSatweightsum
web_meta_pers: 133
Take clustered embeddings (query embeddings, weights less than 15), Calc dot product with doc embedding, Take dot product with score > 0.45, f = 1 / (1 + sum of weights)
|
ClustEmbLogDtBigramsQueryXDocAllWeightsMaxScoreXWeight
web_meta_pers: 134
Take clustered embeddings (query embeddings, all weights), Calc dot product with doc embedding, f = MAX(Score[i] * Weights[i])
|
ClustEmbLogDtBigramsQueryXDocAllWeightsScoreThreshold55pSatweightsum
web_meta_pers: 135
Take clustered embeddings (query embeddings, all weights), Calc dot product with doc embedding, Take dot product with score > 0.55, f = 1 / (1 + sum of weights)
|
SDGenerallyPRSMaxSimilarWeight
web_meta: 91
Maximum by SimiladocsprsPrsweight counted according to all documents in PRS
|
DomainHasMetrika
web_meta: 486
Does owner have metrika or not
|
HasSideblock
web_meta: 487
The document has a turbo page for Mobile platform.
|
IsTasIx
web_meta: 488
The site is located on the TAS-IX network (relevant for Uzbekistan)
|
RandomCommercial
web_meta: 489
'Random' factor for commercial sites.
|
HostBizKernel4
web_meta: 490
Thematic business cakel
|
OwnerWebsiteAttention
web_meta: 491
Site owner pays attention to site details (at least once in quarter)
|
ChatScore
web_meta: 492
Chat info. positive / events or zero
|
HostPlayerViewDepth
web_meta: 493
Host player info. Relation between view time and video duration
|
IsReservedOwner
web_meta: 502
From reseved_owner namespace in SaaS-KV
|
IsReservedUrl
web_meta: 503
From reseved_url namespace in SaaS-KV
|
IsReservedOwnerFast
web_meta: 504
From reseved_owner namespace in SaaS-KV for fast reaction
|
IsReservedUrlFast
web_meta: 505
From reseved_url namespace in SaaS-KV for fast reaction
|