Slice: unknown
(1259 ranking factors)
Factors |
---|
MatchedHashes
unknown: 0
|
Offset
unknown: 0
|
NumHashMatches
unknown: 1
|
Popularity
unknown: 2
|
OneMinusOffset
unknown: 3
|
Tf
unknown: 0
|
Idf
unknown: 1
|
I2t
unknown: 2
|
WordCount
unknown: 3
|
NumberRatio
unknown: 4
|
SumSimilarity
unknown: 5
|
TfIdfPosition
unknown: 6
|
ClusterSimilarity
unknown: 7
|
NonNullComponentsNumber
unknown: 8
|
IsFio
unknown: 9
|
IsEntity
unknown: 10
|
ClusterIdx
unknown: 11
|
Language
unknown: 12
|
IsLowerCase
unknown: 13
|
Prob0
unknown: 14
|
Prob1
unknown: 15
|
Prob2
unknown: 16
|
Prob3
unknown: 17
|
Prob4
unknown: 18
|
Enthropy
unknown: 19
|
TRhitw
unknown: 0
|
TextBM25
unknown: 1
|
TxtHead
unknown: 2
|
TxtHiRel
unknown: 3
|
NewsStoryRank
unknown: 4
|
NewsDuplicate
unknown: 5
|
NewsInStoryAgencyWeight
unknown: 6
|
NewsAgenQuality
unknown: 7
|
NewsUCP
unknown: 8
|
NewsNoSelections
unknown: 9
|
NewsTailSelected
unknown: 10
|
NewsWordsInTitle
unknown: 11
|
NewsIsUaAgency
unknown: 12
|
NewsWholeQueryInTitle
unknown: 13
|
MatchedHashes
unknown: 0
|
Offset
unknown: 0
|
NumHashMatches
unknown: 1
|
Popularity
unknown: 2
|
OneMinusOffset
unknown: 3
|
ru_fact_snippet_dssm_query_candidate_score
unknown: 0
ru fact snippet dssm query candidate score
|
neocortex_ttt_similarity
unknown: 1
neocortex text to text embed cosine
|
neocortex_tth_similarity
unknown: 2
neocortex text to host embed cosine
|
query_contains_all_rhs_numbers
unknown: 3
if query contains all rhs numbers
|
query_len
unknown: 4
query len in utf8 chars
|
rhs_len
unknown: 5
rhs len in utf8 chars
|
answer_len
unknown: 6
answer len
|
same_question_word
unknown: 7
if query and rhs share question word
|
is_assistant
unknown: 8
if query from assistant
|
symbol_edit_distance
unknown: 9
minimum edit distance in Unicode symbols
|
word_edit_distance
unknown: 10
minimum edit distance in words
|
sorted_word_edit_distance
unknown: 11
minimum edit distance in words between sorted word sequences
|
word_dist_url_5
unknown: 12
word dist on top 5 urls
|
word_dist_url_10
unknown: 13
word dist on top 10 urls
|
word_dist_host_5
unknown: 14
word dist on top 5 hosts
|
word_dist_host_10
unknown: 15
word dist on top 10 hosts
|
word_dist_url_5_norm
unknown: 16
word dist on top 5 urls + norm
|
word_dist_url_10_norm
unknown: 17
word dist on top 10 urls + norm
|
word_dist_host_5_norm
unknown: 18
word dist on top 5 hosts + norm
|
word_dist_host_10_norm
unknown: 19
word dist on top 10 hosts + norm
|
lemma_duplicate_count
unknown: 20
number of duplicate lemmas
|
lemma_weighted_edit_distance
unknown: 21
minimum edit distance between word sequences with duplicate lemmas transposed, weighted by part of speech or word form change
|
lemma_weighted_edit_chain_len
unknown: 22
length of the chain for weighted minimum edit distance
|
query_question_word_id
unknown: 23
Interrogative word in alias
|
rhs_question_word_id
unknown: 24
Interrogative word in Query
|
fact_snip_neocortex_query_answer_cosine
unknown: 25
For NeoCortex Embeddings COS (Alias, Answer)
|
fact_snip_neocortex_query_answer_cosine_minus_rhs_answer_cosine
unknown: 26
For NeoCortex Embeddings COS (Alias, Answer) - COS (Query, Answer)
|
fact_snip_neocortex_query_rhs_cosine
unknown: 27
For NeoCortex Embeddings COS (Alias, Query)
|
word_embed_qwt_xor_sif_w1_pairwise_sim_left_min_max
unknown: 28
Query-Word-Title Embeda. Backing: symmetrical difference in words by lemmams between requests. Word weights: 1E-4 / (1E-4 + frequency). Fig: we go according to the first request and select the greatest cosine to one of the words of the second request; Of these meanings, we find a minimum.
|
word_embed_qwt_sif_w1_pairwise_sim_left_min_max
unknown: 29
Query-Word-Title Embeda. Word weights: 1E-4 / (1E-4 + frequency). Fig: we go according to the first request and select the greatest cosine to one of the words of the second request; Of these meanings, we find a minimum.
|
word_embed_qwt_sif_w1_pairwise_sim_right_min_max
unknown: 30
Query-Word-Title Embeda. Word weights: 1E-4 / (1E-4 + frequency). Fig: We go according to the second request and select the greatest cosine to one of the words of the first request; Of these meanings, we find a minimum.
|
word_embed_qwt_sif_w1_pairwise_sim_right_max_min
unknown: 31
Query-Word-Title Embeda. Word weights: 1E-4 / (1E-4 + frequency). Fig: We go according to the second request and select the smallest cosine to one of the words of the first request; From these meanings we find the maximum.
|
word_embed_qwt_sif_w2_pairwise_sim_left_min_max
unknown: 32
Query-Word-Title Embeda. Word weights: 1E-4 / (1E-4 + frequency). Fig: go through the pairs of neighboring words of the first request and select the greatest cosine of the amount to the neighboring pair of words of the second request; Of these meanings, we find a minimum.
|
word_embed_qwt_q2a_sif_w2_pairwise_sim_right_min_max
unknown: 33
Query-Word-Title Embeda. Word weights: 1E-4 / (1E-4 + frequency). Fig: go through the pairs of neighboring words of the second request and select the greatest cosine of the amount to the neighboring pair of words of the answer; Of these meanings, we find a minimum.
|
word_embed_wiki_sif_w1_pairwise_sim_left_min_max
unknown: 34
Word2VEC on Wikipedia. Word weights: 1E-4 / (1E-4 + frequency). Fig: we go according to the first request and select the greatest cosine to one of the words of the second request; Of these meanings, we find a minimum.
|
word_embed_wiki_sif_w1_pairwise_sim_right_min_max
unknown: 35
Word2VEC on Wikipedia. Word weights: 1E-4 / (1E-4 + frequency). Fig: We go according to the second request and select the greatest cosine to one of the words of the first request; Of these meanings, we find a minimum.
|
word_embed_wiki_sif_sum_sim
unknown: 36
Word2VEC on Wikipedia. Word weights: 1E-4 / (1E-4 + frequency). Fig: cosine of suspended sums of words of the first and second request.
|
word_embed_fs_xor_sif_w1_pairwise_sim_global_max
unknown: 37
Embeds of words query_features from DSSM models on the answers of Mail.ru. Backing: symmetrical difference in words by lemmams between requests. Word weights: 1E-4 / (1E-4 + frequency). Fig: the greatest meaning of cosine among all the pairs of words of the first and second requests.
|
word_embed_fs_xor_sif_sum_sim
unknown: 38
Embeds of words query_features from DSSM models on the answers of Mail.ru. Backing: symmetrical difference in words by lemmams between requests. Word weights: 1E-4 / (1E-4 + frequency). Fig: cosine of suspended sums of words of the first and second requests.
|
word_embed_fs_sif_w1_pairwise_sim_left_min_max
unknown: 39
Embeds of words query_features from DSSM models on the answers of Mail.ru. Word weights: 1E-4 / (1E-4 + frequency). Fig: we go according to the first request and select the greatest cosine to one of the words of the second request; Of these meanings, we find a minimum.
|
word_embed_fs_sif_w1_pairwise_sim_right_min_max
unknown: 40
Embeds of words query_features from DSSM models on the answers of Mail.ru. Word weights: 1E-4 / (1E-4 + frequency). Fig: We go according to the second request and select the greatest cosine to one of the words of the first request; Of these meanings, we find a minimum.
|
word_embed_fs_q1a_sif_w1_pairwise_sim_global_max
unknown: 41
Embeds of words query_features from DSSM models on the answers of Mail.ru. Word weights: 1E-4 / (1E-4 + frequency). Fig: the greatest meaning of cosine among all the pairs of the words of the first request and answer.
|
word_embed_fs_q1a_sif_w1_pairwise_sim_left_min_max
unknown: 42
Embeds of words query_features from DSSM models on the answers of Mail.ru. Word weights: 1E-4 / (1E-4 + frequency). Fig: we go according to the first request and choose the greatest cosine to one of the words of the answer; Of these meanings, we find a minimum.
|
word_embed_fs_q2a_sif_w1_pairwise_sim_left_min_max
unknown: 43
Embeds of words query_features from DSSM models on the answers of Mail.ru. Word weights: 1E-4 / (1E-4 + frequency). Fig: we go according to the second request and choose the greatest cosine to one of the words of the answer; Of these meanings, we find a minimum.
|
word_embed_qwt_sif_vectors_to_sum_sim_left
unknown: 44
Query-Word-Title Embeda. Word weights: 1E-4 / (1E-4 + frequency). Fig: the cosine of the suspended amount of the first request to the main component of the suspended amounts of factory requests of length> = 3.
|
word_embed_qwt_sif_vectors_to_sum_sim_right
unknown: 45
Query-Word-Title Embeda. Word weights: 1E-4 / (1E-4 + frequency). Fig: the cosine of the suspended amount of the second request to the main component of the suspended amounts of factory requests of length> = 3.
|
word_dist_url_5_prepared_query
unknown: 46
word dist on top 5 urls on prepared queries
|
word_dist_url_10_prepared_query
unknown: 47
word dist on top 10 urls on prepared queries
|
word_dist_host_5_prepared_query
unknown: 48
word dist on top 5 hosts on prepared queries
|
word_dist_host_10_prepared_query
unknown: 49
word dist on top 10 hosts on prepared queries
|
word_dist_url_5_norm_prepared_query
unknown: 50
word dist on top 5 urls + norm on prepared queries
|
word_dist_url_10_norm_prepared_query
unknown: 51
word dist on top 10 urls + norm on prepared queries
|
word_dist_host_5_norm_prepared_query
unknown: 52
word dist on top 5 hosts + norm on prepared queries
|
word_dist_host_10_norm_prepared_query
unknown: 53
word dist on top 10 hosts + norm on prepared queries
|
bigram_dist_url_5
unknown: 54
words and bigrams dist based on top url 5 search result
|
bigram_dist_url_10
unknown: 55
words and bigrams dist based on top url 10 search result
|
bigram_dist_host_10
unknown: 56
words and bigrams dist based on top hosts 10 search result
|
bigram_dist_host_5
unknown: 57
words and bigrams dist based on top hosts 5 search result
|
word_embed_fq_xor_sif_w1_pairwise_sim_left_min_max
unknown: 58
Symmetric difference in words of queries, taking into account lemmatization. Word2VEC for factual requests and answers. Word weights: 1E-4 / (1E-4 + frequency). Fig: we go according to the first request and select the greatest cosine to one of the words of the second request; Of these meanings, we find a minimum.
|
word_embed_fq_xor_sif_w1_pairwise_sum_right_min_max
unknown: 59
Symmetric difference in words of queries, taking into account lemmatization. Word2VEC for factual requests and answers. Word weights: 1E-4 / (1E-4 + frequency). Fig: We go according to the second request and select the greatest cosine to one of the words of the first request; Of these meanings, we find a minimum.
|
word_embed_fq_q2a_sif_sum_sim
unknown: 60
Word2VEC for factual requests and answers. Word weights: 1E-4 / (1E-4 + frequency). Cosine of the sums of the vectors of the response and vectors of the second request
|
word_embed_qwt_xor_sif_mean_dist
unknown: 61
Symmetric difference in words of queries, taking into account lemmatization. Query-Word-Title Dictionary. Word weights: 1E-4 / (1E-4 + frequency). Fig: Euclidean distance between the average vectors of the words of the first and second requests
|
neocortex_alias_lemma_cosine
unknown: 62
FACTS-1856
|
lhs_diff_min_lemma_frequency
unknown: 63
The frequency of the most rare lemma in the diphas of the first request
|
rhs_diff_min_lemma_frequency
unknown: 64
The frequency of the most rare lemma in Diffe of the second request
|
common_noun_min_frequency
unknown: 65
The frequency of the most rare common noun in requests
|
neocortex_serp_items_lhs_union_facts_cosine
unknown: 66
FACTS-2184
|
neocortex_serp_items_lhs_positive_query_mx_cosine
unknown: 67
FACTS-2184
|
neocortex_serp_items_rhs_wiz_images_cosine
unknown: 68
FACTS-2184
|
neocortex_serp_items_rhs_wiz_entity_search_cosine
unknown: 69
FACTS-2184
|
nearest_word_subset_symbol_edit_distance
unknown: 70
FACTS-2221
|
tomato_dssm_query_candidate_score
unknown: 71
[T]he [O]tvet.[MA]il.ru + [TO]loka DSSM. Query - KNN Candidate dot product
|
query_model_feature_diff
unknown: 72
Diff Forethos of a unigramal text classifier trained for factskeeping queries from Toloka
|
query_model_feature_min
unknown: 73
Min is a ski uniramic text classifier trained for factskeeping queries from Toloka
|
query_model_feature_max
unknown: 74
Max are a ski uniramic text classifier trained for factskewk requests from Toloka
|
cross_model_feature_diff
unknown: 75
diff the value of the regression trained in bigrams in which the first word is taken from the request, the second from the answer
|
cross_model_feature_mim
unknown: 76
min The meaning of the regression trained in bigrams in which the first word is taken from the request, the second from the answer
|
cross_model_feature_max
unknown: 77
Max The meaning of the regression trained in bigrams in which the first word is taken from the request, the second from the answer
|
aliases_bdssm_score
unknown: 78
BERT imitated by DSSM based on online aliases prediction
|
FullMatchPrediction
unknown:
|
SynonymMatchPrediction
unknown:
|
AnnotationMatchPrediction
unknown:
|
AnnotationMatchPredictionWeighted
unknown:
|
QueryMatchPrediction
unknown:
|
WcmMax
unknown:
|
ValueWcmMax
unknown:
|
ValueWcmAvg
unknown:
|
ValueWcmPrediction
unknown:
|
WcmCoveragePrediction
unknown:
|
WcmCoverageMax
unknown:
|
PcmMax
unknown:
|
ValuePcmMax
unknown:
|
ValuePcmAvg
unknown:
|
ValuePcmPrediction
unknown:
|
WordValueMax
unknown:
|
PrefixMatchMax
unknown:
|
PrefixMatchAvg
unknown:
|
PrefixMatchCount
unknown:
|
SuffixMatchMax
unknown:
|
SuffixMatchAvg
unknown:
|
SuffixMatchCount
unknown:
|
Bm15K1
unknown:
|
Bm15K2
unknown:
|
Bm15K3
unknown:
|
Bm15K4
unknown:
|
Bm15K5
unknown:
|
Bm15K6
unknown:
|
Bm15K7
unknown:
|
Bm15K8
unknown:
|
Bm15K9
unknown:
|
Bm15K10
unknown:
|
Bm15AK1
unknown:
|
Bm15AK2
unknown:
|
Bm15AK3
unknown:
|
Bm15AK4
unknown:
|
Bm15W1K1
unknown:
|
Bm15W1K2
unknown:
|
Bm15W1K3
unknown:
|
Bm15W1K4
unknown:
|
Bm15W2K1
unknown:
|
Bm15W2K2
unknown:
|
Bm15W2K3
unknown:
|
Bm15W2K4
unknown:
|
Bm15V0K1
unknown:
|
Bm15V0K2
unknown:
|
Bm15V0K3
unknown:
|
Bm15V0K4
unknown:
|
Bm15V0W1K1
unknown:
|
Bm15V0W1K2
unknown:
|
Bm15V0W1K3
unknown:
|
Bm15V0W1K4
unknown:
|
Bm15V2K1
unknown:
|
Bm15V2K2
unknown:
|
Bm15V2K3
unknown:
|
Bm15V2K4
unknown:
|
Bm15V2W1K1
unknown:
|
Bm15V2W1K2
unknown:
|
Bm15V2W1K3
unknown:
|
Bm15V2W1K4
unknown:
|
Bm15V4K1
unknown:
|
Bm15V4K2
unknown:
|
Bm15V4K3
unknown:
|
Bm15V4K4
unknown:
|
Bm15V4K5
unknown:
|
Bm15V4K6
unknown:
|
Bm15V4K7
unknown:
|
Bm15V4K8
unknown:
|
Bm15V4W1K1
unknown:
|
Bm15V4W1K2
unknown:
|
Bm15V4W1K3
unknown:
|
Bm15V4W1K4
unknown:
|
Bm15StrictK1
unknown:
|
Bm15StrictK2
unknown:
|
Bm15StrictK3
unknown:
|
Bm15StrictK4
unknown:
|
Bm15StrictW1K1
unknown:
|
Bm15StrictW1K2
unknown:
|
Bm15StrictW1K3
unknown:
|
Bm15StrictW1K4
unknown:
|
Bm15MaxK1
unknown:
|
Bm15MaxK2
unknown:
|
Bm15MaxK3
unknown:
|
Bm15MaxK4
unknown:
|
Bm15V2MaxK1
unknown:
|
Bm15V2MaxK2
unknown:
|
Bm15V2MaxK3
unknown:
|
Bm15V2MaxK4
unknown:
|
Bm15AttenK1
unknown:
|
Bm15AttenK2
unknown:
|
Bm15AttenK3
unknown:
|
Bm15AttenK4
unknown:
|
Bm15AttenW1K1
unknown:
|
Bm15AttenW1K2
unknown:
|
Bm15AttenW1K3
unknown:
|
Bm15AttenW1K4
unknown:
|
Bm15WcmK1
unknown:
|
Bm15WcmK2
unknown:
|
Bm15WcmK3
unknown:
|
Bm15WcmK4
unknown:
|
Bm15WcmW1K1
unknown:
|
Bm15WcmW1K2
unknown:
|
Bm15WcmW1K3
unknown:
|
Bm15WcmW1K4
unknown:
|
Bm15CoverageK1
unknown:
|
Bm15CoverageK2
unknown:
|
Bm15CoverageK3
unknown:
|
Bm15CoverageK4
unknown:
|
Bm15CoverageW1K1
unknown:
|
Bm15CoverageW1K2
unknown:
|
Bm15CoverageW1K3
unknown:
|
Bm15CoverageW1K4
unknown:
|
Bm15CoverageV2K1
unknown:
|
Bm15CoverageV2K2
unknown:
|
Bm15CoverageV2K3
unknown:
|
Bm15CoverageV2K4
unknown:
|
Bm15CoverageV2W1K1
unknown:
|
Bm15CoverageV2W1K2
unknown:
|
Bm15CoverageV2W1K3
unknown:
|
Bm15CoverageV2W1K4
unknown:
|
Bm15CoverageV4K1
unknown:
|
Bm15CoverageV4K2
unknown:
|
Bm15CoverageV4K3
unknown:
|
Bm15CoverageV4K4
unknown:
|
Bm15CoverageV4W1K1
unknown:
|
Bm15CoverageV4W1K2
unknown:
|
Bm15CoverageV4W1K3
unknown:
|
Bm15CoverageV4W1K4
unknown:
|
BclmPlainK1
unknown:
|
BclmPlainK2
unknown:
|
BclmPlainK3
unknown:
|
BclmPlainK4
unknown:
|
BclmPlainK5
unknown:
|
BclmPlainK6
unknown:
|
BclmPlainK7
unknown:
|
BclmPlainW1K1
unknown:
|
BclmPlainW1K2
unknown:
|
BclmPlainW1K3
unknown:
|
BclmPlainW1K4
unknown:
|
BclmPlainV2K1
unknown:
|
BclmPlainV2K2
unknown:
|
BclmPlainV2K3
unknown:
|
BclmPlainV2K4
unknown:
|
BclmPlainV2W1K1
unknown:
|
BclmPlainV2W1K2
unknown:
|
BclmPlainV2W1K3
unknown:
|
BclmPlainV2W1K4
unknown:
|
BclmWeightedK1
unknown:
|
BclmWeightedK2
unknown:
|
BclmWeightedK3
unknown:
|
BclmWeightedK4
unknown:
|
BclmWeightedW1K1
unknown:
|
BclmWeightedW1K2
unknown:
|
BclmWeightedW1K3
unknown:
|
BclmWeightedW1K4
unknown:
|
BclmWeightedV2K1
unknown:
|
BclmWeightedV2K2
unknown:
|
BclmWeightedV2K3
unknown:
|
BclmWeightedV2K4
unknown:
|
BclmWeightedV2W1K1
unknown:
|
BclmWeightedV2W1K2
unknown:
|
BclmWeightedV2W1K3
unknown:
|
BclmWeightedV2W1K4
unknown:
|
BclmSoftK1
unknown:
|
BclmSoftK2
unknown:
|
BclmSoftK3
unknown:
|
BclmSoftK4
unknown:
|
BclmHardK1
unknown:
|
BclmHardK2
unknown:
|
BclmHardK3
unknown:
|
BclmHardK4
unknown:
|
BclmHardW1K1
unknown:
|
BclmHardW1K2
unknown:
|
BclmHardW1K3
unknown:
|
BclmHardW1K4
unknown:
|
BclmMixPlainA1K1
unknown:
|
BclmMixPlainA1K2
unknown:
|
BclmMixPlainA1K3
unknown:
|
BclmMixPlainA1K4
unknown:
|
BclmMixPlainA2K1
unknown:
|
BclmMixPlainA2K2
unknown:
|
BclmMixPlainA2K3
unknown:
|
BclmMixPlainA2K4
unknown:
|
BclmMixPlainW1K1
unknown:
|
BclmMixPlainW1K2
unknown:
|
BclmMixPlainW1K3
unknown:
|
BclmMixPlainW1K4
unknown:
|
BclmMixPlainV2K1
unknown:
|
BclmMixPlainV2K2
unknown:
|
BclmMixPlainV2K3
unknown:
|
BclmMixPlainV2K4
unknown:
|
BclmMixPlainV2W1K1
unknown:
|
BclmMixPlainV2W1K2
unknown:
|
BclmMixPlainV2W1K3
unknown:
|
BclmMixPlainV2W1K4
unknown:
|
BclmMixWeighted
unknown:
|
BocmPlain
unknown:
|
BocmWeightedK1
unknown:
|
BocmWeightedK2
unknown:
|
BocmWeightedK3
unknown:
|
BocmWeightedK4
unknown:
|
BocmWeightedK5
unknown:
|
BocmWeightedK6
unknown:
|
BocmWeightedK7
unknown:
|
BocmWeightedK8
unknown:
|
BocmWeightedK9
unknown:
|
BocmWeightedK10
unknown:
|
BocmWeightedW1K1
unknown:
|
BocmWeightedW1K2
unknown:
|
BocmWeightedW1K3
unknown:
|
BocmWeightedW1K4
unknown:
|
BocmWeightedW2K1
unknown:
|
BocmWeightedW2K2
unknown:
|
BocmWeightedW2K3
unknown:
|
BocmWeightedW2K4
unknown:
|
BocmWeightedV2K1
unknown:
|
BocmWeightedV2K2
unknown:
|
BocmWeightedV2K3
unknown:
|
BocmWeightedV2K4
unknown:
|
BocmWeightedV2W1K1
unknown:
|
BocmWeightedV2W1K2
unknown:
|
BocmWeightedV2W1K3
unknown:
|
BocmWeightedV2W1K4
unknown:
|
BocmWeightedV4K1
unknown:
|
BocmWeightedV4K2
unknown:
|
BocmWeightedV4K3
unknown:
|
BocmWeightedV4K4
unknown:
|
BocmWeightedV4K5
unknown:
|
BocmWeightedV4K6
unknown:
|
BocmWeightedV4K7
unknown:
|
BocmWeightedV4K8
unknown:
|
BocmWeightedV4W1K1
unknown:
|
BocmWeightedV4W1K2
unknown:
|
BocmWeightedV4W1K3
unknown:
|
BocmWeightedV4W1K4
unknown:
|
BocmWeightedMaxK1
unknown:
|
BocmWeightedMaxK2
unknown:
|
BocmWeightedMaxK3
unknown:
|
BocmWeightedMaxK4
unknown:
|
BocmWeightedV2MaxK1
unknown:
|
BocmWeightedV2MaxK2
unknown:
|
BocmWeightedV2MaxK3
unknown:
|
BocmWeightedV2MaxK4
unknown:
|
BocmDoubleK1
unknown:
|
BocmDoubleK2
unknown:
|
BocmDoubleK3
unknown:
|
BocmDoubleK4
unknown:
|
BocmDoubleK5
unknown:
|
BocmDoubleK6
unknown:
|
BocmDoubleK7
unknown:
|
BocmDoubleW1K1
unknown:
|
BocmDoubleW1K2
unknown:
|
BocmDoubleW1K3
unknown:
|
BocmDoubleW1K4
unknown:
|
BocmDoubleV2K1
unknown:
|
BocmDoubleV2K2
unknown:
|
BocmDoubleV2K3
unknown:
|
BocmDoubleV2K4
unknown:
|
BocmDoubleV2W1K1
unknown:
|
BocmDoubleV2W1K2
unknown:
|
BocmDoubleV2W1K3
unknown:
|
BocmDoubleV2W1K4
unknown:
|
TfICopLink
unknown: 0
|
RankSumICopLink
unknown: 1
|
RankDisSumICopLink
unknown: 2
|
RankDisMaxICopLink
unknown: 3
|
RankMaxICopLink
unknown: 4
|
TfISimLink
unknown: 5
|
RankSumISimLink
unknown: 6
|
RankDisSumISimLink
unknown: 7
|
RankDisMaxISimLink
unknown: 8
|
RankMaxISimLink
unknown: 9
|
TfICopDuck
unknown: 10
|
RankSumICopDuck
unknown: 11
|
RankDisSumICopDuck
unknown: 12
|
RankDisMaxICopDuck
unknown: 13
|
RankMaxICopDuck
unknown: 14
|
TfISimDuck
unknown: 15
|
RankSumISimDuck
unknown: 16
|
RankDisSumISimDuck
unknown: 17
|
RankDisMaxISimDuck
unknown: 18
|
RankMaxISimDuck
unknown: 19
|
TagWordsCount
unknown: 20
|
I2TQueryTag
unknown: 21
|
I2TClusterQuery
unknown: 22
|
I2TClusterTag
unknown: 23
|
ClusterIdx
unknown: 24
|
QueryPredictionBinaryPorn
unknown: 25
|
QueryPredictionClothes
unknown: 26
|
QueryPredictionGeo
unknown: 27
|
QueryPredictionMarket
unknown: 28
|
QueryPredictionMobilePorn
unknown: 29
|
QueryPredictionOcrText
unknown: 30
|
TokensScriptCyrillic
unknown: 31
|
TokensScriptAcceptable
unknown: 32
|
EndOfList
unknown: 33
|
OcrTokensNumber
unknown: 34
|
OcrTagI2T
unknown: 35
|
OcrTokensIntersection
unknown: 36
|
ListwiseI2T09
unknown: 37
|
ListwiseI2T08
unknown: 38
|
ListwiseI2TRatio06
unknown: 39
|
ListwiseI2TRatio04
unknown: 40
|
ListwiseI2TRatio02
unknown: 41
|
ListwiseI2TWMean
unknown: 42
|
TextIsMarket
unknown: 43
|
TextIsOntology
unknown: 44
|
TextIsFIO
unknown: 45
|
I2tDistance
unknown: 46
|
KnnDistance
unknown: 47
|
PcaComponentT0
unknown: 48
|
PcaComponentT1
unknown: 49
|
PcaComponentT2
unknown: 50
|
PcaComponentT3
unknown: 51
|
PcaComponentT4
unknown: 52
|
PcaTagImgL2
unknown: 53
|
StrLen
unknown: 54
|
StopWords
unknown: 55
|
Prob0Class
unknown: 56
|
Prob1Class
unknown: 57
|
Prob2Class
unknown: 58
|
Prob3Class
unknown: 59
|
EnthropyClass
unknown: 60
|
TitleToTagI2T
unknown: 61
|
I2tBluredClusterQuery
unknown: 62
|
I2tBluredClusterTag
unknown: 63
|
Prob4Class
unknown: 64
|
TagSources
unknown: 65
|
IcopTitleToTagI2T
unknown: 66
|
UserThreshold30Decay30Prior0
unknown: 0
|
UserThreshold30Decay30Prior1
unknown: 1
|
UserThreshold30Decay30Prior05
unknown: 2
|
UserThreshold120Decay30Prior0
unknown: 3
|
UserThreshold120Decay30Prior1
unknown: 4
|
UserThreshold120Decay30Prior05
unknown: 5
|
UserThreshold5Decay30Prior0
unknown: 6
|
UserThreshold5Decay30Prior1
unknown: 7
|
UserThreshold5Decay30Prior05
unknown: 8
|
UserLog5Decay30Prior0
unknown: 9
|
UserLog5Decay30Prior1
unknown: 10
|
UserLog5Decay30Prior05
unknown: 11
|
UserLog300Decay30Prior0
unknown: 12
|
UserLog300Decay30Prior1
unknown: 13
|
UserLog300Decay30Prior05
unknown: 14
|
UserCTRThreshold1Decay30
unknown: 15
|
UserCCTRThreshold1Decay30
unknown: 16
|
UserCCTR2Threshold1Decay30
unknown: 17
|
UserPCTRThreshold1Decay30
unknown: 18
|
UserClicksThreshold1Decay30
unknown: 19
|
UserClicksThreshold1Decay30XClicksThreshold1Decay30
unknown: 20
|
UserCTRThreshold120Decay30
unknown: 21
|
UserCCTRThreshold120Decay30
unknown: 22
|
UserCCTR2Threshold120Decay30
unknown: 23
|
UserPCTRThreshold120Decay30
unknown: 24
|
UserClicksThreshold120Decay30
unknown: 25
|
UserClicksThreshold120Decay30XClicksThreshold1Decay30
unknown: 26
|
UserCTRThreshold300Decay30
unknown: 27
|
UserCCTRThreshold300Decay30
unknown: 28
|
UserCCTR2Threshold300Decay30
unknown: 29
|
UserPCTRThreshold300Decay30
unknown: 30
|
UserClicksThreshold300Decay30
unknown: 31
|
UserClicksThreshold300Decay30XClicksThreshold1Decay30
unknown: 32
|
UserCTRLog5Decay30
unknown: 33
|
UserCCTRLog5Decay30
unknown: 34
|
UserCCTR2Log5Decay30
unknown: 35
|
UserPCTRLog5Decay30
unknown: 36
|
UserClicksLog5Decay30
unknown: 37
|
UserClicksLog5Decay30XClicksThreshold1Decay30
unknown: 38
|
UserCTRDwelltime600Decay30
unknown: 39
|
UserCCTRDwelltime600Decay30
unknown: 40
|
UserCCTR2Dwelltime600Decay30
unknown: 41
|
UserPCTRDwelltime600Decay30
unknown: 42
|
UserClicksDwelltime600Decay30
unknown: 43
|
UserClicksDwelltime600Decay30XClicksThreshold1Decay30
unknown: 44
|
UserCTROdd01Decay30
unknown: 45
|
UserCCTROdd01Decay30
unknown: 46
|
UserCCTR2Odd01Decay30
unknown: 47
|
UserPCTROdd01Decay30
unknown: 48
|
UserClicksOdd01Decay30
unknown: 49
|
UserClicksOdd01Decay30XClicksThreshold1Decay30
unknown: 50
|
UserCTROdd02Decay30
unknown: 51
|
UserCCTROdd02Decay30
unknown: 52
|
UserCCTR2Odd02Decay30
unknown: 53
|
UserPCTROdd02Decay30
unknown: 54
|
UserClicksOdd02Decay30
unknown: 55
|
UserClicksOdd02Decay30XClicksThreshold1Decay30
unknown: 56
|
UserCTROdd03Decay30
unknown: 57
|
UserCCTROdd03Decay30
unknown: 58
|
UserCCTR2Odd03Decay30
unknown: 59
|
UserPCTROdd03Decay30
unknown: 60
|
UserClicksOdd03Decay30
unknown: 61
|
UserClicksOdd03Decay30XClicksThreshold1Decay30
unknown: 62
|
UserCTROdd04Decay30
unknown: 63
|
UserCCTROdd04Decay30
unknown: 64
|
UserCCTR2Odd04Decay30
unknown: 65
|
UserPCTROdd04Decay30
unknown: 66
|
UserClicksOdd04Decay30
unknown: 67
|
UserClicksOdd04Decay30XClicksThreshold1Decay30
unknown: 68
|
UserCTROdd05Decay30
unknown: 69
|
UserCCTROdd05Decay30
unknown: 70
|
UserCCTR2Odd05Decay30
unknown: 71
|
UserPCTROdd05Decay30
unknown: 72
|
UserClicksOdd05Decay30
unknown: 73
|
UserClicksOdd05Decay30XClicksThreshold1Decay30
unknown: 74
|
UserCTROdd06Decay30
unknown: 75
|
UserCCTROdd06Decay30
unknown: 76
|
UserCCTR2Odd06Decay30
unknown: 77
|
UserPCTROdd06Decay30
unknown: 78
|
UserClicksOdd06Decay30
unknown: 79
|
UserClicksOdd06Decay30XClicksThreshold1Decay30
unknown: 80
|
UserCTROdd07Decay30
unknown: 81
|
UserCCTROdd07Decay30
unknown: 82
|
UserCCTR2Odd07Decay30
unknown: 83
|
UserPCTROdd07Decay30
unknown: 84
|
UserClicksOdd07Decay30
unknown: 85
|
UserClicksOdd07Decay30XClicksThreshold1Decay30
unknown: 86
|
UserCTROdd08Decay30
unknown: 87
|
UserCCTROdd08Decay30
unknown: 88
|
UserCCTR2Odd08Decay30
unknown: 89
|
UserPCTROdd08Decay30
unknown: 90
|
UserClicksOdd08Decay30
unknown: 91
|
UserClicksOdd08Decay30XClicksThreshold1Decay30
unknown: 92
|
UserCTROdd085Decay30
unknown: 93
|
UserCCTROdd085Decay30
unknown: 94
|
UserCCTR2Odd085Decay30
unknown: 95
|
UserPCTROdd085Decay30
unknown: 96
|
UserClicksOdd085Decay30
unknown: 97
|
UserClicksOdd085Decay30XClicksThreshold1Decay30
unknown: 98
|
UserCTROdd09Decay30
unknown: 99
|
UserCCTROdd09Decay30
unknown: 100
|
UserCCTR2Odd09Decay30
unknown: 101
|
UserPCTROdd09Decay30
unknown: 102
|
UserClicksOdd09Decay30
unknown: 103
|
UserClicksOdd09Decay30XClicksThreshold1Decay30
unknown: 104
|
UserCTRThreshold120Decay1
unknown: 105
|
UserCCTRThreshold120Decay1
unknown: 106
|
UserCCTR2Threshold120Decay1
unknown: 107
|
UserPCTRThreshold120Decay1
unknown: 108
|
UserClicksThreshold120Decay1
unknown: 109
|
UserClicksThreshold120Decay1XClicksThreshold1Decay30
unknown: 110
|
UserCTRDwelltime600Decay1
unknown: 111
|
UserCCTRDwelltime600Decay1
unknown: 112
|
UserCCTR2Dwelltime600Decay1
unknown: 113
|
UserPCTRDwelltime600Decay1
unknown: 114
|
UserClicksDwelltime600Decay1
unknown: 115
|
UserClicksDwelltime600Decay1XClicksThreshold1Decay30
unknown: 116
|
UserCTRPreceding0Position0Threshold120Decay30
unknown: 117
|
UserCTRPreceding1Position0Threshold120Decay30
unknown: 118
|
UserCTRPreceding5Position0Threshold120Decay30
unknown: 119
|
UserCTRPreceding10Position0Threshold120Decay30
unknown: 120
|
UserCTRPosition0Threshold120Decay30
unknown: 121
|
UserCTRPreceding0Position1Threshold120Decay30
unknown: 122
|
UserCTRPreceding1Position1Threshold120Decay30
unknown: 123
|
UserCTRPreceding5Position1Threshold120Decay30
unknown: 124
|
UserCTRPreceding10Position1Threshold120Decay30
unknown: 125
|
UserCTRPosition1Threshold120Decay30
unknown: 126
|
UserWinsThreshold120Decay3XWinsThreshold120Decay30
unknown: 127
|
UserLossesThreshold30Decay30XLossesThreshold5Decay30
unknown: 128
|
UserWinsThreshold120Decay30XWinsThreshold5Decay30
unknown: 129
|
UserLossesThreshold120Decay30XLossesThreshold5Decay30
unknown: 130
|
UserWinsThreshold30Decay3XWinsThreshold30Decay30
unknown: 131
|
UserLossesThreshold120Decay3XLossesThreshold120Decay30
unknown: 132
|
UserWinsThreshold30Decay30XWinsThreshold5Decay30
unknown: 133
|
UserLossesLog300Decay30XLossesLog5Decay30
unknown: 134
|
same_numbers
unknown: 0
the number of answers in which there are numbers from this
|
query_model
unknown: 1
ski uniramic text classifier trained for factskeeping queries from Toloka
|
facts_w2v_sim
unknown: 2
The average similarity of the response to others, counted on the basis of the non -metered Word2VEC Runet
|
queryfact_w2v_sim
unknown: 3
The similarity of Snippet to a request, calculated on the basis of an unelematic Word2Vec Runet
|
query_host2vec_weight
unknown: 4
The value of logistics regression predicting the question of the request for the average vector of hosts to issue
|
querydoc_host2vec
unknown: 5
The value of logistics regression predicting that the answer is correct, according to the concatenation of the Khost vector and the medium vector of hosts to issue
|
query_is_encyc
unknown: 6
checks the operation of the impyscar rule of the encyclopedicity of the request
|
cluster_0
unknown: 7
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
cluster_1
unknown: 8
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
cluster_2
unknown: 9
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
cluster_3
unknown: 10
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
cluster_4
unknown: 11
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
cluster_5
unknown: 12
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
cluster_6
unknown: 13
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
cluster_7
unknown: 14
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
cluster_8
unknown: 15
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
cluster_9
unknown: 16
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
cluster_10
unknown: 17
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
cluster_11
unknown: 18
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
cluster_12
unknown: 19
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
cluster_13
unknown: 20
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
cluster_14
unknown: 21
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
cluster_15
unknown: 22
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
cluster_16
unknown: 23
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
cluster_17
unknown: 24
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
cluster_18
unknown: 25
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
cluster_19
unknown: 26
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
cluster_20
unknown: 27
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
cluster_21
unknown: 28
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
cluster_22
unknown: 29
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
cluster_23
unknown: 30
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
cluster_24
unknown: 31
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
cluster_25
unknown: 32
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
cluster_26
unknown: 33
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
cluster_27
unknown: 34
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
cluster_28
unknown: 35
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
cluster_29
unknown: 36
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
cross_model
unknown: 37
The meaning of the regression trained in bigrams in which the first word is taken from the request, the second is from the answer
|
snippet_unigram_weight
unknown: 38
The meaning of the regression trained at the frequencies of words in a snippet predicting that Snippet contains the answer
|
host_fact_score
unknown: 39
The ratio of the number of times when the host was shown on a Serpa with a fact to a total number of times when the host was present on the Serpa.
|
meaningful_word_count
unknown: 40
Palace of words in snippet after filtering
|
avg_similarity
unknown: 41
Average similarity to SNiPPTs to switch words
|
similarity_top1
unknown: 42
The best coincidence according to the words in Actsnipp (CI, the punctuation and words are also not taken into account of 3 characters not containing numbers)
|
similarity_top2
unknown: 43
The second coincidence in the words in Actsnippe (CI, the punctuation and words are also not taken into account of 3 characters not containing numbers)
|
similarity_top3
unknown: 44
The third coincidence according to the words in Actsnipp (CI, the punctuation and words are also not taken into account of 3 characters not containing numbers)
|
similarity_top4
unknown: 45
The fourth coincidence according to Actsnippe (CI, the punctuation and words are also not taken into account of 3 characters not containing numbers)
|
similarity_range90
unknown: 46
the number of coincidences according to the acts differing from the best not more than 90%
|
similarity_range80
unknown: 47
the number of coincidences according to the acts differing from the best by 80-90%
|
similarity_range70
unknown: 48
the number of coincidences according to the acts in the act of the best by 70-80%
|
similarity_range60
unknown: 49
the number of coincidences according to the acts in the act of the best by 60-70%
|
similarity_range50
unknown: 50
the number of coincidences according to the acts in the act of the best by 50-60%
|
neocortex_facts
unknown: 51
The prediction of the NeoCortEx model trained in TextTotext on factory logs
|
neocortex_oml
unknown: 52
The prediction of the NeoCortex model trained in TextTotext on answers
|
neocortex_facts_big
unknown: 53
The prediction of the large NEOCORTEX model trained in TextTotext on answers
|
is_assistant
unknown: 54
a sign that the request came from the assistant
|
bert_factsnip_answer_dssm
unknown: 55
The cosinus between the embezzle of the request and Snippte built by the Bert_estimate_answer.dssm DSSM-Nevi-network, trained on the answers of the search bart, who is learned from the Toloca
|
FI_neocortex_serp_items_wiz_images
unknown: 56
Cosinus between emblems of the request and availability of wiz-images on a Serpa
|
FI_neocortex_serp_items_wiz_video
unknown: 57
Cosinus between emblems of the request and availability of Wiz-Video on a Serpa
|
FI_neocortex_serp_items_union_facts
unknown: 58
Cosinus between embezzles of request and availability of Union-Facts on a Serpa
|
FI_neocortex_serp_items_wiz_musicplayer
unknown: 59
The cosine between the embedding of the request and the availability of Wiz-MusicPlayer on the Serpa
|
FI_neocortex_serp_items_wiz_maps
unknown: 60
Cosinus between embezzles of request and availability of Wiz-Maps on a sickle
|
FI_neocortex_serp_items_positive_query_mx
unknown: 61
Cosinus between the embezzle of the request and the presence of a positive Query MX
|
FI_fact_word_min_frequency
unknown: 62
The minimum word frequency from the request, by the frequencies of words in factary requests
|
FI_fact_word_max_frequency
unknown: 63
The maximum frequency of a word from the request, by the frequencies of words in factual requests
|
FI_fact_word_med_smooth_inverse_frequency
unknown: 64
Squeezed inverted median frequencies from a request, according to the frequencies of words in factary requests
|
FI_fact_word_relative_min_frequency
unknown: 65
The minimum word frequency from the request, by the frequencies of words in factual requests, relative to the frequency of the word in general texts
|
FI_fact_word_relative_mean_frequency
unknown: 66
The average frequency of words from the request, by the frequencies of words in factary requests, relative to the frequencies of words in general texts
|
FI_fact_word_relative_med_smooth_inverse_frequency
unknown: 67
Squeezed inverted median frequencies from a request, according to the frequencies of words in factary requests, relative to the frequencies of words in general texts
|
FI_fact_bigram_max_frequency
unknown: 68
The maximum frequency of the bigma from the request, according to the frequencies of birams in factary queries
|
FI_norm_query_char_len
unknown: 69
Request length after Normalizetext in symbols
|
FI_norm_query_word_len
unknown: 70
Request length after Normalizetext in words
|
FI_question_word_count
unknown: 71
The number of interrogative words in the request
|
FI_first_advpro_hash
unknown: 72
Hesh of the first pronoun dialect in the request
|
FI_first_preposition_hash
unknown: 73
Hesh of the first preposition in the request
|
FI_long_word_count
unknown: 74
The number of words, the length of which is more than 3, in a normalized request
|
FI_query_normalizied_length_diff
unknown: 75
(length of_nabrication_biez_normalization - length_vseh_normalized_glines) / length_normalized_Prosa (in lengths of words, arithmic progression coefficients)
|
FI_snippet_sentence_count
unknown: 76
Number of proposals in Snippet
|
FI_snippet_bad_sentence_count
unknown: 77
The number of poorly formed sentences in Snippet (does not begin on the capital letter or does not end at the point or contains less than one sentence)
|
FI_snippet_uppercase_words_rel_freq
unknown: 78
The number of words starting with the title letter / number of all words in snippet
|
FI_fact_snippet_true_bert_target_0
unknown: 79
Bert value on an assessment of an act, zero head of multitargete
|
FI_fact_snippet_true_bert_target_1
unknown: 80
The value of BERT on an assessment of the act, the first head of the multitargete
|
FI_fact_snippet_true_info_bert_target_0
unknown: 81
Info Bert value on an assessment of an act
|
RF_Max_Hops
unknown: 0
The number of hops of Url inpans (such as less - closer to the muzzle, the lower the value (0 - the muzzle, 1 - from the muzzle cannot be reached, 0 <can get from the muzzle <1). Normal value for the root of the nosta 0.0039).
|
RF_Max_QueryDOwnerYabarAvgTime
unknown: 1
The average for users Active continuous time of the user is (in second) on the host pages after the transition on request from the search engine (the factor depends on the pair (request, Domattr)).
|
RF_Mean_CommLinksSEOHosts
unknown: 2
The share of incoming corrupt links. The algorithm for recognition of commercial links is implemented. The factor will be remarked to [0.1] if the share of such links is 50%, otherwise 0. ((http://wiki.yandex-team.ru/svetlanashorina/topseolinks selection of wound sites))))))
|
RF_Mean_PercentFreqWords
unknown: 3
The percentage of the number of words, which are 200 the most frequent words of the language, from the number of all words of the text
|
RF_Min_TrigramsCondProb
unknown: 4
Logarithm of the average geometric conditional probabilities of trigrams. The conditional probability of a trigram is its probability, divided by the probability of a bigram from the first two words
|
RF_Max_TextWeightedForms
unknown: 5
The sum of the number of forms balanced by the scales of words - the amount in all words of the request of the number_form_dly_lov/64*weight_lov; REMAP species x/(1 + x).
|
RF_Mean_AdvPronounsPortion
unknown: 6
The proportion of pronoun nouns
|
RF_Min_AdvPronounsPortion
unknown: 7
The proportion of pronoun nouns
|
RF_Max_FemAndMasNounsPortion
unknown: 8
The share of words that can be both masculine nouns and nouns of the feminine, but not of the middle kind, among all nouns (examples: 'hummingbirds' are an example of an indefinite kind that can be determined in two ways, 'Alexander' is homonymy).
|
RF_Mean_LongestText
unknown: 9
The size of the largest text segment (from the factor [18] puretext)
|
RF_QClassKak
unknown: 10
question
|
RF_Removed_11
unknown: 11
|
RF_Removed_12
unknown: 12
|
RF_Removed_13
unknown: 13
|
RF_Mean_NumSlashes
unknown: 14
The number of slashes in Url
|
RF_Max_TitleTrigramsTitle
unknown: 15
Calculates the heading of the heading of the document header with letter trigrams
|
RF_Max_NumLinksFromSegmentContent
unknown: 16
|
RF_Mean_SeoInPayLinks
unknown: 17
The number of COO-Thrilling links between hosts
|
RF_MaxOne
unknown: 18
Returns the maximum degree of household objects in the request under the name Wmaxone. (See ((http://wiki.yandex-team.ru/alekseysokirko/queryobjects SOM-OV)))). ((http://wiki.yandex-team.ru/arsengadzhikurbanov/Wares#maxone more)))))))
|
RF_Max_MetrikaUrlAvgTime
unknown: 19
Similar to Yabarurlavgtime
|
RF_Min_DBM40
unknown: 20
Variation of Temo ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayaformula/tekushhiekomponenty/DBM25 dBM25), cm.
|
RF_Max_NavLinear
unknown: 21
((http://wiki.yandex-team.ru/jandekspoisk/antispam/polunavigacionnyezaprosy#faktornnostiparyurl-zapros classifier)) pairs of vitalnikov [query url], Url Vital for the request, if value is valuable for Ф> 0.
|
RF_QueryThEncyclopedic
unknown: 22
The result of the work of the lexical classifier of requests predicting the likelihood of click on the theme of 3561
|
RF_YabarWordDepthNodesGradientMin
unknown: 23
The angle in the Depth Nodes space, counted only by words (min for all)
|
RF_Mean_SegmentWordPortionFromMainContent
unknown: 24
The share of the words of the document from the segments with Score> 2.
|
RF_Mean_NHopIsFinal
unknown: 25
The number of chains in which Url was the last normalized for the total number of chains in which this URL was.
|
RF_Min_Bclmf
unknown: 26
BCLM for Annotation index, doc text and links.
|
RF_Mean_URLClicksMaxGeoCityFRCWeight
unknown: 27
Normalized corrected clicks count by query with user's city(gc=) mentioned
|
RF_Mean_YabarUrlRevisits
unknown: 28
User return on URL
|
RF_Max_YabarUrlRevisits
unknown: 29
User return on URL
|
RF_Max_CorrectedCtrXfactorValueWcmAvg
unknown: 30
CorrectedctrxFactor in the annotation index, factor Valuewcmavg
|
RF_Max_DoubleFrcQueryMatchPrediction
unknown: 31
DoubleFRC in the annotation index, QueryMatchpredical factor
|
RF_Max_XfDtShowAllMaxFTextCosineMatchMaxPrediction
unknown: 32
Linguistic boosting factor. Type of extensions: XFDTSHOW. Factor: CosinemaxMatchprediction in text and Title. The maximum value of the expansion factor.
|
RF_Max_OneClickFrcXfSpSuffixMatchCount
unknown: 33
OneClickFRC, calculated by the sampled period and collaboratively expanded, SuffixMatchcount factor
|
SF_Mean_tsim
unknown: 34
|
SF_Mean_end
unknown: 35
|
SF_Mean_blocks
unknown: 36
|
SF_Mean_mtch2
unknown: 37
|
SF_Max_seg_weight_sum
unknown: 38
|
BF_WebCTR0123
unknown: 39
CTR Web sum of the first 4 elements
|
BF_HasAllWordsTRFmHisto5hFraction
unknown: 40
Similarly, HasallwordStrfmhisto3dfuction, in the numerator - the number of documents over the past 5 hours
|
BF_AutoHostClassifier
unknown: 41
host classifier for auto vertical
|
BF_ClassificationKak
unknown: 42
Classification wizard rule class Kak
|
BF_VideoMaxWordsCSTR
unknown: 43
VideoMaxWordsCSTR
|
FF_FI_query_model
unknown: 44
ski uniramic text classifier trained for factskeeping queries from Toloka
|
FF_Max_FI_queryfact_w2v_sim
unknown: 45
The similarity of Snippet to a request, calculated on the basis of an unelematic Word2Vec Runet
|
FF_FI_query_host2vec_weight
unknown: 46
The value of logistics regression predicting the question of the request for the average vector of hosts to issue
|
FF_Mean_FI_querydoc_host2vec
unknown: 47
The value of logistics regression predicting that the answer is correct, according to the concatenation of the Khost vector and the medium vector of hosts to issue
|
FF_Min_FI_querydoc_host2vec
unknown: 48
The value of logistics regression predicting that the answer is correct, according to the concatenation of the Khost vector and the medium vector of hosts to issue
|
FF_Max_FI_querydoc_host2vec
unknown: 49
The value of logistics regression predicting that the answer is correct, according to the concatenation of the Khost vector and the medium vector of hosts to issue
|
FF_FI_query_is_encyc
unknown: 50
checks the operation of the impyscar rule of the encyclopedicity of the request
|
FF_FI_cluster_7
unknown: 51
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
FF_FI_cluster_8
unknown: 52
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
FF_Mean_FI_cross_model
unknown: 53
The meaning of the regression trained in bigrams in which the first word is taken from the request, the second is from the answer
|
FF_Max_FI_cross_model
unknown: 54
The meaning of the regression trained in bigrams in which the first word is taken from the request, the second is from the answer
|
FF_Max_FI_snippet_unigram_weight
unknown: 55
The meaning of the regression trained at the frequencies of words in a snippet predicting that Snippet contains the answer
|
SF_Mean_fq_schema_is_question
unknown: 56
The page has a marking Schema.org Question (Mean)
|
SF_Max_fq_schema_is_question
unknown: 57
The page has a marking Schema.org Question (Max)
|
SF_Mean_fq_schema_has_approved_answer
unknown: 58
The page has a marking Schema.org Question and the best answer was selected (Mean)
|
SF_Max_fq_schema_has_approved_answer
unknown: 59
The page has a marking Schema.org Question and the best answer was selected (max)
|
SF_Mean_fq_schema_best_ans_word_count
unknown: 60
The length of the best answer Schema.org Question in words (Mean)
|
SF_Max_fq_schema_best_ans_word_count
unknown: 61
The length of the best answer Schema.org Question in words (max)
|
SF_Mean_fq_schema_best_ans_upvote_count
unknown: 62
The number of votes for the best answer to Schema.org Question (Mean)
|
SF_Max_fq_schema_best_ans_upvote_count
unknown: 63
The number of votes for the best answer to Schema.org Question (Max)
|
SF_Mean_fq_schema_best_ans_max_span_lcswc_div_span_wc
unknown: 64
The length of the greatest overall tuning of the best response to Schema.org Question and Snippet (share of SNIPPET words) (Mean)
|
SF_Max_fq_schema_best_ans_max_span_lcswc_div_span_wc
unknown: 65
The length of the greatest overall tuning of the best response to Schema.org Question and Snippet (share of SNIPPET words) (MAX)
|
SF_Mean_fq_schema_best_ans_max_span_lcswc_div_ans_wc
unknown: 66
The length of the greatest overall tuning of the best response to Schema.org Question and Snippet (share of words of the best answer) (Mean) (Mean)
|
SF_Max_fq_schema_best_ans_max_span_lcswc_div_ans_wc
unknown: 67
The length of the greatest overall tuning of the best response to Schema.org Question and Snippet (share of words of the best answer) (Max)
|
SF_Mean_fq_schema_best_ans_lcsw_pos_ratio_in_snip
unknown: 68
The position of the greatest overall tuning of the best response to Schema.org Question and Snippte in Snippet (Mean)
|
SF_Max_fq_schema_best_ans_lcsw_pos_ratio_in_snip
unknown: 69
The position of the greatest overall tuning of the best response to Schema.org Question and Snippte in Snippet (MAX)
|
SF_Mean_fq_schema_best_ans_lcsw_pos_ratio_in_ans
unknown: 70
The position of the greatest overall tuning of the best response to Schema.org Question and Snippet in the best answer (Mean)
|
SF_Max_fq_schema_best_ans_lcsw_pos_ratio_in_ans
unknown: 71
The position of the greatest overall tuning of the best response to Schema.org Question and Snippet in the best answer (Max)
|
SF_Mean_fq_schema_matched_ans_word_count
unknown: 72
The length in the words of Schema.org Question response, most similar to snippet
|
SF_Max_fq_schema_matched_ans_word_count
unknown: 73
The length in the words of Schema.org Question response, most similar to snippet (max)
|
SF_Mean_fq_schema_matched_ans_upvote_count
unknown: 74
The number of votes for the answer Schema.org Question, most similar to Snippet (Mean)
|
SF_Max_fq_schema_matched_ans_upvote_count
unknown: 75
The number of votes for the answer Schema.org Question, most similar to Snippet (Max)
|
SF_Mean_fq_schema_matched_ans_max_span_lcswc_div_span_wc
unknown: 76
The length of the largest overall substitution of Schema.org Question response, most similar to snippet, and snippet (share of SNIPPET words) (Mean)
|
SF_Max_fq_schema_matched_ans_max_span_lcswc_div_span_wc
unknown: 77
The length of the largest overall substitution of Schema.org Question response, most similar to snippet, and snippet (share of the words of Snippte) (Max)
|
SF_Mean_fq_schema_matched_ans_max_span_lcswc_div_ans_wc
unknown: 78
The length of the largest overall substitution of Schema.org Question response, most similar to snippet, and snippet (share of the words of the answer) (Mean) (Mean)
|
SF_Max_fq_schema_matched_ans_max_span_lcswc_div_ans_wc
unknown: 79
The length of the largest overall substitution of Schema.org Question response, most similar to snippet, and snippet (share of the words of the answer) (max)
|
SF_Mean_fq_schema_matched_ans_lcsw_pos_ratio_in_snip
unknown: 80
Schema.org Question response position, most similar to snippet, and snippet in snippet (Mean)
|
SF_Max_fq_schema_matched_ans_lcsw_pos_ratio_in_snip
unknown: 81
Schema.org Question response position, most similar to snippet, and snippet in snippet (Max)
|
SF_Mean_fq_schema_matched_ans_lcsw_pos_ratio_in_ans
unknown: 82
Schema.org Question response position, most similar to snippet, and snippet in the answer (Mean)
|
SF_Max_fq_schema_matched_ans_lcsw_pos_ratio_in_ans
unknown: 83
Schema.org Question response position, most similar to snippet, and snippet in the answer (max)
|
Sf_Median_fq_ru_fact_snippet_dssm_factoid_score
unknown: 84
Median Factoid DSSM-Steams <Request, snippet> according to the first documents of the issuance of the FACTS-747, FACTS-19
|
Sf_Min_fq_ru_fact_snippet_dssm_factoid_score
unknown: 85
A minimum of factual DSSM-surplus pairs <request, snippet> according to the first documents of the issuance of the FACTS-747, FACTS-19
|
Sf_Std_fq_ru_fact_snippet_dssm_factoid_score
unknown: 86
The average quadratic deviation of the vector of Factoid DSSM-surgery steam <request, snippet> according to the first documents of the issuance of the FACTS-747, FACTS-19
|
FF_FI_is_assistant
unknown: 87
a sign that the request came from the assistant
|
Sf_Median_fq_tomato_dssm_factoid_score
unknown: 88
Median Factoid DSSM scors according to the new Tomato DSSM formula. Facts-2545
|
Sf_Min_fq_tomato_dssm_factoid_score
unknown: 89
A minimum of factual DSSM-scorches according to the new Tomato DSSM formula. Facts-2545
|
Sf_Std_fq_tomato_dssm_factoid_score
unknown: 90
The average deviation of DSSM scors according to the new Tomato DSSM formula. Facts-2545
|
FF_FI_neocortex_serp_items_wiz_images
unknown: 91
Cosinus between emblems of the request and availability of wiz-images on a Serpa
|
FF_FI_neocortex_serp_items_wiz_video
unknown: 92
Cosinus between emblems of the request and availability of Wiz-Video on a Serpa
|
FF_FI_neocortex_serp_items_union_facts
unknown: 93
Cosinus between embezzles of request and availability of Union-Facts on a Serpa
|
FF_FI_neocortex_serp_items_wiz_musicplayer
unknown: 94
The cosine between the embedding of the request and the availability of Wiz-MusicPlayer on the Serpa
|
FF_FI_neocortex_serp_items_wiz_maps
unknown: 95
Cosinus between embezzles of request and availability of Wiz-Maps on a sickle
|
FF_FI_neocortex_serp_items_positive_query_mx
unknown: 96
Cosinus between the embezzle of the request and the presence of a positive Query MX
|
Sf_Max_fq_tomato_dssm_factoid_score
unknown: 97
The maximum of DSSM-skors on the new Tomato DSSM formula. Facts-2545
|
Ff_Max_fi_bert_factsnip_answer_dssm
unknown: 98
MAXM DSSM-SCOROV by models bert_factsnip_answer
|
Ff_Mean_fi_bert_factsnip_answer_dssm
unknown: 99
Average DSSM-School according to the Bert_FACTSNIP_ANSWER model
|
FF_FI_fact_word_min_frequency
unknown: 100
The minimum word frequency from the request, by the frequencies of words in factary requests
|
FF_FI_fact_word_max_frequency
unknown: 101
The maximum frequency of a word from the request, by the frequencies of words in factual requests
|
FF_FI_fact_word_med_smooth_inverse_frequency
unknown: 102
Squeezed inverted median frequencies from a request, according to the frequencies of words in factary requests
|
FF_FI_fact_word_relative_min_frequency
unknown: 103
The minimum word frequency from the request, by the frequencies of words in factual requests, relative to the frequency of the word in general texts
|
FF_FI_fact_word_relative_mean_frequency
unknown: 104
The average frequency of words from the request, by the frequencies of words in factary requests, relative to the frequencies of words in general texts
|
FF_FI_fact_word_relative_med_smooth_inverse_frequency
unknown: 105
Squeezed inverted median frequencies from a request, according to the frequencies of words in factary requests, relative to the frequencies of words in general texts
|
FF_FI_fact_bigram_max_frequency
unknown: 106
The maximum frequency of the bigma from the request, according to the frequencies of birams in factary queries
|
FF_FI_norm_query_char_len
unknown: 107
Request length after Normalizetext in symbols
|
FF_FI_norm_query_word_len
unknown: 108
Request length after Normalizetext in words
|
FF_FI_question_word_count
unknown: 109
The number of interrogative words in the request
|
FF_FI_first_advpro_hash
unknown: 110
Hesh of the first pronoun dialect in the request
|
FF_FI_first_preposition_hash
unknown: 111
Hesh of the first preposition in the request
|
FF_FI_long_word_count
unknown: 112
The number of words, the length of which is more than 3, in a normalized request
|
FF_FI_query_normalizied_length_diff
unknown: 113
(length of_nabrication_biez_normalization - length_vseh_normalized_glines) / length_normalized_Prosa (in lengths of words, arithmic progression coefficients)
|
RF_MEAN_FI_BQPRSampleMixMatchWeightedValue
unknown: 114
среднее от MixMatchWeightedValue factor over hits from BQPRSample stream
|
RF_MEAN_FI_SamplePeriodDayFrcFullMatchValue
unknown: 115
среднее от FullMatchValue factor over hits from SamplePeriodDayFrc stream
|
RF_MEAN_FI_SamplePeriodDayFrcMixMatchWeightedValue
unknown: 116
Average from SampleperiodDayFrcMixMatchWeightedValue
|
RF_STD_FI_DoubleFrcCMMatchTop5AvgMatch
unknown: 117
Average Deviation DoubleFrcmatchtop5AVGMATCH
|
RF_STD_FI_OneClickFrcXfSpPerWordCMMaxMatchMin
unknown: 118
Average deviation from SampleperidDayFrcMixMatchWeightedValue
|
RF_MEAN_FI_AvgDTWeightedByRankMobileFullMatchValue
unknown: 119
Middle from AvgdtWeightedbyrankmobilefullmatchvalue
|
RF_MIN_FI_QfufAllAvgW
unknown: 120
Minimum from Qfufallavgw, which: Linguistic Boosting factor. The average weight of the QFUF type extensions.
|
RF_MAX_FI_QfufAllTotalW
unknown: 121
Maximum from Qfufalltotalw, which: Linguistic Boosting factor. Type of extensions: QFUF. Transferred the total weight of the extensions.
|
RF_MIN_FI_RandomLogQueryAvgAddTime
unknown: 122
Minimum from RandomlogqueryavGaddtime: ADDTIME average value for the year. It is calculated in offline.
|
RF_MIN_FI_RandomLogQueryAvgTxtHiRelSy
unknown: 123
At least from RandomlogqueryavgtXthirelsy: the average Txthirelsy value for the year. It is calculated in offline.
|
RF_MIN_FI_RandomLogQueryAvgTextLike
unknown: 124
At least from Randomlogqueryavgtextlike: the average Textlike value for the year. It is calculated in offline.
|
RF_MAX_FI_RandomLogQueryAvgHasNoAllWordsTRSy
unknown: 125
Maximum from Randomlogqueryavghasnoallwordstersy: The average Hasnoallwordstersy value for the year. It is calculated in offline.
|
RF_MIN_FI_RandomLogQueryAvgIsForum
unknown: 126
minimum of randomlogqueryavgisforum
|
RF_MIN_FI_RandomLogQueryAvgQueryDOwnerOnlyClickRate
unknown: 127
minimum of randomlogQueryvgQuerydowneronlyclickrate
|
RF_MIN_FI_RandomLogQueryAvgLongestText
unknown: 128
minimum of randomlogqueryavglongesttext
|
RF_MEAN_FI_RandomLogQueryAvgDifferentInternalLinks
unknown: 129
Average from RandomlogqueryavgdiferentinTernallinks
|
RF_MIN_FI_RandomLogQueryAvgQueryDOwnerOnlyClickRate_Reg
unknown: 130
minimum of randomlogQueryvgQuerydowneronlyclickrate_reg
|
RF_MIN_FI_RandomLogQueryAvgBM25_0
unknown: 131
minimum of randomlogqueryavgbm25_0
|
RF_MIN_FI_RandomLogQueryAvgQueriesAvgCM2
unknown: 132
minimum of randomlogQueryvgqueriesavgcm2
|
RF_MIN_FI_RandomLogQueryAvgRegBrowserUserHub
unknown: 133
minimum of randomlogQueryavgregBrowseruserhub
|
RF_MAX_FI_RandomLogQueryAvgQueryUrlCorrectedCtrXfactor
unknown: 134
maximum of randomlogQueryvgquieurlcorreCTRXTRXFACTOR
|
RF_MIN_FI_RandomLogQueryAvgXfDtShowAllSumWFSumWBodyMinWindowSize
unknown: 135
A minimum of randomlogQueryavgxfdshowallsumwfsumwbodyminwindowsize
|
RF_MAX_FI_RandomLogQueryClicksWeightedAvgYabarUrlAvgTime
unknown: 136
Maximum from RandomlogQuryClicksweightedavgyabarururlavgtime: Malinated by clicks Yabarurlavgtime for the year. It is calculated in offline.
|
RF_MAX_FI_RandomLogQueryClicksWeightedAvgDifferentInternalLinks
unknown: 137
maximum of randomlogQueryclickSweigtedavgdifferentinternllinks
|
RF_STD_FI_VpcgCorrectedClicksSLPPerWordCMMaxPredictionMin
unknown: 138
Average deviation from VPCGCORRECTEDCLICSSLPPERWORDCMAXPREXPRECTIONMIN
|
RF_MEAN_FI_VpcgCorrectedClicksSLPMixMatchWeightedValue
unknown: 139
среднее от VpcgCorrectedClicksSLPMixMatchWeightedValue
|
RF_STD_FI_VpcgCorrectedClicksSLPCMMatchTop5AvgPrediction
unknown: 140
Average deviation from VPCGCORRECTEDCLICSLPMATCHTOP5AVGPREDION
|
RF_MAX_FI_QueryDoppMedianDwelltime
unknown: 141
Maximum of QuerydopMediandwellTime
|
RF_MEAN_FI_QueryDoppMultipleClicksShows
unknown: 142
Average from QuerydoppmultIclicksshows
|
RF_MAX_FI_QueryDoppMultipleClicksProbability
unknown: 143
Maximum of QuerydoppmultipleClicksprobability
|
RF_MAX_FI_XfDtShowAllTotalW
unknown: 144
maximum of xfdshowalltotalw
|
RF_PR
unknown: 0
Page Rank. The factor will be remarked.
|
RF_Long
unknown: 1
Long document (the longer the document, the greater the value of the factor).
|
RF_SR
unknown: 2
The complex Static Rank is assembled from static components according to a separate formula ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAfermula/tekushhiekomponenty/#oftnd1 *))).
|
RF_Removed_3
unknown: 3
|
RF_LinkQuality
unknown: 4
The quality of incoming links (the classifier of the bream) is broken, cm [405]
|
RF_TextFeatures
unknown: 5
The quality of the text. It is considered a rather complex formula
|
RF_Removed_6
unknown: 6
|
RF_HostSize
unknown: 7
The size of the Host named after Raskovalov in the documents without taking into account the takes (each double is taken into account in the factor by an independent document)
|
RF_LinksWithWordsPercent
unknown: 8
The percentage of incoming links with the words of the request
|
RF_NumWordsTRFm
unknown: 9
The percentage of all the words of the request in the text (with an accuracy to the form)
|
RF_LinkAge
unknown: 10
The average age of links that brought something to LR linkage = min (log (average age of links)/7, 1), 3 years are adopted for 1
|
RF_QueryURLClicksFRC
unknown: 11
the ratio of the number of clicks on this Urlu to all clicks on request
|
RF_PassageLegacyTR
unknown: 12
TR of the best passage - how high -quality snippet
|
RF_TxtBM25AttenSyn
unknown: 13
Tr with discount for suggestions
|
RF_YabarHostAvgActions
unknown: 14
The average for users is the number of active actions (clicks, clicks) with the continuous finding of the user (in second) on the pages of the host.
|
RF_YabarUrlVisits
unknown: 15
Varla's attendance according to I-Bara
|
RF_YabarUrlAvgTime
unknown: 16
The average for users time is the user on the page. It is read as the difference between neighboring transitions.
|
RF_NormalTextIdfSum_broken
unknown: 17
IDF for various parts of the document, broken, are not used
|
RF_Diversity2
unknown: 18
Geographical distribution of the request
|
RF_OwnerSDiffClickEntropy
unknown: 19
Entropy - distribution of clicks
|
RF_OwnerSDiffShowEntropy
unknown: 20
Entropy - distribution of shows
|
RF_GeoRegionalityV
unknown: 21
V- geovital - regional issuance is of fundamental importance
|
RF_SynFLremap1
unknown: 22
Show how much the text is unnatural from the point of view of the Russian language. Assessment of how much the text of the document can be considered as a generated synonymizer or automatic. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAfermula/tekushhiekomponenty/antispam?v=1il#h58953-2 more))
|
RF_UrlSessNormDurRate
unknown: 23
nd/i
|
RF_SyntQuality
unknown: 24
Does the request have a complete syntactic analysis
|
RF_SynNumBadWordPairs
unknown: 25
The proportion of bad steam among all found in the table: Z/(X+1), where Z is the number of bad couples in the text, and X is (http://wiki.yandex-team.ru/evgenijgrechnikov/testsynonimizers of 2000-navigable )) steam
|
RF_NumLatinLetters
unknown: 26
The number of Latin letters in the text (not counting the markings) driven into [0.1] formula n/(n+100)
|
RF_TitleIdfSumFixed
unknown: 27
Previous factors - fixed
|
RF_HeadingIdfSumFixed
unknown: 28
Previous factors - fixed
|
RF_PercentWordsInLinks
unknown: 29
The percentage of the number of words inside the tag <a> .. </a> from the number of all words
|
RF_PercentVisibleContent
unknown: 30
The percentage of the number of words outside the tags (outside the brackets <>) from the number of all words
|
RF_PercentUsedFreqWords
unknown: 31
The number used in the text 500 of the most popular words of the language, divided by 500
|
RF_TrigramsCondProb
unknown: 32
Logarithm of the average geometric conditional probabilities of trigrams. The conditional probability of a trigram is its probability, divided by the probability of a bigram from the first two words
|
RF_TextWeightedForms
unknown: 33
The sum of the number of forms balanced by the scales of words - the amount in all words of the request of the number_form_dly_lov/64*weight_lov; REMAP species x/(1 + x).
|
RF_LinkWeightedForms
unknown: 34
Summer of the number of forms balanced by scales
|
RF_QSegmentsBreaks
unknown: 35
Request segments are parts of the request, which in themselves are frequency requests. The factor shows how much the segments are in the text. value 0 - all words are found only within the framework of the indicated segments, 1 - all the entries break segments
|
RF_ParticlesPortion
unknown: 36
The share of particles
|
RF_AdvPronounsPortion
unknown: 37
The proportion of pronoun nouns
|
RF_VerbsPortion
unknown: 38
The share of verbs
|
RF_FemAndMasNounsPortion
unknown: 39
The share of words that can be both masculine nouns and nouns of the feminine, but not of the middle kind, among all nouns (examples: 'hummingbirds' are an example of an indefinite kind that can be determined in two ways, 'Alexander' is homonymy).
|
RF_QClassKak
unknown: 40
question
|
RF_Removed_41
unknown: 41
|
RF_PositionLanguageModel
unknown: 42
The factor about that, a good snippet can turn out.
|
RF_AuraDocLogAuthor
unknown: 43
Logarithm of the number of shingles on which this owner of the document is recognized as the author
|
RF_AuraDocMeanSharedWeight
unknown: 44
The average weight of non-ugly shingles of this document
|
RF_Removed_45
unknown: 45
|
RF_NumSlashes
unknown: 46
The number of slashes in Url
|
RF_GskUrlModel
unknown: 47
The factor is calculated from the text of Url using the classifier of sequences Quality/Seq/GSK
|
RF_YmwFull
unknown: 48
The size of the minimum piece of text, including all the words of the request found in the document. Not used now. ((http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayAformula/tekushhiekomponenty/ymw Read more))
|
RF_QueryCommercialityMx
unknown: 49
The measure of 'commercial' request. It is a comprehensively calculated Matrixnet factor formula for the procurement vocabulary in direct + for user queries + add. Intensive dictionaries. Requests with intensity to buy a factor seeks to -> 1 commodity requests -> 0.6 with intensity cannot buy, reviews, etc. -> 0 ((http://wiki.yandex-team.ru/Faktorydljanovogokatorazaprosov Factors of the Classifier))) (HTTP : //wiki.yandex-team.ru/jandekspoisk/antispam/antiseo/klassifikATORCHESSKIXZAPROSOV STUNITURE OF HIM))
|
RF_TitleTrigramsQuery
unknown: 50
Calculates the coating of the request with letter trigrams of the document header
|
RF_InlinksModel
unknown: 51
Probabilistic model built on the texts of incoming links
|
RF_OwnerNavQuota
unknown: 52
The share of clicks for navigation requests
|
RF_IsGeo
unknown: 53
It launches on the basic search under the name ISGEO the maximum weight of the meters of the gelator in the request. A geo-object is understood as an object of the category GEO, Geo1, Geoaddr, Geoaddr1, Landmark, Landmark1 (see ((http://wiki.yandex-team.ru/alekseysokirko/queryobjects kaovsky allocation))))))))))))))))))))))))))))))). wiki.yandex-team.ru/arsengadzhikurbanov/wares Read more))
|
RF_BclmLite
unknown: 54
Modification of the BCLM2 factor, lightweight for use in tulle. The main difference is that BCLMLite does not use absolute displacements of words relative to the beginning of the document. Instead, the factor works with the usual positions of the type <number of the_prising, position_v_production>. At the same time, the proximity between the words is taken into account only inside the sentence. (Http://wiki.yandex-team.ru/jandekspoisk/kachestvopoiska/obshayaFormula/tekushichiekomponenty/bclmlite bclmlite)))))))))))))
|
RF_QueryDOwnerSessNormDuration_Reg
unknown: 55
CONTRY / K
|
RF_QueryDOwnerWeightClick_Reg
unknown: 56
w/k
|
RF_SegmentAuxSpacesInText
unknown: 57
The number of spaces in the AUX segment
|
RF_SegmentContentCommasInText
unknown: 58
The number of commas in the Content segment
|
RF_IdfVariance
unknown: 59
Dispersion of IDF words,
|
RF_UrlNGramsModel
unknown: 60
Urlngramsmodel ranking factor in ERF
|
RF_QueryDOwnerWeightedSumFRCAndBM25FdPRFixed
unknown: 61
The amount of factors QueryDownerClicksFRC and BM25FDPRFIXED with scales 0.358449 and 0.184922, respectively. '565' in the name of the factor does not need to be perceived literally, it is Legashi or a typo.
|
RF_NumNonLettersInUrl
unknown: 62
The number of 'Nebukv 'in Url
|
RF_TitleInLinksTrigrams
unknown: 63
The share of unique trigrams in the trigrams of links
|
RF_TrashAdv
unknown: 64
The greasy of the page
|
RF_YabarUrlLcAc
unknown: 65
The number of sessions in which Url was the last, classified as the sessions in which Url appeared
|
RF_TRLRQuorumFm
unknown: 66
The weight of the words of the request that is in the text in the exact form
|
RF_IsText
unknown: 67
It launches on the basic search under the name ISTEXT the maximum weight of the TEXT or Text1 category of the category of the category met in the request. (See ((http://wiki.yandex-team.ru/alekseysokirko/queryobjects SOM-OV)))). ((http://wiki.yandex-team.ru/arsengadzhikurbanov/Wares#istext more)))
|
RF_MinOne
unknown: 68
Returns the maximum degree of household objects in the request under the name Wminone. (See ((http://wiki.yandex-team.ru/alekseysokirko/queryobjects SOM-OV)))). ((http://wiki.yandex-team.ru/arsengadzhikurbanov/Wares#minone more)))))
|
RF_QueryUrlCorrectedCtr_Reg
unknown: 69
'Fixed' clicks calculated using Requestaggregatelib. Regional version
|
RF_NavLinear
unknown: 70
((http://wiki.yandex-team.ru/jandekspoisk/antispam/polunavigacionnyezaprosy#faktornnostiparyurl-zapros classifier)) pairs of vitalnikov [query url], Url Vital for the request, if value is valuable for Ф> 0.
|
RF_QueryThVideohosting
unknown: 71
The result of the work of the lexical classifier of requests predicting the likelihood of click on the page 3973 page
|
RF_ShowsWithAnotherSEClicks
unknown: 72
Urlov shows in the issuance for requests, by which they went to look for other search engines
|
RF_BclmMax
unknown: 73
The proximity of the words of the request to the most difficult word.
|
RF_RegexMaxClickPercentReg
unknown: 74
The share of clicks on this Urlu among all clicks according to similar requests, the country version, see ((http://wiki.yandex-team.ru/development/poisk/Arcadia/indexregex indexregex))))))))
|
RF_YabarWordDepthNodesGradientMin
unknown: 75
The angle in the Depth Nodes space, counted only by words (min for all)
|
RF_DocCreateMonth
unknown: 76
The time of creating a document with an accuracy of 1.0 is the current month, 0- 10 years ago and older. Temporarily disconnected
|
RF_DocUpdateMonth
unknown: 77
The time for updating the document with an accuracy of 1.0 is the current month, 0- 10 years ago and older. Temporarily disconnected
|
RF_XLRMainPage
unknown: 78
|
RF_DaterStatsAverageSourceSegment
unknown: 79
The arithmetic mean position of dates in the document. Temporarily disconnected
|
RF_DBM15Wares2
unknown: 80
|
RF_SegmentWordPortionFromMainContent
unknown: 81
The share of the words of the document from the segments with Score> 2.
|
RF_QiUrlFreqWeightedFRCReg
unknown: 82
FRC groups of frequency requests similar to a given, with averaging through the sum of clicks and shows, according to regional statistics
|
RF_QUBm15Weighted
unknown: 83
Weighed BM15 for a request for an index document - a list of requests for which they switched to it.
|
RF_BrowserHostDownloadProbability
unknown: 84
The likelihood of a racing from a host after click (on the logs of the bar).
|
RF_NHopIsFinal
unknown: 85
The number of chains in which Url was the last normalized for the total number of chains in which this URL was.
|
RF_RegBrowserUserHub
unknown: 86
The page indicator is like a hub (how many pages are the bar users pass from it).
|
RF_SameQueryReturnFRCBrowser
unknown: 87
FRC by transitions from requests that were set by the user several times
|
RF_QueryURLISBMCTRReg
unknown: 88
The average weight of the shows on the first page; Click weighs 1, non -click - according to the SBM_GAMMAS table. Regional version
|
RF_YabarUrlRevisits
unknown: 89
User return on URL
|
RF_PrefixSuffixMaxClickPercentReg
unknown: 90
A factor similar to RegexmaxclickPercentreg, but calculated by Preffix-Suffix Generalization.
|
RF_SamplePeriodClickFrcSyn
unknown: 91
The share of Urla in the total number of Urlov closed for the session on request (Synnorm).
|
RF_SamplePeriodDayFrc
unknown: 92
The average share of clicks for this UrLU for this request among all clicks for this request (QNORM) during the day.
|
RF_SamplePeriodDayFrcXfactor
unknown: 93
Request-murl factor. Value is the result of the collaborative filtration of data for the SampleperiodDayFRC factor
|
RF_QiSamplePeriodDayFrc
unknown: 94
QI version of factor 879.
|
RF_CorrectedCtrQueryMatchPrediction
unknown: 95
Correctedctrreg factor in the annotation index, QueryMatchpredical factor
|
RF_SamplePeriodDayFrcAnnotationMatchWeightedValue
unknown: 96
SampleperiodDayFRC Factor in the annotation index, AnnotationMatchprediction factor
|
RF_LongClickAnnotationMatchWeightedValue
unknown: 97
LongClick Factor in the annotation index, AnnotationMatchpredical factor
|
RF_BQPRSampleAnnotationMatchWeightedValue
unknown: 98
BQPR Factor in the annotation index, AnnotationMatchpredical factor
|
RF_OneClickSynonymMatchPrediction
unknown: 99
OneClick Factor in the annotation index, SynonyMatchpredical factor
|
RF_OneClickFullMatchValue
unknown: 100
OneClick factor in the annotation index, Fullmatchpredical factor
|
RF_OneClickBclmWeightedK3
unknown: 101
OneClick factor in the annotation index, factor BCLMWEIGHTEDK3
|
RF_FractionOfQueriesWithGeoPredicted
unknown: 102
Prediction of a share of requests with geography on a bag of words built for request
|
RF_CorrectedCtrXfactorQueryMatchPrediction
unknown: 103
CorrectedctrxFactor in the annotation index, QueryMatchpredical factor
|
RF_CorrectedCtrXfactorAllWcmMaxPrediction
unknown: 104
CorrectedctrxFactor in the annotation index, factor Valuewcmmax
|
RF_CorrectedCtrXfactorAllWcmMatch80AvgValue
unknown: 105
CorrectedctrxFactor in the annotation index, factor Valuewcmavg
|
RF_RequestWithRegionNameLongClickSPAnnotationMatchWeightedValue
unknown: 106
Linguistic boosting factor. Type of extensions: Requestwithregionname. Factor: AnnotationMatchWeightedValue by stream LongClicksp.
|
RF_DoubleFrcFullMatchValue
unknown: 107
DoubleFRC in the annotation index, Fullmatchpredical factor
|
RF_DoubleFrcAnnotationMatchWeightedValue
unknown: 108
DoubleFRC in the annotation index, AnnotationMatchpredical factor
|
REMOVED_109
unknown: 109
removed
|
REMOVED_110
unknown: 110
removed
|
REMOVED_111
unknown: 111
removed
|
REMOVED_112
unknown: 112
removed
|
REMOVED_113
unknown: 113
removed
|
REMOVED_114
unknown: 114
removed
|
RF_XfDtShowAllMinW
unknown: 115
Linguistic boosting factor. Type of extensions: XFDTSHOW. Factor: The minimum expansion weight.
|
RF_XfDtShowAllMaxFFieldSet3BclmWeightedFLogW0K0001
unknown: 116
Linguistic boosting factor. Type of extensions: XFDTSHOW. Factor: BCLMWEIGHTEDFLOGW0 in the Stream group 3. The maximum value of the expansion factor.
|
RF_XfDtShowAllMaxFFieldSetUTBm15FLogW0
unknown: 117
Linguistic boosting factor. Type of extensions: XFDTSHOW. Factor: BM15FLOGW0 for Urlu and Title. The maximum value of the expansion factor.
|
RF_XfDtShowAllMaxWFLongClickSPFullMatchValue
unknown: 118
Linguistic boosting factor. Type of extensions: XFDTSHOW. Factor: Fullmatchvalue by stream LongClicksp. The maximum balanced value of the expansion factor.
|
RF_XfDtShowAllMaxWFOneClickFullMatchValue
unknown: 119
Linguistic boosting factor. Type of extensions: XFDTSHOW. Factor: Fullmatchvalue according to Stream OneClick. The maximum balanced value of the expansion factor.
|
RF_XfDtShowAllSumW2FSumWFieldSet1Bm15FLogK0001
unknown: 120
Linguistic boosting factor. Type of extensions: XFDTSHOW. Factor: BM15FLOG by the Stream group 1. The average balanced values of the factor multiplied by weight (\ frac {\ sum w_i * (w_i * f_i)} {\ sum w_i}) for extensions.
|
RF_XfDtShowAllSumW2FSumWFieldSetUTBm15FLogW0
unknown: 121
Linguistic boosting factor. Type of extensions: XFDTSHOW. Factor: BM15FLOGW0 for Urlu and Title. The average balanced values of the factor multiplied by weight (\ frac {\ sum w_i * (w_i * f_i)} {\ sum w_i}) for extensions.
|
RF_XfDtShowAllSumWFSumWBodyMinWindowSize
unknown: 122
Linguistic boosting factor. Type of extensions: XFDTSHOW. Factor: Minwindowsize in text. The average balanced values of the expansion factor.
|
RF_XfDtShowBagOfWordsFieldSetBagOfWordsOriginalRequestFractionExact
unknown: 123
Linguistic boosting factor. Type of extensions: XFDTSHOW. Factor: ORIGINALREQUARY ORIGINALREKETRACTRENEXACT for a group of streams for bag factors (text, Title, annotation streams).
|
RF_XfDtShowBagOfWordsLongClickSPCosineMatchWeightedValue
unknown: 124
Linguistic boosting factor. Type of extensions: XFDTSHOW. Factor: CosinematchWeightedValue bag by stream LongClicksp.
|
RF_XfDtShowBagOfWordsSimpleClickAnnotationMatchAvgValue
unknown: 125
Linguistic boosting factor. Type of extensions: XFDTSHOW. Factor: SIMPLECLIC SIMPLECLICS bag.
|
RF_XfDtShowBagOfWordsTitleCosineMaxMatch
unknown: 126
Linguistic boosting factor. Type of extensions: XFDTSHOW. Factor: CosinemaxMattcg bag.
|
RF_XfDtShowTopMinWFFieldSet3BclmWeightedFLogW0K0001
unknown: 127
Linguistic boosting factor. Type of extensions: XFDTSHOW. Factor: BCLMWEIGHTEDFLOGW0 in the Stream group 3. The minimum balanced value of the factor for the expansion top.
|
RF_XfDtShowTopMinWFLongClickSPAnnotationMatchWeightedValue
unknown: 128
Linguistic boosting factor. Type of extensions: XFDTSHOW. Factor: AnnotationMatchWeightedValue by stream LongClicksp. The minimum balanced value of the factor on the expansion top.
|
RF_XfDtShowTopMinWFMaxWLongClickSPAnnotationMatchWeightedValue
unknown: 129
Linguistic boosting factor. Type of extensions: XFDTSHOW. Factor: AnnotationMatchWeightedValue by stream LongClicksp. The minimum balanced value of the factor for the expansion top extensions normalized for maximum weight by the Top Extensions.
|
RF_XfDtShowTopSumW2FSumWLongClickSPFullMatchValue
unknown: 130
Linguistic boosting factor. Type of extensions: XFDTSHOW. Factor: Fullmatchvalue by stream LongClicksp. The average balanced values of the factor multiplied by weight (\ frac {\ sum w_i * (w_i * f_i)} {\ sum w_i}) according to the expansion top.
|
RF_XfDtShowTopSumW2FSumWOneClickFullMatchValue
unknown: 131
Linguistic boosting factor. Type of extensions: XFDTSHOW. Factor: Fullmatchvalue according to Stream OneClick. The average balanced values of the factor multiplied by weight (\ frac {\ sum w_i * (w_i * f_i)} {\ sum w_i}) according to the expansion top.
|
RF_XfDtShowTopSumWFSumWFieldSet3BclmWeightedFLogW0K0001
unknown: 132
Linguistic boosting factor. Type of extensions: XFDTSHOW. Factor: BCLMWEIGHTEDFLOGW0 in the Stream group 3. The average balanced values of the factor for the expansion top.
|
RF_OneClickFrcXfSpFullMatchPrediction
unknown: 133
OneClickFRC, calculated by the sampled period and collaboratively expanded, Fullmatchpredical factor
|
RF_OneClickFrcXfSpAnnotationMatchWeightedValue
unknown: 134
OneClickFRC, calculated by the sampled period and collaboratively expanded, AnnotationMatchpredictionWeighted factor
|
RF_OneClickFrcXfSpWcmCoveragePrediction
unknown: 135
OneClickFRC, calculated by the sampled period and collaboratively expanded, WCMCOVERAGEPREDION factor
|
RF_IsLocalProbability
unknown: 136
The value of the classifier of localization for request
|
RF_FullUrlFraction
unknown: 137
URL coating with trigrams from the request. Analogue of Urldomainfraction, Urlpathandparamsfraction factors.
|
SF_unique_w
unknown: 138
|
SF_good
unknown: 139
|
SF_tsim
unknown: 140
|
SF_pbeg
unknown: 141
|
SF_blocks
unknown: 142
|
SF_rdots
unknown: 143
|
SF_ursrls
unknown: 144
|
SF_ursrlns
unknown: 145
|
SF_ns_wpos_pct
unknown: 146
|
SF_seg_weight_sum
unknown: 147
|
SF_seg_middle_word
unknown: 148
|
SF_seg_spaces_per_symbol
unknown: 149
|
SF_wtit_good
unknown: 150
|
SF_wtit_same_w
unknown: 151
|
SF_wtit_ns_wpos_pct
unknown: 152
|
SF_wtit_upos_pct
unknown: 153
|
SF_real_len
unknown: 154
|
SF_fq_is_definition
unknown: 155
|
BF_VideoWizAvgRelExp
unknown: 156
|
BF_ImagesWebToImagesLastSessions
unknown: 157
The share of the request sessions that ended in Y. Smarts after the transition from the search web
|
BF_WebCTR0123
unknown: 158
CTR Web sum of the first 4 elements
|
BF_WaresIsClothes
unknown: 159
Clothes detected in query
|
Removed160
unknown: 160
|
Removed161
unknown: 161
|
BF_QuickDocFraction
unknown: 162
QuickDocCount * 100 / WebDocCountFresh
|
BF_QuickPeakLen
unknown: 163
Peak size (in watches), 50 top documents are taken into account from each fast -butter machine
|
Removed164
unknown: 164
|
Removed165
unknown: 165
|
Removed166
unknown: 166
|
BF_WaresTextIntent
unknown: 167
|
BF_AfishaEventHostClassifier
unknown: 168
afisha host classifier
|
BF_AutoHostClassifier
unknown: 169
host classifier for auto vertical
|
BF_ClassificationKak
unknown: 170
Classification wizard rule class Kak
|
BF_VideoMaxWordsCSTR
unknown: 171
VideoMaxWordsCSTR
|
BF_ImagesLogProdWordsFRC
unknown: 172
ImagesLogProdWordsFRC
|
BF_ImagesLogProdWordsSTR
unknown: 173
ImagesLogProdWordsSTR
|
BF_ImagesMaxWordsCSTR
unknown: 174
ImagesMaxWordsCSTR
|
BF_VideoDocCount
unknown: 175
Number of documents in dg grouping
|
BF_VideoQuickDocCount
unknown: 176
Number of documents in dgq grouping
|
FF_FI_same_numbers
unknown: 177
the number of answers in which there are numbers from this
|
FF_FI_query_model
unknown: 178
ski uniramic text classifier trained for factskeeping queries from Toloka
|
FF_FI_facts_w2v_sim
unknown: 179
The average similarity of the response to others, counted on the basis of the non -metered Word2VEC Runet
|
FF_FI_queryfact_w2v_sim
unknown: 180
The similarity of Snippet to a request, calculated on the basis of an unelematic Word2Vec Runet
|
FF_FI_query_host2vec_weight
unknown: 181
The value of logistics regression predicting the question of the request for the average vector of hosts to issue
|
FF_FI_querydoc_host2vec
unknown: 182
The value of logistics regression predicting that the answer is correct, according to the concatenation of the Khost vector and the medium vector of hosts to issue
|
FF_FI_query_is_encyc
unknown: 183
checks the operation of the impyscar rule of the encyclopedicity of the request
|
FF_FI_cluster_1
unknown: 184
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
FF_FI_cluster_4
unknown: 185
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
FF_FI_cluster_8
unknown: 186
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
FF_FI_cluster_10
unknown: 187
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
FF_FI_cluster_11
unknown: 188
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
FF_FI_cluster_12
unknown: 189
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
FF_FI_cluster_13
unknown: 190
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
FF_FI_cluster_14
unknown: 191
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
FF_FI_cluster_16
unknown: 192
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
FF_FI_cluster_18
unknown: 193
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
FF_FI_cluster_19
unknown: 194
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
FF_FI_cluster_20
unknown: 195
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
FF_FI_cluster_21
unknown: 196
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
FF_FI_cluster_22
unknown: 197
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
FF_FI_cluster_24
unknown: 198
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
FF_FI_cluster_27
unknown: 199
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
FF_FI_cluster_28
unknown: 200
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
FF_FI_cluster_29
unknown: 201
the proximity of the total host vector to one of the clusters built according to the answers from Toloka
|
FF_FI_cross_model
unknown: 202
The meaning of the regression trained in bigrams in which the first word is taken from the request, the second is from the answer
|
FF_FI_snippet_unigram_weight
unknown: 203
The meaning of the regression trained at the frequencies of words in a snippet predicting that Snippet contains the answer
|
SF_fq_schema_is_question
unknown: 204
The page has a scheme of Schema.org Question
|
SF_fq_schema_has_approved_answer
unknown: 205
The page has a marking Schema.org Question and the best answer was selected
|
SF_fq_schema_best_ans_word_count
unknown: 206
Schema.org Question in words in words
|
SF_fq_schema_best_ans_upvote_count
unknown: 207
The number of votes for the best answer Schema.org Question
|
SF_fq_schema_best_ans_max_span_lcswc_div_span_wc
unknown: 208
The length of the greatest overall tuning of the best response to Schema.org Question and Snippet (share of SNIPPET words)
|
SF_fq_schema_best_ans_max_span_lcswc_div_ans_wc
unknown: 209
The length of the greatest overall tuning of the best response to Schema.org Question and Snippet (share of words of the best answer)
|
SF_fq_schema_best_ans_lcsw_pos_ratio_in_snip
unknown: 210
The position of the greatest overall tuning of the best response to Schema.org Question and Snippet in Snippet
|
SF_fq_schema_best_ans_lcsw_pos_ratio_in_ans
unknown: 211
The position of the greatest overall tuning of the best answer to Schema.org Question and Snippet in the best response
|
SF_fq_schema_matched_ans_word_count
unknown: 212
The length in the words of the answer Schema.org Question, most similar to snippet
|
SF_fq_schema_matched_ans_upvote_count
unknown: 213
the number of votes for the answer Schema.org Question, most similar to snippet
|
SF_fq_schema_matched_ans_max_span_lcswc_div_span_wc
unknown: 214
Schema.org Question, which is most similar to Snippet, and Snippet (share of SNIPPET words), the most common overall substitution of the response.
|
SF_fq_schema_matched_ans_max_span_lcswc_div_ans_wc
unknown: 215
The length of the largest overall substitution of Schema.org Question response, most similar to snippet, and snippet (share of the words of the answer)
|
SF_fq_schema_matched_ans_lcsw_pos_ratio_in_snip
unknown: 216
Schema.org Question response position, most similar to snippet, and snippet in snippet
|
SF_fq_schema_matched_ans_lcsw_pos_ratio_in_ans
unknown: 217
Schema.org Question response position, the most similar to snippet, and snippet in response
|
SF_fq_ru_fact_snippet_dssm_factoid_score
unknown: 218
The cosine between the embeddings of the request and snippet built by the DSSM neuroset trained on the answers of Mail.ru. Facts-717, Facts-19.
|
FF_FI_host_fact_score
unknown: 219
The ratio of the number of times when the host was shown on a Serpa with a fact to a total number of times when the host was present on the Serpa.
|
FF_FI_meaningful_word_count
unknown: 220
The number of words in the factual fracture of Snippet (CI, the punctuation and words are also not taken into account of 3 characters not containing numbers)
|
FF_FI_avg_similarity
unknown: 221
Average similarity to SNiPPTs to switch words
|
FF_FI_similarity_top1
unknown: 222
The best coincidence according to the words in Actsnipp (CI, the punctuation and words are also not taken into account of 3 characters not containing numbers)
|
FF_FI_similarity_top2
unknown: 223
The second coincidence in the words in Actsnippe (CI, the punctuation and words are also not taken into account of 3 characters not containing numbers)
|
FF_FI_similarity_top3
unknown: 224
The third coincidence according to the words in Actsnipp (CI, the punctuation and words are also not taken into account of 3 characters not containing numbers)
|
FF_FI_similarity_top4
unknown: 225
The fourth coincidence according to Actsnippe (CI, the punctuation and words are also not taken into account of 3 characters not containing numbers)
|
FF_FI_similarity_range90
unknown: 226
the number of coincidences according to the acts differing from the best not more than 90%
|
FF_FI_similarity_range80
unknown: 227
the number of coincidences according to the acts differing from the best by 80-90%
|
FF_FI_similarity_range70
unknown: 228
the number of coincidences according to the acts in the act of the best by 70-80%
|
FF_FI_similarity_range60
unknown: 229
the number of coincidences according to the acts in the act of the best by 60-70%
|
FF_FI_similarity_range50
unknown: 230
the number of coincidences according to the acts in the act of the best by 50-60%
|
DssmOneClickProbability
unknown: 231
DSSM model trained on clicks, target=OneClicks/Clicks. Takes bigrams into account.
|
QueryThEncyclopedic
unknown: 232
The result of the work of the lexical classifier of requests predicting the likelihood of click on the theme of 3561
|
AddTime
unknown: 233
The time of adding a page, more - a more old document; The root is placed from time displayed at the interval [0.1] so that 3+ years gives 1.
|
QfufAllSumWFSumWQueryDwellTimeMixMatchWeightedValue
unknown: 234
Linguistic boosting factor. Type of extensions: QFUF. Factor: MixMatchweightedValue on Querydwelltime Stream. The average balanced values of the expansion factor.
|
DssmRandomLogQueryAvgDaterAge
unknown: 235
The average Dateraage value for the year for a year predicted using a neural network.
|
DssmRandomLogQueryDwelltimeWeightedAvgUrlDomainFraction
unknown: 236
The Malue Network DwellTime-AMI predicted using the neural network is the value of Urldomainfraction for the year.
|
DssmRandomLogQueryAvgXfDtShowAllSumWFSumWBodyMinWindowSize
unknown: 237
The average value of the XFDTSHOWALSUMWFSUMWBODYMINWINDOWSIZE for the year for the year.
|
DssmBoostingXfWeightKMeans5AvgTop02ScoreQE
unknown: 238
Dssm Boosting AvgTop02Score aggregation for XfWeight model over 5-means centroids (query as expansion).
|
DssmQueryEmbeddingCtrNoMinerPca0
unknown: 239
The main components of the requesting Embling from the DSSMCTRNOMINER model
|
ErratumLogQueryProbability
unknown: 240
Double logarithm of the probability of a request for a language model of the Erratum typo service
|
DssmQueryEmbeddingCtrNoMinerPca2
unknown: 241
The main components of the requesting Embling from the DSSMCTRNOMINER model
|
QUrlStatPower
unknown: 242
The number of URL shows on request, normalization x/(100 + x).
|
query_len
unknown: 243
|
plm_like
unknown: 244
|
share_of_punct
unknown: 245
|
sent2
unknown: 246
|
llinks_avp
unknown: 247
|
bag_avg_5
unknown: 248
|
FF_FI_neocortex_facts
unknown: 249
The prediction of the NeoCortEx model trained in TextTotext on factory logs
|
FF_FI_neocortex_oml
unknown: 250
The prediction of the NeoCortex model trained in TextTotext on answers
|
FF_FI_neocortex_facts_big
unknown: 251
The prediction of the large NEOCORTEX model trained in TextTotext on factory logs
|
FF_FI_is_assistant
unknown: 252
a sign that the request came from the assistant
|
SF_fq_tomato_dssm_factoid_score
unknown: 253
Cosinus between the embezzle of the request and snippet built by the Tomato DSSM neuroset; 256.128.otvet_mail_ru.toloka. Facts-2545
|
FF_FI_bert_factsnip_answer_dssm
unknown: 254
The cosinus between the embezzle of the request and Snippte built by the Bert_estimate_answer.dssm DSSM-Nevi-network, trained on the answers of the search bart, who is learned from the Toloca
|
FF_FI_neocortex_serp_items_wiz_images
unknown: 255
Cosinus between emblems of the request and availability of wiz-images on a Serpa
|
FF_FI_neocortex_serp_items_wiz_video
unknown: 256
Cosinus between emblems of the request and availability of Wiz-Video on a Serpa
|
FF_FI_neocortex_serp_items_union_facts
unknown: 257
Cosinus between embezzles of request and availability of Union-Facts on a Serpa
|
FF_FI_neocortex_serp_items_wiz_musicplayer
unknown: 258
The cosine between the embedding of the request and the availability of Wiz-MusicPlayer on the Serpa
|
FF_FI_neocortex_serp_items_wiz_maps
unknown: 259
Cosinus between embezzles of request and availability of Wiz-Maps on a sickle
|
FF_FI_neocortex_serp_items_positive_query_mx
unknown: 260
Cosinus between the embezzle of the request and the presence of a positive Query MX
|
MF_FI_average_delta
unknown: 261
Numerical derivative 84
|
MF_FI_diff_to_average
unknown: 262
Difference with its average 253
|
MF_FI_diff_to_average_bert_factsnip_answer_dssm
unknown: 263
Difference with its average 254
|
TR
unknown: 264
Text relevance (maxfreq is the frequency of the most frequent word that makes sense of the length of the document).
|
PrBonus
unknown: 265
Priority bonus, priority 7 - text priority. The binary factor, matters 0 for all monosyllabic requests, and the value of 1 for almost all two or more words, except for a very small number of answers for which there is not a single link that has passed quorum, and the text also did not pass the quorum.
|
TRUnmapped
unknown: 266
TR divided by a cube of the number of words in a request and transformed by a standard REMAPTR.
|
HasNoTR
unknown: 267
The document has no TR.
|
SoftAndOk
unknown: 268
The document passed Softand on the restrictions of the syntactic sorcerer. Only for documents with textual relevance. For monosyllabic requests, always 1.
|
TRWithStops
unknown: 269
Weight of maximum coincidence of forms in the text and request
|
JokerWeight
unknown: 270
The ratio of the amount of IDF words in a sentence+Title to all words.
|
TR_W1
unknown: 271
Analogues of the factors of the same name, the weight of the word = 1
|
FirstLastClickMobileCosineMatchMaxPrediction
unknown: 272
CosineMatchMaxPrediction factor over hits from FirstLastClick stream (Mobile sessions filtered)
|
QueryToTextAllSumFCountBodyPairMinProximity
unknown: 273
Linguistic boosting factor. Type of extensions: Querytotext. Factor: PairminProximity according to the contents of the document. The average values of the expansion factor.
|
REMOVED_274
unknown: 274
removed
|
QueryDoppMultipleClicksShows
unknown: 275
The number of shows of the request with more than one click in history. The request is normalized by doppelgangers
|
EthosVideoTextWeight22
unknown: 276
Linear text classifier prediction learned on video production pool with positives and wins factor two
|
uidf
unknown: 277
|
len
unknown: 278
|
end
unknown: 279
|
lenp
unknown: 280
|
FF_FI_snippet_sentence_count
unknown: 281
Number of proposals in Snippet
|
FF_FI_snippet_bad_sentence_count
unknown: 282
The number of poorly formed sentences in Snippet (does not begin on the capital letter or does not end at the point or contains less than one sentence)
|
FF_FI_snippet_uppercase_words_rel_freq
unknown: 283
The number of words starting with the title letter / number of all words in snippet
|
PF_FM_dialogs_all
unknown: 284
The value of the Katbust formula on all assessments from the dialogs
|
PF_FM_dialogs_snip
unknown: 285
The value of the Katbustic formula on estimates from dialogs according to Snippets
|
PF_FM_dialogs_old
unknown: 286
The value of the Katbust formula on estimates from dialogs until 2018
|
QF_QUERY_MX
unknown: 287
The value of the request formula
|
PF_FM_estimate_answers_v1
unknown: 288
The value of the Katbust formula on estimates from the Estimate Answers V1 cache from 2019
|
QF_query_mx2
unknown: 289
The value of the request formula for estimates v2
|
PF_FM_snippet_bert_v2
unknown: 290
Bert Catbust Distillation on an assessment of the act of V2
|
FF_FI_fact_snippet_true_bert_target_0
unknown: 291
Bert value on an assessment of an act, zero head of multitargete
|
FF_FI_fact_snippet_true_bert_target_1
unknown: 292
The value of BERT on an assessment of the act, the first head of the multitargete
|
RF_Removed_293
unknown: 293
|
FF_FI_fact_snippet_true_info_bert_target_0
unknown: 294
Info Bert value on an assessment of an act
|
RF_Removed_295
unknown: 295
Dependence so that when the factor is changed, they do not forget about us
|
RF_Removed_296
unknown: 296
Dependence so that when the factor is changed, they do not forget about us
|
ConfidenceLevel
unknown: 0
Confidence level from misspell service (8000, 10000, etc)
|
RelevDiff
unknown: 1
|
Similaity
unknown: 2
|
WikiSerp
unknown: 3
|
WikiMisspell
unknown: 4
|
QueryLength
unknown: 5
|
FullQuorum
unknown: 6
|
TRp2
unknown: 7
|
FiltrationSegments
unknown: 8
|
TRLRQuorumFm
unknown: 9
|
SmallWindow
unknown: 10
|
Removed_11
unknown: 11
|
AccumulatedRelevDiff
unknown: 12
|
IsOrgWeb
unknown: 13
|
IsOrgMisspell
unknown: 14
|
TovarCategoryFactorDiff
unknown: 15
|
TovarVendorFactorDiff
unknown: 16
|
WebHas10Docs
unknown: 17
|
MisspellHas10Docs
unknown: 18
|
DictSerp
unknown: 19
|
DictMisspell
unknown: 20
|
AccFullQuorum
unknown: 21
|
AccTRp2
unknown: 22
|
AccFiltrationSegments
unknown: 23
|
AccTRLRQuorumFm
unknown: 24
|
AccSmallWindow
unknown: 25
|
Removed_26
unknown: 26
|
IsForeignQuery
unknown: 27
|
ErratumCorrectedWeight
unknown: 28
|
VacantFactor1
unknown: 29
|
VacantFactor2
unknown: 30
|
ErratumOriginalWeight
unknown: 31
|
SnippetUniquedQueryWordsPercent
unknown: 32
|
SnippetUniquedTitleQueryWordsPercent
unknown: 33
|
SnippetUniquedEverywhereQueryWordsPercent
unknown: 34
|
SnippetUniquedHasAllQueryWordsW
unknown: 35
|
SnippetUniquedHasAllQueryWordsM
unknown: 36
|
SnippetUniquedHasAnyQueryWordsW
unknown: 37
|
SnippetUniquedHasAnyQueryWordsM
unknown: 38
|
SnippetUniquedCrossQueryWordsPercent
unknown: 39
|
SnippetUniquedCrossTitleQueryWordsPercent
unknown: 40
|
SnippetUniquedCrossEverywhereQueryWordsPercent
unknown: 41
|
SnippetUniquedCrossHasAllQueryWordsW
unknown: 42
|
SnippetUniquedCrossHasAllQueryWordsM
unknown: 43
|
SnippetUniquedCrossHasAnyQueryWordsW
unknown: 44
|
SnippetUniquedCrossHasAnyQueryWordsM
unknown: 45
|
FirstUrlsClassDiff
unknown: 46
|
FirstUrlsAreTheSame
unknown: 47
|
DictBoostPossible
unknown: 48
|
MisspelledPart
unknown: 49
|
OrgBoostPossible
unknown: 50
|
OrgWeight
unknown: 51
|
Th3973Web
unknown: 52
|
SyqWeb
unknown: 53
|
SyqMsp
unknown: 54
|
Th3561Web
unknown: 55
|
OsWeb
unknown: 56
|
OsMsp
unknown: 57
|
WmaxobeWeb
unknown: 58
|
WmaxoneMsp
unknown: 59
|
Cm2Web
unknown: 60
|
QruWeb
unknown: 61
|
IsSiteWeb
unknown: 62
|
IsSiteMsp
unknown: 63
|
HasFioWeb
unknown: 64
|
HasFioMsp
unknown: 65
|
QrrWeb
unknown: 66
|
QradmWeb
unknown: 67
|
QradmMsp
unknown: 68
|
VmWeb
unknown: 69
|
VwMsp
unknown: 70
|
IsNavMxWeb
unknown: 71
|
IsNavMxMsp
unknown: 72
|
IlWeb
unknown: 73
|
IlMsp
unknown: 74
|
ForumWeb
unknown: 75
|
ForumMsp
unknown: 76
|
FsmlWeb
unknown: 77
|
FsmlMsp
unknown: 78
|
NavMxWeb
unknown: 79
|
NavMxMsp
unknown: 80
|
SoundexEng
unknown: 81
|
CommonMisspell
unknown: 82
|
PornoInWeb
unknown: 83
|
PornoInMsp
unknown: 84
|
BestUrlInQueryWeb
unknown: 85
|
UngroupWeb
unknown: 86
|
FirstMisspellPos
unknown: 87
|
SoundexTr
unknown: 88
|
HostSimilarity
unknown: 89
|
SnipIntersection
unknown: 90
|
PredictedPfoundWeb
unknown: 91
|
PredictedPfoundMsp
unknown: 92
|
HostSimilarity5
unknown: 93
|
SearchFactor1TrMspMax
unknown: 94
|
SearchFactor16TRhitwMspMax
unknown: 95
|
SearchFactor69TxtHeadExWebAcc
unknown: 96
|
SearchFactor100TextFeaturesDiffAcc
unknown: 97
|
SearchFactor563DBM25_2WebAcc
unknown: 98
|
SearchFactor294UrlDomainFractionDiffMax
unknown: 99
|
SearchFactor294UrlDomainFractionWebAcc
unknown: 100
|
SearchFactor724UrlDomainSimilarityFixedWebMax
unknown: 101
|
IsRu
unknown: 102
|
IsTr
unknown: 103
|
CyrillicCommonMsp
unknown: 104
|
AccumulatedRelevDiffFixed
unknown: 105
|
IsAutocorrectConfidence
unknown: 106
Is confidence level equal to 10000
|
IsBlendConfidence
unknown: 107
Is confidence level equal to 8000
|
SpellCheckerFeature_0
unknown: 108
|
SpellCheckerFeature_28
unknown: 109
|
SpellCheckerFeature_145
unknown: 110
|
SpellCheckerFeature_156
unknown: 111
|
SpellCheckerFeature_168
unknown: 112
|
SpellCheckerFeature_170
unknown: 113
|
SpellCheckerFeature_230
unknown: 114
|
SpellCheckerFeature_307
unknown: 115
|
SpellCheckerFeature_308
unknown: 116
|
SpellCheckerFeature_337
unknown: 117
|
SpellCheckerFeature_393
unknown: 118
|
SpellCheckerFeature_394
unknown: 119
|
SpellCheckerFeature_476
unknown: 120
|
SpellCheckerFeature_477
unknown: 121
|
SpellCheckerFeature_508
unknown: 122
|
SpellCheckerFeature_742
unknown: 123
|
SpellCheckerFeature_792
unknown: 124
|
SpellCheckerFeature_835
unknown: 125
|
SpellCheckerFeature_837
unknown: 126
|
SpellCheckerFeature_847
unknown: 127
|
SpellCheckerFeature_849
unknown: 128
|
SpellCheckerFeature_852
unknown: 129
|
SpellCheckerFeature_854
unknown: 130
|
SpellCheckerFeature_926
unknown: 131
|
SpellCheckerFeature_933
unknown: 132
|
SpellCheckerFeature_934
unknown: 133
|
SpellCheckerFeature_938
unknown: 134
|
SpellCheckerFeature_939
unknown: 135
|
SpellCheckerFeature_944
unknown: 136
|
SpellCheckerFeature_947
unknown: 137
|
SpellCheckerFeature_948
unknown: 138
|
SpellCheckerFeature_949
unknown: 139
|
SpellCheckerFeature_952
unknown: 140
|
SpellCheckerFeature_954
unknown: 141
|
SpellCheckerFeature_955
unknown: 142
|
SpellCheckerFeature_958
unknown: 143
|
SpellCheckerFeature_962
unknown: 144
|
SpellCheckerFeature_987
unknown: 145
|
SpellCheckerFeature_990
unknown: 146
|
SpellCheckerFeature_991
unknown: 147
|