What is an operative SEO Indicator System? What is an indicator system?
thumb_upLike (8)
commentReply (1)
thumb_up8 likes
comment
1 replies
M
Mason Rodriguez 1 minutes ago
What is an impression? What is a financial SEO indicator system?...
W
William Brown Member
access_time
14 minutes ago
Tuesday, 29 April 2025
What is an impression? What is a financial SEO indicator system?
thumb_upLike (10)
commentReply (1)
thumb_up10 likes
comment
1 replies
K
Kevin Wang 4 minutes ago
What does conversion mean? Ranking Distribution: One of the Most Important SEO Metrics What is the d...
A
Amelia Singh Moderator
access_time
24 minutes ago
Tuesday, 29 April 2025
What does conversion mean? Ranking Distribution: One of the Most Important SEO Metrics What is the dwell time or time on site?
thumb_upLike (41)
commentReply (1)
thumb_up41 likes
comment
1 replies
B
Brandon Kumar 2 minutes ago
Back to overviewThe inverse document frequency – IDF – counts how often a certain word o...
H
Harper Kim Member
access_time
18 minutes ago
Tuesday, 29 April 2025
Back to overviewThe inverse document frequency – IDF – counts how often a certain word occurs in a collection of documents. In this way, the uniqueness of a word within a document group can be calculated.ContentsContentsWhere does the inverse document frequency come from How does the IDF help me in evaluations Example 1 for IDFExample 2 on IDFExample 3 on IDFConclusionIDF as a counterpart to Term Frequency and Within Document Frequency
Inverse document frequency is a measure that is used in the field of Information Sciences to provide an indication of the number of documents in a document collection in which certain words occur.
thumb_upLike (7)
commentReply (2)
thumb_up7 likes
comment
2 replies
N
Natalie Lopez 3 minutes ago
The size of the document collection is determined beforehand.
Where does the inverse document fr...
E
Ethan Thomas 5 minutes ago
In her article, ‘A statistical interpretation of term specificity and its application in retrieval...
A
Alexander Wang Member
access_time
10 minutes ago
Tuesday, 29 April 2025
The size of the document collection is determined beforehand.
Where does the inverse document frequency come from
The foundation for the IDF value was laid as early as 1972 by the British computer scientist Karen Spärck Jones.
thumb_upLike (38)
commentReply (3)
thumb_up38 likes
comment
3 replies
E
Emma Wilson 10 minutes ago
In her article, ‘A statistical interpretation of term specificity and its application in retrieval...
C
Christopher Lee 3 minutes ago
How does the IDF help me in evaluations
The Inverse Document Frequency for a given word (I...
In her article, ‘A statistical interpretation of term specificity and its application in retrieval’, she was the first in her field to define how the incidence of a term/keyword can be calculated. The idea behind this method is elegant and easy to understand: a word from a query that occurs in very many documents is not a suitable discriminator and should therefore be weighted less heavily than a word that occurs in very few documents.
thumb_upLike (50)
commentReply (1)
thumb_up50 likes
comment
1 replies
N
Natalie Lopez 16 minutes ago
How does the IDF help me in evaluations
The Inverse Document Frequency for a given word (I...
M
Mia Anderson Member
access_time
60 minutes ago
Tuesday, 29 April 2025
How does the IDF help me in evaluations
The Inverse Document Frequency for a given word (IDFt) divides the number of documents in the document collection (ND) by the number of documents in the collection that contain the given word (ƒt):
IDFt = log10( ND / ƒt )The more documents there are in the collection that contain this word, the smaller the IDF value for a word becomes. This is a very good way of calculating stop words (commonly used words in any language), for example, as they occur in a large proportion of the documents.
thumb_upLike (47)
commentReply (2)
thumb_up47 likes
comment
2 replies
V
Victoria Lopez 41 minutes ago
Example 1 for IDF
An example would be a collection of 100 documents in which the word ‘th...
D
Dylan Patel 60 minutes ago
Example 3 on IDF
Last but not least, let us assume that the word ‘xylophone’ occurs in ...
N
Nathan Chen Member
access_time
52 minutes ago
Tuesday, 29 April 2025
Example 1 for IDF
An example would be a collection of 100 documents in which the word ‘the’ occurs in every document:
IDFt = log10( 100% of all documents in the corpus / 100% of the documents in the corpus that contain the particular word ) = log10(1) = 0. The word ‘the’ has no unique feature in this collection of documents.
Example 2 on IDF
In the same collection of 100 documents, the word “it” occurs in 50 documents:
IDFt = log10 ( 100% of all documents in the corpus / 50% of the documents in the corpus that contain the particular word ) = log10(2) = 0.3
Due to the nature of a logarithm, an occurrence in 50% of the possible cases is no longer 50% of the total uniqueness, as is the case with the value 1, but a value of 0.3.
thumb_upLike (44)
commentReply (0)
thumb_up44 likes
C
Charlotte Lee Member
access_time
28 minutes ago
Tuesday, 29 April 2025
Example 3 on IDF
Last but not least, let us assume that the word ‘xylophone’ occurs in exactly one document in the above corpus of documents:
IDFt = log10( 100% of all documents in the corpus / 1% of the documents in the corpus that contain the particular word ) = log10(100) = 2. The absolute uniqueness of a word within a document collection has a maximum value of 2, according to the above calculation. Source: https://commons.wikimedia.org/wiki/File:Plot_IDF_functions.png
Conclusion
The IDF can be used as an effective counterpart to other metrics that are used to measure the incidence of terms by asking the following questions: which words occur frequently in a single document but are relatively unique across all the documents that we look at?
thumb_upLike (17)
commentReply (0)
thumb_up17 likes
D
Dylan Patel Member
access_time
75 minutes ago
Tuesday, 29 April 2025
Which words occur in all documents and are therefore probably less interesting? This is the case if we are looking at either the pure keyword density (term frequency – TF) or a weighted value (Within Document Frequency – WDF).
thumb_upLike (13)
commentReply (1)
thumb_up13 likes
comment
1 replies
S
Sophie Martin 48 minutes ago
IDF as a counterpart to Term Frequency and Within Document Frequency
In both the TF*IDF and...
E
Ethan Thomas Member
access_time
16 minutes ago
Tuesday, 29 April 2025
IDF as a counterpart to Term Frequency and Within Document Frequency
In both the TF*IDF and WDF*IDF weighting evaluations, the IDF value has the function of giving a lower rating to words that occur in all documents. The more often a word occurs in a document, the higher the TF/WDF value; the more often a word occurs across all documents, the lower the IDF.
thumb_upLike (38)
commentReply (0)
thumb_up38 likes
C
Chloe Santos Moderator
access_time
51 minutes ago
Tuesday, 29 April 2025
Stop words, which occur in (almost) all documents thus lose importance, no matter how often they occur in a single document, since the IDF value for these approaches 0. From: SISTRIX Team Steve Paine 19.02.2021 SEO KPIs Can I visually compare the Visibility Index to other KPIs?
thumb_upLike (25)
commentReply (3)
thumb_up25 likes
comment
3 replies
G
Grace Liu 26 minutes ago
What is CPM - Cost Per Mille? What is Net Popularity? What is Link Popularity?...
J
James Smith 18 minutes ago
What is IP Popularity? What is Inverse Document Frequency - IDF? What is Domain Popularity?...
What is CTR - Click-Through-Rate? What is CPO - Cost per Order? What is CPA - Cost per Action?
thumb_upLike (35)
commentReply (1)
thumb_up35 likes
comment
1 replies
N
Nathan Chen 6 minutes ago
How to identify and use a SEO KPI, a performance indicator What is Bounce Rate? What is an operative...
S
Sofia Garcia Member
access_time
42 minutes ago
Tuesday, 29 April 2025
How to identify and use a SEO KPI, a performance indicator What is Bounce Rate? What is an operative SEO Indicator System?
thumb_upLike (29)
commentReply (0)
thumb_up29 likes
L
Lily Watson Moderator
access_time
22 minutes ago
Tuesday, 29 April 2025
What is an indicator system? What is an impression?
thumb_upLike (30)
commentReply (1)
thumb_up30 likes
comment
1 replies
J
James Smith 7 minutes ago
What is a financial SEO indicator system? What does conversion mean? Ranking Distribution: One of th...
I
Isabella Johnson Member
access_time
92 minutes ago
Tuesday, 29 April 2025
What is a financial SEO indicator system? What does conversion mean? Ranking Distribution: One of the Most Important SEO Metrics What is the dwell time or time on site?
thumb_upLike (47)
commentReply (2)
thumb_up47 likes
comment
2 replies
I
Isabella Johnson 36 minutes ago
Back to overview German English Spanish Italian French...
S
Sebastian Silva 41 minutes ago
What is Inverse Document Frequency - IDF? - SISTRIX Login Free trialSISTRIX BlogFree ToolsAsk SISTRI...
H
Hannah Kim Member
access_time
72 minutes ago
Tuesday, 29 April 2025
Back to overview German English Spanish Italian French
thumb_upLike (9)
commentReply (3)
thumb_up9 likes
comment
3 replies
O
Oliver Taylor 68 minutes ago
What is Inverse Document Frequency - IDF? - SISTRIX Login Free trialSISTRIX BlogFree ToolsAsk SISTRI...
C
Chloe Santos 27 minutes ago
What is CPM - Cost Per Mille? What is Net Popularity? What is Link Popularity?...