Knowledge Subject Similarity

Concept Similarity — CSPR

Knowledge Subject Similarity

In the following we analyze, how the similarity of knowledge subjects can be determined by measuring the distances within the Conceptual Spaces (CS) spanned by several quality and quantity dimensions. As an example we want to compare four people in terms of the features sets height and age. This could also be the same person at different stages of life. Each feature set FSi represents the height which the person has at the respective age and the position of the knowledge subject Σi in the conceptual space CSPC(.Age, .Height_cm):
  • FS1 = {(.Age, 20),  (.Height_cm, 170)}
  • FS2 = {(.Age, 17),  (.Height_cm, 165)}
  • FS3 = {(.Age, 12),  (.Height_cm, 130)}
  • FS4 = {(.Age, 7),  (.Height_cm, 95)}
Based on the Knowledge Subject Distance definition KSD the abolute value of the similarity of the four persons is computed as:
  • 122 = (20-17)2+(170-165)2= 32+52 = 9+25 = 34; ⋈1,2 = 5.83
  • 232 = (17-12)2+(165-130)2= 52+352 = 25+1225 = 1250; ⋈2,3 = 35.36
  • 342 = (12-7)2+(130-95)2=52+352 = 25+1225 = 1250; ⋈3,4 = 35.36
This means concretely that, for example, the distance between persons 2 and 3 is identical to the distance between persons 3 and 4.

The measure of relative similarity can be between 0% and 100% and is computed as Knowledge Subject Distance Ratio according to definition KSDR with:

  • dr1,12 = (20/20)2+(170/170)2=12+12= 2; ⋈%1,1 = √(2/2)=1 ≘ 100%
  • dr1,22 = (17/20)2+(165/170)2 = 0.852 + 0.972 = 0.7225 + 0.942 = 1.6645; ⋈%1,2 = √(1.6635 / 2 ) = √(0.83225) = 0.912 ≘ 91.2%
  • dr2,32 = (12/17)2+(130/165)2 = 0.852+0.78782 = 0.7225 + 0.62075 = 1.34325; ⋈%2,3 = √(1.34325/2) = √(0.671625) = 0.8195 ≘ 81.95%
  • dr3,42 = (7/12)2 + (95/130)2 = 0.5832+0.7312 = 0.33988+0.5340 = 0.86388; ⋈%3,4 = √(0.86388/2) = √(0.43694) = 0.6610 ≘ 66.10%
Of course, the similarity of the same persons like ⋈%1,1 must be 100%. Since the relative similarity measure normalizes to the scale from 0% to 100%, the ranking is much easier to understand by the user. Based on the example, it is therefore easy to see that with advancing age, growth slows down or even stagnates, leading to higher similarities as ⋈%1,2 ≘ 91.2% between individuals 1 and 2.

Extension: deriver.app

Source: taoke.de — CSPR Knowledge Subject Similarity.