Interface IStringMeasure
-
- All Superinterfaces:
IMeasure
- All Known Subinterfaces:
ITrieFilterableStringMeasure
- All Known Implementing Classes:
Caverphone1Measure,Caverphone2Measure,CosineMeasure,DaitchMokotoffSoundexMeasure,DoubleMetaphoneMeasure,DoubleMetaphoneMeasure,ExactMatchMeasure,JaccardMeasure,JaroMeasure,JaroWinklerMeasure,KoelnPhoneticMeasure,KoelnPhoneticMeasure,LessThanMeasure,LevenshteinMeasure,MatchRatingApproachEncoderMeasure,MetaphoneMeasure,MongeElkanMeasure,NysiisMeasure,OverlapMeasure,QGramSimilarityMeasure,RatcliffObershelpMeasure,RefinedSoundexMeasure,SoundexMeasure,StringMeasure,TrigramMeasure
public interface IStringMeasure extends IMeasure
- Author:
- Axel-C. Ngonga Ngomo (ngonga@informatik.uni-leipzig.de)
-
-
Method Summary
All Methods Instance Methods Abstract Methods Modifier and Type Method Description booleancomputableViaOverlap()Returns true if this similarity function can be computed just via the getSimilarity(overlag, lengthA, lengthB)intgetAlpha(int xTokensNumber, int yTokensNumber, double threshold)Threshold for the positional filteringintgetMidLength(int tokensNumber, double threshold)Theshold for the length of the tokens to be indexedintgetPrefixLength(int tokensNumber, double threshold)Length of prefix to consider when mapping the input string with other strings.doublegetSimilarity(int overlap, int lengthA, int lengthB)Returns the similarity of two strings given their length and the overlap.doublegetSizeFilteringThreshold(int tokensNumber, double threshold)-
Methods inherited from interface org.aksw.limes.core.measures.measure.IMeasure
getName, getRuntimeApproximation, getSimilarity, getSimilarity, getType
-
-
-
-
Method Detail
-
getPrefixLength
int getPrefixLength(int tokensNumber, double threshold)Length of prefix to consider when mapping the input string with other strings.- Parameters:
tokensNumber- Size of input string inthreshold- Similarity threshold- Returns:
- Prefix length
-
getMidLength
int getMidLength(int tokensNumber, double threshold)Theshold for the length of the tokens to be indexed- Parameters:
tokensNumber- Number of tokens of current inputthreshold- Similarity threshold- Returns:
- Length of tokens to be indexed
-
getSizeFilteringThreshold
double getSizeFilteringThreshold(int tokensNumber, double threshold)
-
getAlpha
int getAlpha(int xTokensNumber, int yTokensNumber, double threshold)Threshold for the positional filtering- Parameters:
xTokensNumber- Size of the first input stringyTokensNumber- Size of the first input stringthreshold- Similarity threshold- Returns:
- Threshold for positional filtering
-
getSimilarity
double getSimilarity(int overlap, int lengthA, int lengthB)Returns the similarity of two strings given their length and the overlap. Useful when these values are known so that no computation of known values have to be carried out anew- Parameters:
overlap- Overlap of strings A and BlengthA- Length of AlengthB- Length of B- Returns:
- Similarity of A and B
-
computableViaOverlap
boolean computableViaOverlap()
Returns true if this similarity function can be computed just via the getSimilarity(overlag, lengthA, lengthB)- Returns:
- True if it's possible, else false;
-
-