Packages

implicit class StatsCriteria extends Logging

Linear Supertypes
Logging, AnyRef, Any
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. StatsCriteria
  2. Logging
  3. AnyRef
  4. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Instance Constructors

  1. new StatsCriteria(triples: DataSet[Triple])

Value Members

  1. final def !=(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  4. final def asInstanceOf[T0]: T0
    Definition Classes
    Any
  5. def clone(): AnyRef
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... ) @native()
  6. val env: ExecutionEnvironment
  7. final def eq(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  8. def equals(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  9. def finalize(): Unit
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  10. final def getClass(): Class[_]
    Definition Classes
    AnyRef → Any
    Annotations
    @native()
  11. def hashCode(): Int
    Definition Classes
    AnyRef → Any
    Annotations
    @native()
  12. final def isInstanceOf[T0]: Boolean
    Definition Classes
    Any
  13. val logger: Logger
    Attributes
    protected
    Definition Classes
    Logging
  14. final def ne(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  15. final def notify(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native()
  16. final def notifyAll(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native()
  17. def stats: DataSet[String]

    Compute distributed RDF dataset statistics.

    Compute distributed RDF dataset statistics.

    returns

    VoID description of the given dataset

  18. def statsBlanksAsObject(): DataSet[Triple]

    19.

    19. Blanks as object criterion

    returns

    number of triples where blanknodes are used as objects.

  19. def statsBlanksAsSubject(): DataSet[Triple]

    18.

    18. Blanks as subject criterion

    returns

    number of triples where blanknodes are used as subjects.

  20. def statsClassUsageCount(): DataSet[(Node, Int)]

    2. Class Usage Count Criterion
    Count the usage of respective classes of a datase, the filter rule that is used to analyze a triple is the same as in the first criterion.

    2. Class Usage Count Criterion
    Count the usage of respective classes of a datase, the filter rule that is used to analyze a triple is the same as in the first criterion. As an action a map is being created having class IRIs as identifier and its respective usage count as value. If a triple is conform to the filter rule the respective value will be increased by one. Filter rule : ?p=rdf:type && isIRI(?o) Action : M[?o]++

    returns

    DataSet of classes used in the dataset and their frequencies.

  21. def statsClassesDefined(): DataSet[Node]

    3. Classes Defined Criterion
    Gets a set of classes that are defined within a dataset this criterion is being used.

    3. Classes Defined Criterion
    Gets a set of classes that are defined within a dataset this criterion is being used. Usually in RDF/S and OWL a class can be defined by a triple using the predicate rdf:type and either rdfs:Class or owl:Class as object. The filter rule illustrates the condition used to analyze the triple. If the triple is accepted by the rule, the IRI used as subject is added to the set of classes. Filter rule : ?p=rdf:type && isIRI(?s) &&(?o=rdfs:Class||?o=owl:Class) Action : S += ?s

    returns

    DataSet of classes defined in the dataset.

  22. def statsDataTypes(): DataSet[(String, Int)]

    20.

    20. Datatypes criterion

    returns

    histogram of types used for literals.

  23. def statsDistinctEntities(): DataSet[Triple]

    16. Distinct entities
    Count distinct entities of a dataset by filtering out all IRIs.

    16. Distinct entities
    Count distinct entities of a dataset by filtering out all IRIs. Filter rule : S+=iris({?s,?p,?o}) Action : S

    returns

    DataSet of distinct entities in the dataset.

  24. def statsDistinctObjects(): DataSet[Triple]

    Distinct Objects
    Count distinct objects within triples.

    Distinct Objects
    Count distinct objects within triples. Filter rule : isURI(?o) Action : M[?o]++

    returns

    DataSet of objects used in the dataset.

  25. def statsDistinctSubjects(): DataSet[Triple]

    Distinct Subjects
    Count distinct subject within triples.

    Distinct Subjects
    Count distinct subject within triples. Filter rule : isURI(?s) Action : M[?s]++

    returns

    DataSet of subjects used in the dataset.

  26. def statsLabeledSubjects(): DataSet[Node]

    24.

    24. Labeled subjects criterion.

    returns

    list of labeled subjects.

  27. def statsLanguages(): DataSet[(String, Int)]

    21.

    21. Languages criterion

    returns

    histogram of languages used for literals.

  28. def statsLinks(): DataSet[(String, String, Int)]

    26.

    26. Links criterion.

    Computes the frequencies of links between entities of different namespaces. This measure is directed, i.e. a link from ns1 -> ns2 is different from ns2 -> ns1.

    returns

    list of namespace combinations and their frequencies.

  29. def statsLiterals(): DataSet[Triple]

    * 17.

    * 17. Literals criterion

    returns

    number of triples that are referencing literals to subjects.

  30. def statsObjectVocabularies(): AggregateDataSet[(String, Int)]

    32. Object vocabularies
    Compute object vocabularies/namespaces used through the dataset.

    32. Object vocabularies
    Compute object vocabularies/namespaces used through the dataset. Filter rule : ns=ns(?o) Action : M[ns]++

    returns

    DataSet of distinct object vocabularies used in the dataset and their frequencies.

  31. def statsPredicateVocabularies(): AggregateDataSet[(String, Int)]

    31. Predicate vocabularies
    Compute predicate vocabularies/namespaces used through the dataset.

    31. Predicate vocabularies
    Compute predicate vocabularies/namespaces used through the dataset. Filter rule : ns=ns(?p) Action : M[ns]++

    returns

    DataSet of distinct predicate vocabularies used in the dataset and their frequencies.

  32. def statsPropertiesDefined(): DataSet[Node]

    Properties Defined
    Count the defined properties within triples.

    Properties Defined
    Count the defined properties within triples. Filter rule : ?p=rdf:type && (?o=owl:ObjectProperty || ?o=rdf:Property)&& !isIRI(?s) Action : M[?p]++

    returns

    DataSet of predicates defined in the dataset.

  33. def statsPropertyUsage(): DataSet[(Node, Int)]

    5. Property Usage Criterion
    Count the usage of properties within triples.

    5. Property Usage Criterion
    Count the usage of properties within triples. Therefore an DataSet will be created containing all property IRI's as identifier. Afterwards, their frequencies will be computed. Filter rule : none Action : M[?p]++

    returns

    DataSet of predicates used in the dataset and their frequencies.

  34. def statsSameAs(): DataSet[Triple]

    25.

    25. SameAs criterion.

    returns

    list of triples with owl#sameAs as predicate

  35. def statsSubjectVocabularies(): AggregateDataSet[(String, Int)]

    30. Subject vocabularies
    Compute subject vocabularies/namespaces used through the dataset.

    30. Subject vocabularies
    Compute subject vocabularies/namespaces used through the dataset. Filter rule : ns=ns(?s) Action : M[ns]++

    returns

    DataSet of distinct subject vocabularies used in the dataset and their frequencies.

  36. def statsTypedSubjects(): DataSet[Node]

    24.

    24. Typed subjects criterion.

    returns

    list of typed subjects.

  37. def statsUsedClasses(): DataSet[Triple]

    1. Used Classes Criterion
    Creates an DataSet of classes are in use by instances of the analyzed dataset.

    1. Used Classes Criterion
    Creates an DataSet of classes are in use by instances of the analyzed dataset. As an example of such a triple that will be accepted by the filter is sda:Gezim rdf:type distLODStats:Developer. Filter rule : ?p=rdf:type && isIRI(?o) Action : S += ?o

    returns

    DataSet of classes/instances

  38. final def synchronized[T0](arg0: ⇒ T0): T0
    Definition Classes
    AnyRef
  39. def toString(): String
    Definition Classes
    AnyRef → Any
  40. final def wait(): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  41. final def wait(arg0: Long, arg1: Int): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  42. final def wait(arg0: Long): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... ) @native()

Inherited from Logging

Inherited from AnyRef

Inherited from Any

Ungrouped