public interface UriCollector extends SinkBase
... getUris(CrawleableUri) method.

| Modifier and Type | Method and Description |
|---|---|
| void | addNewUri(CrawleableUri uri, CrawleableUri newUri) Adds the given new URI to the list of URIs collected for the given URI. |
| default void | addNewUri(CrawleableUri uri, org.apache.jena.graph.Node newUri) Adds the given new URI to the list of URIs collected for the given URI. |
| default void | addNewUri(CrawleableUri uri, String newUri) Adds the given new URI to the list of URIs collected for the given URI. |
| default void | addTriple(CrawleableUri uri, org.apache.jena.graph.Triple triple) Adds the given triple to the list of URIs collected from the given URI. |
| long | getSize(CrawleableUri uri) Returns the total number of URIs that have been collected. |
| Iterator<byte[]> | getUris(CrawleableUri uri) Returns an iterator over the serialized CrawleableUri instances that have been collected for the given URI. |
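
The relationship between the abstract addNewUri method, a convenience overload, and the byte[]-based getUris iterator can be illustrated with a small, self-contained sketch. It does not use the real library types: SimpleUri, SimpleUriCollector, InMemoryCollector and drain are hypothetical names introduced only for this example, the Node and Triple overloads are left out, and the UTF-8 string encoding merely stands in for whatever serialization an actual implementation uses. That the default overload delegates to the abstract method is likewise an assumption.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.Iterator;
import java.util.List;
import java.util.Map;

public class UriCollectorSketch {

    /** Hypothetical stand-in for CrawleableUri; the real class carries more data. */
    static final class SimpleUri {
        final String value;

        SimpleUri(String value) {
            this.value = value;
        }
    }

    /** Simplified stand-in for the collector interface (not the library's own type). */
    interface SimpleUriCollector {
        // The only method an implementation has to provide for adding URIs.
        void addNewUri(SimpleUri uri, SimpleUri newUri);

        // Assumption: a default overload converts its argument and delegates.
        default void addNewUri(SimpleUri uri, String newUri) {
            addNewUri(uri, new SimpleUri(newUri));
        }

        long getSize(SimpleUri uri);

        Iterator<byte[]> getUris(SimpleUri uri);
    }

    /** In-memory implementation that stores every collected URI as UTF-8 bytes. */
    static final class InMemoryCollector implements SimpleUriCollector {
        private final Map<String, List<byte[]>> collected = new HashMap<>();

        @Override
        public void addNewUri(SimpleUri uri, SimpleUri newUri) {
            collected.computeIfAbsent(uri.value, k -> new ArrayList<>())
                    .add(newUri.value.getBytes(StandardCharsets.UTF_8));
        }

        @Override
        public long getSize(SimpleUri uri) {
            return collected.getOrDefault(uri.value, Collections.emptyList()).size();
        }

        @Override
        public Iterator<byte[]> getUris(SimpleUri uri) {
            return collected.getOrDefault(uri.value, Collections.emptyList()).iterator();
        }
    }

    /** Decodes every serialized URI that was collected for the given source URI. */
    static List<String> drain(SimpleUriCollector collector, SimpleUri source) {
        List<String> result = new ArrayList<>();
        Iterator<byte[]> it = collector.getUris(source);
        while (it.hasNext()) {
            result.add(new String(it.next(), StandardCharsets.UTF_8));
        }
        return result;
    }

    public static void main(String[] args) {
        InMemoryCollector collector = new InMemoryCollector();
        SimpleUri source = new SimpleUri("http://example.org/a");

        collector.addNewUri(source, new SimpleUri("http://example.org/b"));
        collector.addNewUri(source, "http://example.org/c"); // goes through the default overload

        System.out.println(collector.getSize(source));
        System.out.println(drain(collector, source));
    }
}
```

Keeping a single abstract add method means an implementation only has to decide once how a collected URI is stored. In this sketch every overload ends up in the same per-source list.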
Methods inherited from interface SinkBase: closeSinkForUri, openSinkForUri

default void addTriple(CrawleableUri uri, org.apache.jena.graph.Triple triple)
Adds the given triple to the list of URIs collected from the given URI. ... addNewUri(CrawleableUri, CrawleableUri) method instead since this enables the addition of metadata to the collected URI.
Parameters:
uri - The URI from which the given triple has been collected.
triple - The triple that has been collected.

void addNewUri(CrawleableUri uri, CrawleableUri newUri)
Adds the given new URI to the list of URIs collected for the given URI.
Parameters:
uri - The URI from which the given new URI has been collected.
newUri - The new URI that has been collected.

default void addNewUri(CrawleableUri uri, org.apache.jena.graph.Node newUri)
Adds the given new URI to the list of URIs collected for the given URI.
Parameters:
uri - The URI from which the given new URI has been collected.
newUri - The new URI that has been collected.

default void addNewUri(CrawleableUri uri, String newUri)
Adds the given new URI to the list of URIs collected for the given URI.
Parameters:
uri - The URI from which the given new URI has been collected.
newUri - The new URI that has been collected.

Iterator<byte[]> getUris(CrawleableUri uri)
Returns a list of serialized CrawleableUri instances that have been collected for the given URI.
Parameters:
uri - The URI from which the returned serialized URIs have been collected.
Returns:
Iterator that iterates over the already serialized URIs that have been collected for the given URI.

long getSize(CrawleableUri uri)
Returns the total number of URIs that have been collected.
Parameters:
uri - The URI from which the returned serialized URIs have been collected.

Copyright © 2017–2019. All rights reserved.