nutrient/com.pspdfkit.document.library/PdfLibrary

PdfLibrary

class PdfLibrary(path: String, tokenizer: PdfLibrary.TokenizerType? = null)

PdfLibrary implements a SQLite-based full-text-search engine. You can register documents to be indexed in the background and then search for keywords within that collection. There can be multiple libraries, although usually one is enough for the common use case.

Parameters

path

Writable path to library database file.

tokenizer

The tokenizer to use, one of TokenizerType.PORTER or TokenizerType.UNICODE. This controls how the PdfLibrary matches queries to the content in the index. If null is passed, TokenizerType.PORTER will be used as the default tokenizer.

Constructors

PdfLibrary

constructor(path: String, tokenizer: PdfLibrary.TokenizerType? = null)

Types

Companion

object Companion

Companion object providing factory methods for PdfLibrary.

TokenizerType

enum TokenizerType : Enum<PdfLibrary.TokenizerType>

Enum specifying the indexing tokenizer to use for the search library.

Properties

dataSource

var dataSource: LibraryDataSource?

The library's data source. This object will be retained and used to provide documents for indexing. When set, the library can use updateIndexFromDataSource to automatically index documents provided by the data source.

indexedUIDs

val indexedUIDs: List<String>

Returns list of UIDs of documents currently indexed.

isIndexing

val isIndexing: Boolean

Indicates whether the indexing is in progress or not.

queuedUIDs

val queuedUIDs: List<String>

Returns list of UIDs of documents queued for indexing.

Functions

addLibraryIndexingListener

fun addLibraryIndexingListener(listener: LibraryIndexingListener)

Adds a LibraryIndexingListener to monitor document indexing status. If the listener has already been added previously, this method will be a no-op. Adding null is not allowed, and will result in an exception.

clearIndex

fun clearIndex()

Completely clears the index for this library.

enqueueDocuments

fun enqueueDocuments(documents: List<PdfDocument>, indexingOptions: IndexingOptions = IndexingOptions())

Queues an array of documents for indexing. Any documents already queued or fully indexed will be ignored.

enqueueDocumentSources

fun enqueueDocumentSources(documentSources: List<DocumentSource>, indexingOptions: IndexingOptions = IndexingOptions())

Queues an array of documents for indexing. Any documents already queued or fully indexed will be ignored. This call will avoid opening documents until they're indexed and it's thus significantly more memory friendly than enqueueDocuments.

enqueueDocumentSourcesWithMetadata

fun enqueueDocumentSourcesWithMetadata(documentSources: List<Pair<DocumentSource, ByteArray?>>, indexingOptions: IndexingOptions = IndexingOptions())

Queues an array of documents for indexing together with passed free-form metadata. This call will avoid opening documents until they're indexed and it's thus significantly more memory friendly than enqueueDocumentsWithMetadata.

enqueueDocumentsWithMetadata

fun enqueueDocumentsWithMetadata(documents: List<Pair<PdfDocument, ByteArray>>, indexingOptions: IndexingOptions = IndexingOptions())

Queues an array of documents for indexing together with passed free-form metadata. Metadata can be retrieved after indexing with getMetadataForUID method call.

getIndexStatusForUID

fun getIndexStatusForUID(uid: String): LibraryIndexStatus

Returns indexing status for a document with passed UID.

getLibraryObserverMappingSize

@VisibleForTesting

fun getLibraryObserverMappingSize(): Int

Returns the number of registered library indexing listeners. Intended for internal testing only.

getMetadataForUID

fun getMetadataForUID(uid: String): ByteArray?

Returns metadata appended to document with enqueueDocumentsWithMetadata call.

getNativeLibrary

@VisibleForTesting

fun getNativeLibrary(): ERROR CLASS: Symbol not found for NativeDocumentLibrary

Returns the underlying native document library instance. Intended for internal testing only.

getSaveReverseText

fun getSaveReverseText(): Boolean

Indicates whether saving the reverse text is enabled.

indexedDocumentSourceWithUid

suspend fun indexedDocumentSourceWithUid(uid: String): DocumentSource?

Retrieves a document source with the specified UID from the data source, if any. Using this method is preferred to directly interacting with the data source's methods.

removeDocuments

fun removeDocuments(documentUIDs: List<String>)

Invalidates index for documents.

removeLibraryIndexingListener

fun removeLibraryIndexingListener(listener: LibraryIndexingListener)

Removes a registered LibraryIndexingListener added with addLibraryIndexingListener. Upon calling this method the listener will no longer be notified of any changes. If the listener has not been added, this method will be a no-op. Adding null is not allowed, and will result in an exception.

fun search(searchString: String, options: QueryOptions? = null, resultListener: QueryResultListener)

Query the database for a match of searchString. Only direct matches, begins-with and ends-with matches are supported. Returns a map of document UIDs to set of pages matching inside that document.

setSaveReverseText

fun setSaveReverseText(saveReverseText: Boolean)

Will save a reversed copy of the original page text. If enabled the index database will be about 2x bigger, but ends-with matches will be enabled.

size

fun size(): Int

Returns number of indexed documents in this library.

stopSearch

fun stopSearch()

Stops search and all in-progress preview text generator tasks.

updateIndexFromDataSource

suspend fun updateIndexFromDataSource(indexingOptions: IndexingOptions = IndexingOptions())

Updates the index based on information provided by the data source. If there is no data source set, this method will throw an IllegalStateException. Any currently queued documents will be removed.

PdfLibrary

Parameters

See also

Constructors

Types

Properties

Functions