public class TikaEntityProcessor extends EntityProcessorBase
An implementation of EntityProcessor that reads data from rich documents using Apache Tika.
To index latitude/longitude data that might be extracted from a file's metadata, identify the geo field for this information with the spatialMetadataField attribute.
Fields inherited from class EntityProcessorBase: ABORT, cacheSupport, context, CONTINUE, entityName, isFirstInit, ON_ERROR, onError, query, rowIterator, SKIP, TRANSFORM_ROW, TRANSFORMER
Constructor and Description |
---|
TikaEntityProcessor() |
Modifier and Type | Method and Description |
---|---|
protected void | firstInit(Context context): First-time init call; do one-time operations here. An overriding method must call this method, otherwise an NPE is thrown when nextRow() accesses zipper. |
void | init(Context context): Called when processing of an entity starts. |
Map<String,Object> | nextRow(): For a simple implementation, this is the only method that a subclass needs to implement. |
Methods inherited from class EntityProcessorBase: destroy, getNext, initCache, nextDeletedRowKey, nextModifiedParentRowKey, nextModifiedRowKey
Methods inherited from class EntityProcessor: close, postTransform
public void init(Context context)
Description copied from class: EntityProcessor
Overrides: init in class EntityProcessorBase
Parameters: context - The current context
protected void firstInit(Context context)
Description copied from class: EntityProcessorBase
Overrides: firstInit in class EntityProcessorBase
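The method summary notes that an overriding firstInit must call the inherited version, or nextRow() later fails with an NPE. The following is a minimal, self-contained sketch of that pattern. It does not use the real Solr classes: BaseProcessor, GoodProcessor, BrokenProcessor, the helper field, and the Map-based context are all hypothetical stand-ins chosen so the example runs on its own.

```java
import java.util.HashMap;
import java.util.Map;

public class FirstInitSketch {

    /** Hypothetical stand-in for EntityProcessorBase; not the real Solr class. */
    public static class BaseProcessor {
        protected boolean isFirstInit = true;
        // Stand-in for internal state that the base class creates once.
        private StringBuilder helper;

        public void init(Map<String, String> context) {
            if (isFirstInit) {
                firstInit(context);
            }
        }

        protected void firstInit(Map<String, String> context) {
            helper = new StringBuilder(); // one-time setup
            isFirstInit = false;
        }

        public Map<String, Object> nextRow() {
            // Throws NullPointerException if this class's firstInit() never ran.
            helper.append("row");
            Map<String, Object> row = new HashMap<>();
            row.put("text", helper.toString());
            return row;
        }
    }

    /** Overrides firstInit and chains to the base implementation. */
    public static class GoodProcessor extends BaseProcessor {
        protected String format;

        @Override
        protected void firstInit(Map<String, String> context) {
            super.firstInit(context); // keep the base class's one-time setup
            format = context.getOrDefault("format", "text");
        }
    }

    /** Overrides firstInit but forgets the super call. */
    public static class BrokenProcessor extends BaseProcessor {
        @Override
        protected void firstInit(Map<String, String> context) {
            // super.firstInit(context) omitted: base state stays null
        }
    }

    /** Returns true if init() followed by nextRow() ends in an NPE. */
    public static boolean failsWithNpe(BaseProcessor p) {
        p.init(new HashMap<>());
        try {
            p.nextRow();
            return false;
        } catch (NullPointerException e) {
            return true;
        }
    }

    public static void main(String[] args) {
        System.out.println(failsWithNpe(new GoodProcessor()));   // prints false
        System.out.println(failsWithNpe(new BrokenProcessor())); // prints true
    }
}
```

In a real subclass the same rule would apply with the actual Context type: call super.firstInit(context) first, then perform the subclass's own one-time setup.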
public Map<String,Object> nextRow()
Description copied from class: EntityProcessorBase
Overrides: nextRow in class EntityProcessorBase
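As an illustration of how a caller might consume nextRow(), here is a small sketch that initializes a row source and pulls rows until it receives null. It assumes that null signals the end of rows and that a source yields one row per init() call. RowSource, SingleDocSource, and drain are hypothetical names invented for this example, not part of the Solr API.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class NextRowSketch {

    /** Hypothetical stand-in for the row-producing part of an entity processor. */
    public interface RowSource {
        void init();
        Map<String, Object> nextRow(); // null is assumed to mean "no more rows"
    }

    /** Emits exactly one row per init() call. */
    public static class SingleDocSource implements RowSource {
        private final String text;
        private boolean done;

        public SingleDocSource(String text) {
            this.text = text;
        }

        @Override
        public void init() {
            done = false; // allow one more row after each init
        }

        @Override
        public Map<String, Object> nextRow() {
            if (done) {
                return null;
            }
            Map<String, Object> row = new LinkedHashMap<>();
            row.put("text", text);
            done = true;
            return row;
        }
    }

    /** Calls init() once, then collects rows until nextRow() returns null. */
    public static List<Map<String, Object>> drain(RowSource source) {
        source.init();
        List<Map<String, Object>> rows = new ArrayList<>();
        for (Map<String, Object> row = source.nextRow(); row != null; row = source.nextRow()) {
            rows.add(row);
        }
        return rows;
    }

    public static void main(String[] args) {
        RowSource source = new SingleDocSource("hello");
        System.out.println(drain(source));        // prints [{text=hello}]
        System.out.println(drain(source).size()); // prints 1
    }
}
```

Because init() resets the done flag in this sketch, the same source can be drained again for the next entity.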
Copyright © 2000–2015 The Apache Software Foundation. All rights reserved.