OpenNLPNameFinder

This node finds named entities in text using Apache OpenNLP. It requires a pre-trained OpenNLP name finder model. Models can be downloaded from http://opennlp.sourceforge.net/models-1.5/

Input

It takes in a DataFrame as input.

Output

It extracts the names from the specified column and stores the result in the specified output column.

Type

transform

Class

fire.nodes.opennlp.NodeOpenNLPNameFinder

Fields

Name       Title              Description
model      Model              Path to the model file (on HDFS when running on the cluster)
inputCol   Input Text Column  Input column name
outputCol  Output Column      Output column containing the results

Details

This node runs the OpenNLP Name Finder, which detects named entities and numbers in text.

To detect entities, the Name Finder needs a model. Each model depends on the language and the entity type it was trained for; for example, en-ner-person.bin is an English model for person names.

For details, see the OpenNLP manual: https://opennlp.apache.org/documentation/1.6.0/manual/opennlp.html#tools.namefind.recognition.cmdline
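The OpenNLP name finder reports each detected entity as a span of token indices (start inclusive, end exclusive) together with an entity type. The sketch below illustrates how such spans map back to name strings. It is not the OpenNLP API and not this node's implementation: TokenSpan and NameSpanSketch are hypothetical stand-ins written with the Java standard library only, and the assumption that the output column holds strings joined this way is ours.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-in for a name finder span: token indices, end exclusive.
final class TokenSpan {
    final int start;   // index of the first token in the entity
    final int end;     // index one past the last token in the entity
    final String type; // entity type the model was trained for, e.g. "person"

    TokenSpan(int start, int end, String type) {
        this.start = start;
        this.end = end;
        this.type = type;
    }
}

final class NameSpanSketch {
    // Join the tokens covered by each span with single spaces.
    static List<String> spansToStrings(List<TokenSpan> spans, String[] tokens) {
        List<String> names = new ArrayList<>();
        for (TokenSpan span : spans) {
            String[] covered = Arrays.copyOfRange(tokens, span.start, span.end);
            names.add(String.join(" ", covered));
        }
        return names;
    }

    public static void main(String[] args) {
        // Tokenized input text; a real pipeline would tokenize the input column.
        String[] tokens = {"Pierre", "Vinken", "will", "join", "the", "board", "."};
        // Span a person-name model might report for this sentence (assumed).
        List<TokenSpan> spans = Arrays.asList(new TokenSpan(0, 2, "person"));
        System.out.println(spansToStrings(spans, tokens));
    }
}
```

In OpenNLP itself, the corresponding pieces are the spans returned by the name finder and a helper that converts spans to strings; the sketch only mirrors the index arithmetic.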

The OpenNLP project offers a number of pre-trained name finder models, trained on various freely available corpora. They can be downloaded from the OpenNLP models page:

http://opennlp.sourceforge.net/models-1.5/