Tika

Reads in files from a given path and parses them with Apache Tika

Type

dataset

Class

fire.nodes.dataset.NodeDatasetTika

Fields

Name Title Description
path Path Path of the file/directory
fileNameCol File Name Column File Name Column in the Output DataFrame
contentCol Content Column Tika output Column in the Output DataFrame