Read PDF File

This workflow reads a PDF file from the given location, parses its content, creates a DataFrame from it, and prints the results.

Workflow

The workflow below shows how to:

  • Read a PDF file from the given location and create a DataFrame from it
  • Print the result
[Image: ReadandParse workflow]

Reading And Parsing PDF File

The DatasetPDF processor uses the given location to download the PDF file, parses its content into a string, and creates a DataFrame from the result.
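
For readers who want to see the same idea outside the workflow editor, here is a minimal Python sketch of the read, parse, and create-DataFrame steps. It is not the DatasetPDF processor's implementation: it assumes a local file path, uses the third-party pypdf package to extract text, and uses a pandas DataFrame as a stand-in for the workflow's DataFrame. The function names and the `path` and `content` column names are hypothetical.

```python
def extract_pdf_text(path):
    """Return the text of every page in the PDF at `path` as one string."""
    # pypdf is a third-party package (assumed installed). It is imported
    # inside the function so make_row() below works without it.
    from pypdf import PdfReader

    reader = PdfReader(str(path))
    # Fall back to "" for pages that yield no extractable text.
    return "\n".join(page.extract_text() or "" for page in reader.pages)


def make_row(path, content):
    """Build one record: the source location plus the parsed text.

    The column names are illustrative, not the processor's actual schema.
    """
    return {"path": str(path), "content": content}


def pdf_to_dataframe(path):
    """Parse the PDF and wrap the result in a one-row pandas DataFrame."""
    # pandas is a third-party package (assumed installed); it stands in
    # for the DataFrame type the workflow produces.
    import pandas as pd

    return pd.DataFrame([make_row(path, extract_pdf_text(path))])
```

Calling `print(pdf_to_dataframe("report.pdf"))`, where `report.pdf` is a placeholder file name, would print a single row holding the file location and its extracted text, which mirrors the final print step of the workflow.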

Processor Configuration

[Image: ReadandParse processor configuration]

Processor Output

[Image: ReadandParse processor output]

Prints the Results

This step prints the resulting DataFrame to the screen.