Sparkflows
latest
Architecture & Deployment
Installation
Configuration
Authentication
Security
Operating Guide
Quickstart Guide
User Guide
Analytical Apps User Guide
Machine Learning User Guide
Time Series Analysis
Tutorials
Troubleshooting
FAQ
Administration Guide
Databricks Guide
AWS Guide
AZURE Guide
Load Balancer
Superset
Python Integration
Performance Tuning
Developer Guide
Processors
16-Utilities
09-DataProfiling
05-FeatureEngineering
01-IO
11-ML-SparkML
ML-TS
02-Parse
FieldSplitter
RegexTokenizer
Fixed Length Fields
ApacheLogs
ParseJSONCol
OCR
MultiRegexExtractor
06-Filter
18-OpenNLP
15-ScoreCardPy
03-Prepare
04-DataValidation
CustomProcessors
17-Documentation
12-ML-H2O
13-ML-AWSSagemaker
14-ML-Sklearn
08-Group
06-Code
10-Visualization
19-Deprecated
15-Streaming
15-StructuredStreaming
14-DL
07-JoinUnion
Release Notes
REST API Authentication
REST API Examples using Python
REST API Examples using Java
REST API Examples using curl
Third Party Acknowledgements
Sparkflows
Docs
»
Processors
»
02-Parse
Edit on GitHub
02-Parse
ΒΆ
FieldSplitter
Input
Output
Type
Class
Fields
RegexTokenizer
Type
Class
Fields
Fixed Length Fields
Type
Class
Fields
ApacheLogs
Type
Class
Fields
ParseJSONCol
Type
Class
Fields
OCR
Type
Class
Fields
MultiRegexExtractor
Input
Output
Type
Class
Fields