Available Processors
strip
Remove leading and trailing whitespace
replace
Replace substring in strings
regex
Extract substring using pattern
cast
Convert to specified type
join
Join list values to string
default
Return fallback value if empty
lowercase
Convert strings to lowercase
parse_datetime
Parse datetime to ISO format
Processor Reference
1. strip
Remove leading and trailing whitespace from strings. Parameters: None Example:2. replace
Replace substring in strings. Parameters:old(required): Substring to replacenew(required): Replacement string
3. regex
Extract substring using regular expression pattern. Parameters:pattern(required): Regex pattern to matchgroup(optional): Capture group to extract (default: 1)
4. cast
Convert value to specified type. Parameters:to(required): Target type -"int","float","bool", or"str"
true,1,yes,on→ True- Everything else → False
5. join
Join list values into a single string. Parameters:separator(optional): String to join with (default:" ")
6. default
Return default value if input is None, empty string, or empty list. Parameters:default(required): Fallback value
7. lowercase
Convert strings to lowercase. Parameters: None Example:8. parse_datetime
Parse datetime string into ISO format. Parameters:format(optional): strptime format string (if None, uses dateutil parser for flexible parsing)
Processor Chaining
Processors run sequentially. Output of one becomes input to the next.Example 1: Clean and Convert Price
Example 2: Extract Rating Number
Example 3: Normalize Text
" In Stock "Output:
"in_stock"
Example 4: Handle Missing Values
Common Patterns
Extracting Currency Values
"$1,299.99", "Price: $99", " $42.50 "
Extracting Numbers from Text
"23 items", "Quantity: 5", "42"
Boolean Fields
True if “in stock” or “available”, else False
Date Fields
Lists to Comma-Separated String
["Python", "Web Scraping", "Automation"]Output:
"Python, Web Scraping, Automation"
Complete Examples
E-commerce Product
Job Listing
Real Estate Listing
Error Handling
Processors handle errors gracefully:- Graceful Failures
- Chain Behavior
- strip, replace, lowercase, join: Return original value if not applicable type
- regex: Returns original value if pattern doesn’t match
- cast: Returns None if conversion fails
- parse_datetime: Returns None if parsing fails
- Unknown processor type: Skipped, logs warning
Best Practices
Troubleshooting
Processor Returns None
Validate input type
Some processors only work on specific types:
regex: strings onlyjoin: lists onlyparse_datetime: strings only