This release adds a new S3 operator and fixes a bug in the fork operator.
Download the release on GitHub.
Features
from_s3 operator
The from_s3 operator reads files from Amazon S3 with support for glob patterns, automatic format detection, and file monitoring.
from_s3 "s3://my-bucket/data/**.json"
The operator supports multiple authentication methods including default AWS credentials, explicit access keys, IAM role assumption, and anonymous access for public buckets:
from_s3 "s3://my-bucket/data.csv", access_key=secret("AWS_ACCESS_KEY"), secret_key=secret("AWS_SECRET_KEY")
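As a rough sketch of the other authentication methods mentioned above, the two pipelines below show what anonymous access and IAM role assumption might look like. The option names `anonymous` and `role`, the bucket names, and the role ARN are assumptions for illustration and are not confirmed by these release notes; check the operator reference for the exact spelling.

```tql
// Pipeline 1: anonymous access to a public bucket.
// `anonymous` is an assumed option name; the bucket is a placeholder.
from_s3 "s3://public-bucket/data.csv", anonymous=true

// Pipeline 2 (separate pipeline): assume an IAM role before reading.
// `role` is an assumed option name; the ARN is a placeholder.
from_s3 "s3://my-bucket/data.csv", role="arn:aws:iam::123456789012:role/example-role"
```

When no authentication options are given, the operator falls back to the default AWS credentials, as described above.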
For S3-compatible services, specify custom endpoints via URL parameters:
from_s3 "s3://my-bucket/data/**.json?endpoint_override=minio.example.com:9000&scheme=http"
Additional features include file watching for continuous ingestion, automatic file removal or renaming after processing, and path field injection to track source files in events.
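A minimal sketch of how these features might be combined in one pipeline follows. The option names `watch`, `remove`, and `path_field`, and the field name `source_file`, are assumptions for illustration; these release notes do not spell them out, so consult the operator reference before relying on them.

```tql
// Continuously ingest new JSON files, delete each file after it has been
// processed, and record the originating object path in every event.
// All three option names are assumed, as is the `source_file` field name.
from_s3 "s3://my-bucket/data/**.json", watch=true, remove=true, path_field=source_file
```

Removing files after processing is destructive, so a cautious setup would try the pipeline without the removal option first and confirm that events arrive as expected.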
Bug Fixes
Fix fork operator stopping after initial events
We fixed a bug where the fork operator would stop processing after handling only the first few events, causing data loss in downstream pipeline stages.