argo-dataflow

Go module, v0.10.3 (latest). Published: Jul 25, 2022. License: Apache-2.0.

Repository

github.com/argoproj-labs/argo-dataflow

README

Dataflow

Summary

Dataflow is a Kubernetes-native platform for executing large parallel data-processing pipelines.

Each pipeline is specified as a Kubernetes custom resource consisting of one or more steps. Each step reads messages from sources and writes messages to sinks such as Kafka, NATS Streaming, or HTTP services.
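
The step, source, and sink relationship can be pictured with a small conceptual model. This is only a sketch in plain Python: the `Step` class and `run_pipeline` function are hypothetical names invented for illustration, are not part of Dataflow or its Python DSL, and an in-memory list stands in for a real topic.

```python
from dataclasses import dataclass, field
from typing import Callable, Iterable, List


@dataclass
class Step:
    """Hypothetical stand-in for one pipeline step (not a Dataflow type)."""
    name: str
    source: Iterable[str]                          # where messages come from
    process: Callable[[str], str]                  # what the step does to each message
    sink: List[str] = field(default_factory=list)  # where results are written

    def run(self) -> None:
        for message in self.source:
            self.sink.append(self.process(message))


def run_pipeline(steps: List[Step]) -> None:
    """Run each step in order; a real pipeline would run steps concurrently."""
    for step in steps:
        step.run()


# Two steps chained through an in-memory list that plays the role of a topic.
topic: List[str] = []
upper = Step("upper", source=["a", "b"], process=str.upper, sink=topic)
exclaim = Step("exclaim", source=topic, process=lambda m: m + "!")
run_pipeline([upper, exclaim])
print(exclaim.sink)  # ['A!', 'B!']
```

The first step's sink is the second step's source, which is the same chaining idea a pipeline expresses when one step writes to a topic that another step reads.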

Each step runs zero or more pods, and can scale horizontally using HPA or, based on queue length, using built-in scaling rules. Steps can be scaled to zero, in which case they periodically and briefly scale to one to measure queue length so that they can scale back up.
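
Queue-length scaling with a scale-to-zero "peek" can be sketched as follows. This is a toy model under stated assumptions, not Dataflow's actual scaling rules: the function names, the 1000-messages-per-replica target, and the cap of 10 replicas are all hypothetical values chosen for illustration.

```python
import math
from typing import Callable, List


def desired_replicas(pending: int, per_replica: int = 1000, max_replicas: int = 10) -> int:
    """Replicas needed so each handles at most `per_replica` pending messages.

    `per_replica` and `max_replicas` are assumed values, not Dataflow defaults.
    """
    if pending <= 0:
        return 0  # nothing queued: scale to zero
    return min(max_replicas, math.ceil(pending / per_replica))


def reconcile(current: int, measure_pending: Callable[[], int]) -> List[int]:
    """Return the sequence of replica counts the step passes through."""
    trace: List[int] = []
    if current == 0:
        # A scaled-to-zero step has no pod to measure the queue,
        # so it briefly scales to one first (the "peek").
        trace.append(1)
    trace.append(desired_replicas(measure_pending()))
    return trace


print(reconcile(0, lambda: 2500))   # [1, 3]
print(reconcile(0, lambda: 0))      # [1, 0]
print(reconcile(2, lambda: 50000))  # [10]
```

In this sketch a step at zero replicas always pays the cost of one short-lived pod per measurement, which is the trade-off the scale-to-zero behaviour described above implies.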

Learn more about features.

Use Cases

Real-time "click" analytics
Anomaly detection
Fraud detection
Operational (including IoT) analytics

Screenshot

Example

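Install the Python DSL:

pip install git+https://github.com/argoproj-labs/argo-dataflow#subdirectory=dsls/python

Then define and run a pipeline:

from argo_dataflow import cron, pipeline

if __name__ == '__main__':
    (pipeline('hello')
     .namespace('argo-dataflow-system')
     .step(
         # Source: six-field cron schedule (with seconds), firing every 3 seconds.
         (cron('*/3 * * * * *')
          .cat()   # pass each message through unchanged
          .log())  # sink: write each message to the log
     )
     .run())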

Documentation

Read in order:

Beginner:

Intermediate:

Advanced:

Architecture Diagram

Directories

Path	Synopsis
api
v1alpha1	Package v1alpha1 contains API Schema definitions for the dataflow v1alpha1 API group +kubebuilder:object:generate=true +groupName=dataflow.argoproj.io
examples
git
hack
kafka Module
kill
manager
controllers
controllers/scaling
prestop
runner
init
sidecar
sidecar/shared/kafka
sidecar/shared/nats
sidecar/shared/stan
sidecar/sink
sidecar/sink/db
sidecar/sink/http
sidecar/sink/jetstream
sidecar/sink/kafka
sidecar/sink/log
sidecar/sink/s3
sidecar/sink/stan
sidecar/sink/volume
sidecar/source
sidecar/source/cron
sidecar/source/db
sidecar/source/http
sidecar/source/jetstream
sidecar/source/kafka
sidecar/source/loadbalanced
sidecar/source/s3
sidecar/source/stan
sidecar/source/volume
sidecar/tls
util
runtimes
golang1-17
sdks
golang	Code generated by gen.sh.
shared
builtin
builtin/cat
builtin/dedupe
builtin/expand
builtin/filter
builtin/flatten
builtin/group
builtin/map
containerkiller
debug
podexec
symbol
util
util/retry
test
testapi
