Английская Википедия:Apache Beam
Шаблон:Short description Шаблон:Infobox software Apache Beam is an open source unified programming model to define and execute data processing pipelines, including ETL, batch and stream (continuous) processing.[1] Beam Pipelines are defined using one of the provided SDKs and executed in one of the Beam’s supported runners (distributed processing back-ends) including Apache Flink, Apache Samza, Apache Spark, and Google Cloud Dataflow.[2]
History
Apache Beam[2] is one implementation of the Dataflow model paper.[3] The Dataflow model is based on previous work on distributed processing abstractions at Google, in particular on FlumeJava[4] and Millwheel.[5][6]
Google released an open SDK implementation of the Dataflow model in 2014 and an environment to execute Dataflows locally (non-distributed) as well as in the Google Cloud Platform service.
Timeline
Apache Beam makes minor releases every 6 weeks.[7]
See also
References
Шаблон:Apache Software Foundation Шаблон:Google FOSS
- Английская Википедия
- Apache Software Foundation
- Apache Software Foundation projects
- Big data products
- Cluster computing
- Distributed stream processing
- Google software
- Hadoop
- Java platform
- Free software programmed in Java (programming language)
- Страницы, где используется шаблон "Навигационная таблица/Телепорт"
- Страницы с телепортом
- Википедия
- Статья из Википедии
- Статья из Английской Википедии