wordcount

command

v2.25.0+incompatible Latest Latest Go to latest Published: Oct 20, 2020 License: Apache-2.0 Imports: 10 Imported by: 0

Details

Valid go.mod file

The Go module system was introduced in Go 1.11 and is the official dependency management solution for Go.
Redistributable license

Redistributable licenses place minimal restrictions on how software can be used, modified, and redistributed.
Tagged version

Modules with tagged versions give importers more predictable builds.
Stable version

When a project reaches major version v1 it is considered stable.
Learn more about best practices

Repository

Repository URL not available.

Links

Open Source Insights

Documentation ¶

Overview ¶

wordcount is an example that counts words in Shakespeare and includes Beam best practices.

This example is the second in a series of four successively more detailed 'word count' examples. You may first want to take a look at minimal_wordcount. After you've looked at this example, then see the debugging_workcount pipeline, for introduction of additional concepts.

For a detailed walkthrough of this example, see

https://beam.apache.org/get-started/wordcount-example/

Basic concepts, also in the minimal_wordcount example: Reading text files; counting a PCollection; writing to text files

New Concepts:

Executing a Pipeline both locally and using the selected runner
Defining your own pipeline options
Using ParDo with static DoFns defined out-of-line
Building a composite transform

Concept #1: you can execute this pipeline either locally or using by selecting another runner. These are now command-line options added by the 'beamx' package and not hard-coded as they were in the minimal_wordcount example. The 'beamx' package also registers all included runners and filesystems as a convenience.

To change the runner, specify:

--runner=YOUR_SELECTED_RUNNER

To execute this pipeline, specify a local output file (if using the 'direct' runner) or a remote file on a supported distributed file system.

--output=[YOUR_LOCAL_FILE | YOUR_REMOTE_FILE]

The input file defaults to a public data set containing the text of of King Lear, by William Shakespeare. You can override it and choose your own input with --input.

Source Files ¶

View all Source files

wordcount.go

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL