protoc-gen-bq-schema

command module
v0.0.0-...-7ff2b2b Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Jul 21, 2020 License: Apache-2.0 Imports: 13 Imported by: 0

README

protoc-gen-bq-schema

protoc-gen-bq-schema is a plugin for ProtocolBuffer compiler. It converts messages written in .proto format into schema files in JSON for BigQuery. So you can reuse existing data definitions in .proto for BigQuery with this plugin.

Installation

go get github.com/GoogleCloudPlatform/protoc-gen-bq-schema

Usage

protoc --bq-schema_out=path/to/outdir foo.proto

protoc and protoc-gen-bq-schema commands must be found in $PATH.

The generated JSON schema files are suffixed with .schema and their base names are named after their package names and bq_table_name options.

If you do not already have the standard google protobuf libraries in your proto_path, you'll need to specify them directly on the command line (and potentially need to copy bq_schema.proto into a proto_path directory as well), like this:

protoc --bq-schema_out=path/to/out/dir foo.proto --proto_path=. --proto_path=<path_to_google_proto_folder>/src
Example

Suppose that we have the following foo.proto.

syntax = "proto2";
package foo;
import "bq_table.proto";
import "bq_field.proto";

message Bar {
  option (gen_bq_schema.bigquery_opts).table_name = "bar_table";

  message Nested {
    repeated int32 a = 1;
  }

  required int32 a = 1;
  optional Nested b = 2;
  repeated string c = 3;

  optional bool d = 4 [(gen_bq_schema.bigquery).ignore = true];
  optional uint64 e = 5 [
    (gen_bq_schema.bigquery) = {
      require: true
      type_override: 'TIMESTAMP'
    }
  ];
}

message Baz {
  required int32 a = 1;
}

protoc --bq-schema_out=. foo.proto will generate a file named foo/bar_table.schema. The message foo.Baz is ignored because it doesn't have option gen_bq_schema.bigquery_opts.

Example - Policy Tags

Support exists for specifying a BigQuery Policy Tag on a field.

Suppose that we have the following foo.proto that has policy_tags set on the first_name field:

syntax = "proto3";
package foo;
import "bq_table.proto";
import "bq_field.proto";

message Bar {
  option (gen_bq_schema.bigquery_opts).table_name = "bar_table";

  required string user = 1;
  optional string first_name = 2 [(gen_bq_schema.bigquery).policy_tags = "pii"];
}

protoc --bq-schema_out=. foo.proto will generate a file named foo/bar_table.schema. The table bar_table is defined in bar_table.schema and the field with name first_name includes the policyTags json marking it as a field which is subject to some form of column-level security.

{
  "name": "first_name",
  "type": "STRING",
  "mode": "NULLABLE",
  "policyTags": {
   "names": [
    "pii"
   ]
  }
 }

Policy tags must take the form projects/project-id/locations/location/taxonomies/taxonomy-id/policyTags/policytag-id

Special Case - OneOf Repeated fields of type Empty Message

Take the following example:

message PartnerPositionApplicationRequiresReviewEvent {
  option (gen_bq_schema.bigquery_opts).table_name = "partner_position_application_requires_review";

  string application_id = 1;
  repeated ApplicationReviewReason reasons = 2;
}

message ApplicationReviewReason {
  oneof reason {
    ReasonAddressMatch address_match = 1;
    ReasonHasCriminalRecord has_criminal_record = 2;
  }
}

message ReasonAddressMatch {
  repeated string address_partner_ids = 1;
}

message ReasonHasCriminalRecord {}

The reason has_criminal_record is message type ReasonHasCriminalRecord which has no fields. In this instance the message is used like a boolean. Rather than discard this field in the BigQuery schema, a field of type BOOLEAN is created instead.

License

protoc-gen-bq-schema is licensed under the Apache License version 2.0. This is not an official Google product.

Documentation

Overview

protoc plugin which converts .proto to schema for BigQuery. It is spawned by protoc and generates schema for BigQuery, encoded in JSON.

usage:

$ bin/protoc --bq-schema_out=path/to/outdir foo.proto

Directories

Path Synopsis

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL