loader

package
v4.1.23 Latest
Warning: This package is not in the latest version of its module.
Published: Nov 7, 2022 | License: Apache-2.0 | Imports: 10 | Imported by: 0

Documentation

Constants

const String16RuneSize = 16

Variables

This section is empty.

Functions

func CSVtoNumpyMulti

func CSVtoNumpyMulti(csvReader *csv.Reader, tbk io.TimeBucketKey, cvm *CSVMetadata, chunkSize int,
	isVariable bool,
) (npm *io.NumpyMultiDataset, endReached bool, err error)
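
The page gives no description for CSVtoNumpyMulti, but the `chunkSize` parameter and the `endReached` result suggest that the CSV is consumed in batches. The sketch below illustrates that general pattern with the standard library only. `readChunk` and `chunkLengths` are hypothetical helpers, not part of this package, and the real function may behave differently.

```go
package main

import (
	"encoding/csv"
	"errors"
	"fmt"
	"io"
	"strings"
)

// readChunk is a hypothetical helper, not part of the loader package.
// It reads at most chunkSize records and reports whether the input is exhausted.
func readChunk(r *csv.Reader, chunkSize int) (rows [][]string, endReached bool, err error) {
	if chunkSize < 1 {
		return nil, false, errors.New("chunkSize must be positive")
	}
	for len(rows) < chunkSize {
		rec, readErr := r.Read()
		if errors.Is(readErr, io.EOF) {
			return rows, true, nil // no more records
		}
		if readErr != nil {
			return rows, false, readErr
		}
		rows = append(rows, rec)
	}
	return rows, false, nil
}

// chunkLengths drains data chunk by chunk and returns the size of each chunk.
func chunkLengths(data string, chunkSize int) ([]int, error) {
	r := csv.NewReader(strings.NewReader(data))
	var sizes []int
	for {
		rows, end, err := readChunk(r, chunkSize)
		if err != nil {
			return sizes, err
		}
		sizes = append(sizes, len(rows))
		if end {
			return sizes, nil
		}
	}
}

func main() {
	sizes, err := chunkLengths("a,1\nb,2\nc,3\n", 2)
	if err != nil {
		fmt.Println("read error:", err)
		return
	}
	fmt.Println(sizes) // prints [2 1]
}
```

A caller following this pattern would keep requesting chunks until the end flag is set, which bounds memory use for large files.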

Types

type CSVConfig

type CSVConfig struct {
	FirstRowHasColumnNames bool     `yaml:"firstRowHasColumnNames"`
	TimeFormat             string   `yaml:"timeFormat"`
	Timezone               string   `yaml:"timeZone"`
	ColumnNameMap          []string `yaml:"columnNameMap"`
}

CSVConfig is constructed from the control file that specifies the format of the CSV data.
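
As a rough illustration of how these fields might be used together, the sketch below fills in a CSVConfig by hand and parses one timestamp with it. It assumes that `TimeFormat` is a Go reference-time layout and that `Timezone` is an IANA zone name; the page does not state either. `parseTimestamp` is a hypothetical helper, the field values are made up, and the struct is copied locally so the example compiles on its own.

```go
package main

import (
	"fmt"
	"time"
)

// CSVConfig mirrors the struct documented above so the sketch is self-contained.
type CSVConfig struct {
	FirstRowHasColumnNames bool     `yaml:"firstRowHasColumnNames"`
	TimeFormat             string   `yaml:"timeFormat"`
	Timezone               string   `yaml:"timeZone"`
	ColumnNameMap          []string `yaml:"columnNameMap"`
}

// parseTimestamp is a hypothetical helper, not part of the loader package.
// Assumption: TimeFormat is a Go layout string and Timezone is an IANA name.
func parseTimestamp(cfg *CSVConfig, raw string) (time.Time, error) {
	loc, err := time.LoadLocation(cfg.Timezone)
	if err != nil {
		return time.Time{}, fmt.Errorf("load timezone %q: %w", cfg.Timezone, err)
	}
	return time.ParseInLocation(cfg.TimeFormat, raw, loc)
}

func main() {
	// Illustrative values only; a real control file supplies these.
	cfg := &CSVConfig{
		FirstRowHasColumnNames: true,
		TimeFormat:             "20060102 15:04:05",
		Timezone:               "UTC",
		ColumnNameMap:          []string{"Epoch", "Ask", "Bid"},
	}
	ts, err := parseTimestamp(cfg, "20221107 09:30:00")
	if err != nil {
		fmt.Println("parse error:", err)
		return
	}
	fmt.Println(ts.Unix())
}
```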

type CSVMetadata

type CSVMetadata struct {
	Config *CSVConfig // Configuration of the CSV file, including the names of the columns
	// DSV holds the data shapes inside this CSV file. The first 2 columns are "Epoch-date" and "Epoch-time".
	// If the schema of the existing bucket is "Epoch,Ask,Bid", DSV is ["Epoch-date", "Epoch-time", "Epoch", "Ask", "Bid"].
	DSV []io.DataShape
	// ColumnIndex maps the index of the columns in the CSV file to each time bucket in the DB.
	// ColumnIndex[i+2]=-1 when the column of DSV[i] doesn't exist in the provided CSV file.
	// e.g. when the bucket is "Epoch,Ask,Bid" and ColumnIndex[3] = -1, it means the provided CSV doesn't have an "Ask" column.
	ColumnIndex []int
	ColumnIndex []int
}
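
The -1 convention described for ColumnIndex can be sketched as a lookup from expected column names to positions in the CSV header. This is only an illustration under the assumption that the index is laid out in the same order as DSV, which matches the "Ask" example in the comment. `buildColumnIndex` is a hypothetical helper, not the package's own code, and it works on plain column names rather than `io.DataShape` values.

```go
package main

import "fmt"

// buildColumnIndex is a hypothetical helper, not the loader's own code.
// For each expected column name it returns the position of that column
// in the CSV header, or -1 when the header does not contain it.
func buildColumnIndex(expected, header []string) []int {
	pos := make(map[string]int, len(header))
	for i, name := range header {
		pos[name] = i
	}
	index := make([]int, len(expected))
	for i, name := range expected {
		if p, ok := pos[name]; ok {
			index[i] = p
		} else {
			index[i] = -1 // column missing from the CSV file
		}
	}
	return index
}

func main() {
	// Names taken from the struct comment; the CSV header here lacks "Ask".
	expected := []string{"Epoch-date", "Epoch-time", "Epoch", "Ask", "Bid"}
	header := []string{"Epoch-date", "Epoch-time", "Epoch", "Bid"}
	fmt.Println(buildColumnIndex(expected, header)) // prints [0 1 2 -1 3]
}
```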

func ReadMetadata

func ReadMetadata(dataFD, controlFD *os.File, dbDataShapes []io.DataShape) (csvReader *csv.Reader, cvm *CSVMetadata, err error)

ReadMetadata returns formatting information about the CSV file containing the data to be loaded into the database.
