excelFormExtractor

Version: v0.3.3
Published: Dec 2, 2024 License: MIT

README

Excel extractor

Extract Excel form content into structured data.

Usage

  1. SECCF extraction: Supplier Export Control Classification Form/declaration
from py_excel_form_extractor import extractor, go

company_names = extractor.CompanyNameList()  # company names that may be mentioned in the file
for company_name in ["Boeing", "Boeing Ltd"]:
    company_names.append(company_name)

extr = extractor.make_seccf_extractor("Example.xlsx", company_names)
extraction = extr.extract()

# convert to JSON string
extr_json = extr.to_json()
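
The JSON string returned by `to_json()` can be turned back into ordinary Python objects with the standard library. The following is a minimal sketch, not part of the package: `parse_extraction_json` is a hypothetical helper, and the sample string and its field names are invented for illustration, since the real output depends on the form being extracted.

```python
import json
from typing import Any


def parse_extraction_json(extr_json: str) -> Any:
    """Parse a JSON string (such as the one from to_json()) into Python objects."""
    try:
        return json.loads(extr_json)
    except json.JSONDecodeError as err:
        # Re-raise with context so the caller knows the extractor output was at fault.
        raise ValueError(f"extractor returned invalid JSON: {err}") from err


# Hypothetical sample; real field names depend on the extractor's output.
sample_json = '{"company": "Boeing Ltd", "items": [{"part": "A-1"}]}'
data = parse_extraction_json(sample_json)
print(data["company"])
```

In real use, `sample_json` would be replaced by the `extr_json` value from the example above.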

Build

  1. Build the Go binary:
go build -o ./bin/excel-extrator ./cmd/excelFormExtractor/main.go
  2. Run the program without building the binary:
go run ./cmd/excelFormExtractor/main.go
  3. Run the built binary:
./bin/excel-extrator

Local Python bindings generation and installation

pip3 install pybindgen wheel
gopy build --output=py_excel_form_extractor -vm=python3 ./pkg/*
RELEASE_VERSION=YOUR_UPDATED_PACKAGE_VERSION python3 setup.py bdist_wheel --force

# install wheel file
wheel_file=$(ls dist/*.whl | head -n1); pip3 install "$wheel_file"
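
The shell one-liner above installs whichever wheel sorts first by name, which may not be the one just built if `dist/` holds older builds. As a hedged alternative, the sketch below picks the most recently modified wheel instead. The helper names `newest_wheel` and `install_wheel` are hypothetical and not part of this package; only the standard library and `pip` are assumed.

```python
import subprocess
import sys
from pathlib import Path
from typing import Optional


def newest_wheel(dist_dir: str = "dist") -> Optional[Path]:
    """Return the most recently modified .whl file in dist_dir, or None if there is none."""
    wheels = list(Path(dist_dir).glob("*.whl"))
    if not wheels:
        return None
    return max(wheels, key=lambda p: p.stat().st_mtime)


def install_wheel(wheel: Path) -> None:
    """Install the wheel into the interpreter that runs this script."""
    subprocess.run(
        [sys.executable, "-m", "pip", "install", str(wheel)],
        check=True,  # raise CalledProcessError if pip fails
    )
```

A typical call would be `wheel = newest_wheel()` followed by `install_wheel(wheel)` once `wheel` is confirmed not to be `None`. Using `sys.executable -m pip` keeps the install tied to the same interpreter that will import `py_excel_form_extractor`.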

Directories

Path Synopsis
cmd
pkg
