Directories

Path Synopsis
addtoqueue adds a message to a queue.
bookpipeline is the core command of the bookpipeline package, which watches queues for messages and does various OCR related tasks when it receives them, saving the results in cloud storage.
booktopipeline uploads a book to cloud storage and adds the name to a queue ready to be processed by the bookpipeline tool.
confgraph creates a graph showing the average word confidence of each page of hOCR in a directory.
getallhocrs downloads every 'best' file from a set of OCRed books stored on cloud infrastructure.
getandpurgequeue gets and deletes all messages from a queue.
getbests downloads every 'best' file from a set of OCRed books stored on cloud infrastructure.
getpipelinebook downloads the pipeline results for a book.
getsamplepages downloads sample pages from each book in a set of OCRed books.
getstats gets the files needed to create statistics from a set of OCRed books stored on cloud infrastructure.
logwholequeue gets all messages in a queue.
lspipeline lists useful things related to the book pipeline.
mkpipeline sets up the necessary buckets and queues for the book pipeline.
pagegraph creates a graph showing the average confidence of each word in a page of hOCR.
pdfbook creates a searchable PDF from a directory of hOCR and image files.
rescribe is a modification of bookpipeline designed for local-only operation, which rolls uploading, processing, and downloading of a single book by the pipeline into one command.
spotme creates new spot instances for the book pipeline.
trimqueue deletes any messages in a queue that match a specified prefix.