chronica

command module

v0.2.6 Latest Latest Go to latest Published: Feb 16, 2024 License: Apache-2.0 Imports: 8 Imported by: 0

Details

Valid go.mod file

The Go module system was introduced in Go 1.11 and is the official dependency management solution for Go.
Redistributable license

Redistributable licenses place minimal restrictions on how software can be used, modified, and redistributed.
Tagged version

Modules with tagged versions give importers more predictable builds.
Stable version

When a project reaches major version v1 it is considered stable.
Learn more about best practices

Repository

github.com/9rum/chronica

README ¶

Chronica

A data-imbalance-aware scheduler for distributed deep learning

Chronica is a data-imbalance-aware scheduler and associated scheduling framework for distributed deep learning. Chronica accelerates data-parallel training with imbalanced data such as video, audio and text by reducing redundant operations and memory footprint while improving scalability and resource utilization. The goal of Chronica is to make a fast, efficient and easy-to-use data scheduling framework, compatible with existing deep learning systems such as TensorFlow, PyTorch and JAX.

Highlighted features

Data-imbalance-aware scheduling

Chronica provides several schedule kinds such as static and dynamic, each of which can be useful for workloads with different characteristics.

Partitioned data set

In addition to the traditional fully sharded data sets, Chronica supports a partitioned data set where the data is split into multiple data partitions across nodes in the cluster.

Minimal code modifications

While benefiting from the above features, users only need to modify a few lines in the existing code base, and the rests are transparent.

Supported frameworks

Chronica currently supports PyTorch only.

Quick start

Prerequisites

You need to have the latest version of the Go compiler installed on the master node (rank 0) and add the GOBIN environment variable to PATH, i.e., PATH=$GOBIN:$PATH. Once the Go compiler is installed, you can install the chronica pip package.

pip install chronica

How to use

To use Chronica, make the following modifications to your program:

Make your data set inherit from Chronica's data set class instead of existing data set class: e.g., for PyTorch, use chronica.torch.utils.data.Dataset instead of torch.utils.data.Dataset.
Overwrite __sizeof__ in your data set, which represents the relative size of each data sample: e.g., for video data sets, the relative size of each data sample is determined by the number of frames. Thus you can overwrite __sizeof__ using OpenCV as follows.
```
def __sizeof__(self, index: int) -> int:
    return int(cv2.VideoCapture(self.videos[index]).get(cv2.CAP_PROP_FRAME_COUNT))
```
Use Chronica's data sampler instead of existing data sampler: e.g., for PyTorch, use chronica.torch.utils.data.DistributedSampler instead of torch.utils.data.DistributedSampler.
Pass additional parameters to the data sampler: e.g., set kind="dynamic" if you need dynamic scheduling.

[!TIP] In most cases, the above modifications can be done by adding chronica. to import statements and overwriting __sizeof__, as shown below:

-from torch.utils.data import Dataset
+from chronica.torch.utils.data import Dataset

+    def __sizeof__(self, index: int) -> int:
+        return int(cv2.VideoCapture(self.videos[index]).get(cv2.CAP_PROP_FRAME_COUNT))

-from torch.utils.data import DistributedSampler
+from chronica.torch.utils.data import DistributedSampler

Publications

Sanha Maeng, Gordon Euhyun Moon and Sungyong Park, Chronica: A Data-Imbalance-Aware Scheduler for Distributed Deep Learning, in Proceedings of the 2023 IEEE/ACM 23rd International Symposium on Cluster, Cloud and Internet Computing (CCGrid)

Citation

If you use Chronica in your work, we would appreciate citations using the following metadata:

@inproceedings{maeng2023chronica,
  author={Sanha Maeng, Gordon Euhyun Moon and Sungyong Park},
  booktitle={2023 IEEE/ACM 23rd International Symposium on Cluster, Cloud and Internet Computing (CCGrid)},
  title={Chronica: A Data-Imbalance-Aware Scheduler for Distributed Deep Learning},
  year={2023},
  pages={262--272},
  doi={10.1109/CCGrid57682.2023.00033}
}

Documentation ¶

Overview ¶

Package main runs the communicator server. The initialization and termination of the server may be invoked by the sampler, and the types of data set and scheduler are provided upon the initialization.

Source Files ¶

View all Source files

main.go

Directories ¶

Path	Synopsis
communicator The communicator package implements an intermediary to communicate with Chronica's scheduler.	The communicator package implements an intermediary to communicate with Chronica's scheduler.
internal
btree Package btree implements in-memory B-trees of arbitrary degree.	Package btree implements in-memory B-trees of arbitrary degree.
data Package data provides primitives for representing and organizing the given data sets.	Package data provides primitives for representing and organizing the given data sets.
scheduler Package scheduler provides primitives for scheduling imbalanced data.	Package scheduler provides primitives for scheduling imbalanced data.

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL