db

package

v0.2.56 Latest Latest Go to latest Published: Dec 12, 2025 License: MIT Imports: 10 Imported by: 0

Details

Valid go.mod file
Redistributable license
Tagged version
Stable version
Learn more about best practices

Repository

github.com/Project-Sylos/Spectra

Links

Open Source Insights

README ¶

Database Package

The database package provides the data persistence layer for Spectra using BoltDB. It implements a unified single-bucket architecture with per-world existence tracking for maximum performance.

Structure

db/
├── db.go      # Main database operations and CRUD
└── schema.go  # Bucket initialization and verification

Single-Bucket Architecture

Unified `nodes` Bucket

All nodes stored in a single bucket with plain UUID IDs as keys
Each node stored as JSON-serialized types.Node struct
existence_map JSON field tracks which "worlds" (primary, s1, s2, etc.) each node exists in
Optimized for minimal database round trips

Index Buckets

index_parent_id: Key format {parentID}|{nodeID} for efficient parent-child lookups
index_path: Key format {path} → value {nodeID} for path-based lookups
index_parent_path: Key format {parentPath}|{nodeID} for parent path queries

World-Based Filtering

Nodes are filtered by world in Go code after deserialization
Each node can exist in multiple worlds simultaneously
Filtering checks existence_map[world] boolean value

Key Features

Single Vectorized Queries: Fetch parent + children in one operation
World-Aware Filtering: Efficient Go-based world filtering
Bulk Operations: Transaction-based bulk inserts for performance
Indexed Queries: Optimized index buckets for common query patterns
ACID Compliance: BoltDB provides ACID transactions automatically

Core Operations

Node Management

InsertNode(node) - Insert node into nodes bucket and update all indexes
GetNodeByID(id) - Retrieve node by ID from nodes bucket
GetNodeByPath(path, world) - Retrieve node by path using index_path bucket
DeleteNode(id) - Delete node from nodes bucket and all indexes
BulkInsertNodes(nodes) - Insert multiple nodes in one transaction

Children Operations

GetChildrenByParentID(parentID, world) - Get children filtered by world using index_parent_id
GetParentAndChildren(parentID, world) - Get parent + children in ONE operation (optimized)
CheckChildrenExist(parentID, world) - Check if parent has children in world

System Operations

InitializeBuckets() - Create all required buckets
CreateRootNode() - Create single root node with existence in all worlds
DeleteAllNodes() - Clear nodes bucket and all index buckets
GetTableInfo() - Get world metadata
GetNodeCount(world) - Count nodes in specific world

Bucket Structure

`nodes` Bucket

Key: Node ID (UUID string)
Value: JSON-serialized types.Node struct

`index_parent_id` Bucket

Key: {parentID}|{nodeID} (e.g., "root|abc-123")
Value: Empty (key contains all information)

`index_path` Bucket

Key: Node path (e.g., "/folder/file.txt")
Value: Node ID (UUID string)

`index_parent_path` Bucket

Key: {parentPath}|{nodeID} (e.g., "/folder|abc-123")
Value: Empty (key contains all information)

Node Structure

Each node is stored as a JSON-serialized types.Node:

type Node struct {
    ID           string          // UUID identifier
    ParentID     string          // UUID parent reference
    Name         string          // Display name
    Path         string          // Relative path
    ParentPath   string          // Parent path
    Type         string          // "folder" or "file"
    DepthLevel   int             // BFS-style depth index
    Size         int64           // File size (0 for folders)
    LastUpdated  time.Time       // Synthetic timestamp
    Checksum     *string         // SHA256 checksum (NULL for folders)
    ExistenceMap map[string]bool // JSON: {"primary": true, "s1": true, "s2": false}
}

Performance Optimizations

Vectorized Queries

The GetParentAndChildren method fetches both parent and all children efficiently:

Fetch parent node by ID from nodes bucket
Use index_parent_id bucket with prefix scan to find all children
Filter by world in Go after deserialization
Sort results (parent first, then by type and name)

Index Prefix Scans

Parent-child queries use efficient prefix scans on index_parent_id:

Prefix: {parentID}|
All keys starting with this prefix represent children of that parent
BoltDB's ordered key structure makes this very efficient

Bulk Operations

BulkInsertNodes performs all inserts in a single BoltDB transaction:

All nodes inserted atomically
All indexes updated in the same transaction
Automatic rollback on any error

Usage

The database layer is used internally by the SpectraFS implementation. It provides the foundation for all data persistence operations in the synthetic filesystem.

Documentation ¶

Index ¶

func InitializeBuckets(db *bbolt.DB) error
func VerifyBucketsExist(db *bbolt.DB) error
type DB
- func New(dbPath string, secondaryTables map[string]float64) (*DB, error)

Constants ¶

This section is empty.

Variables ¶

This section is empty.

Functions ¶

func InitializeBuckets ¶

func InitializeBuckets(db *bbolt.DB) error

InitializeBuckets creates all required buckets in the BoltDB database This replaces the SQL table creation logic

func VerifyBucketsExist ¶

func VerifyBucketsExist(db *bbolt.DB) error

VerifyBucketsExist checks if all required buckets exist in the database This replaces the SQL table existence checks

Types ¶

type DB ¶

type DB struct {
	// contains filtered or unexported fields
}

DB wraps BoltDB connection and provides key-value CRUD operations

func New ¶

func New(dbPath string, secondaryTables map[string]float64) (*DB, error)

New creates a new database connection and initializes the schema

func (*DB) BulkInsertNodes ¶

func (db *DB) BulkInsertNodes(nodes []*types.Node) error

BulkInsertNodes inserts multiple nodes in a single BoltDB transaction

func (*DB) CheckChildrenExist ¶

func (db *DB) CheckChildrenExist(parentID, world string) (bool, error)

CheckChildrenExist checks if a parent has any children in a specific world

func (*DB) Close ¶

func (db *DB) Close() error

Close closes the database connection BoltDB is ACID compliant and automatically persists all changes

func (*DB) CreateRootNode ¶

func (db *DB) CreateRootNode() error

CreateRootNode creates a single root node with existence in all worlds This function is idempotent - it will skip creating the node if it already exists

func (*DB) DeleteAllNodes ¶

func (db *DB) DeleteAllNodes() error

DeleteAllNodes removes all nodes from the nodes bucket and all indexes (for Reset)

func (*DB) DeleteNode ¶

func (db *DB) DeleteNode(id string) error

DeleteNode deletes a node from the nodes bucket and all indexes

func (*DB) GetChildrenByParentID ¶

func (db *DB) GetChildrenByParentID(parentID, world string) ([]*types.Node, error)

GetChildrenByParentID retrieves all children of a parent node filtered by world

func (*DB) GetNodeByID ¶

func (db *DB) GetNodeByID(id string) (*types.Node, error)

GetNodeByID retrieves a node by its ID from the nodes bucket

func (*DB) GetNodeByPath ¶

func (db *DB) GetNodeByPath(path, world string) (*types.Node, error)

GetNodeByPath retrieves a node by its path, optionally filtering by world

func (*DB) GetNodeCount ¶

func (db *DB) GetNodeCount(world string) (int, error)

GetNodeCount returns the total number of nodes in a specific world

func (*DB) GetParentAndChildren ¶

func (db *DB) GetParentAndChildren(parentID, world string) ([]*types.Node, error)

GetParentAndChildren retrieves parent and all its children in ONE optimized query This is the key performance optimization for ListChildren operations

func (*DB) GetSecondaryTables ¶

func (db *DB) GetSecondaryTables() []string

GetSecondaryTables returns the list of secondary world names

func (*DB) GetStats ¶

func (db *DB) GetStats() (*types.Stats, error)

GetStats retrieves the current filesystem statistics

func (*DB) GetTableInfo ¶

func (db *DB) GetTableInfo() ([]types.TableInfo, error)

GetTableInfo returns information about all worlds

func (*DB) InsertNode ¶

func (db *DB) InsertNode(node *types.Node) error

InsertNode inserts a new node into the nodes bucket and updates all indexes

func (*DB) UpdateExistenceMap ¶

func (db *DB) UpdateExistenceMap(id string, existenceMap map[string]bool) error

UpdateExistenceMap updates the existence map for a node

func (*DB) VerifyAndInitialize ¶

func (db *DB) VerifyAndInitialize(dbFileExists bool, secondaryTables map[string]float64) error

VerifyAndInitialize performs comprehensive database verification and initialization It checks each stage and creates what's missing: A) Database file exists (checked before connection) B) Buckets exist C) Root node exists

Source Files ¶

View all Source files

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL