rate

package module

v0.1.8 Latest Latest Go to latest Published: May 18, 2025 License: MIT Imports: 4 Imported by: 0

Details

Valid go.mod file
Redistributable license
Tagged version
Stable version
Learn more about best practices

Repository

github.com/webriots/rate

Links

Open Source Insights

README ¶

Rate

A high-performance rate limiting library for Go applications with multiple rate limiting strategies.

Features

Ultra-Fast: Core operations execute in single digit nanoseconds with zero allocations, enabling hundreds of millions of operations per second
Thread-Safe: All operations are designed to be thread-safe using optimized atomic operations
Zero External Dependencies: Relies solely on the Go standard library
Memory Efficient: Uses compact data structures and optimized storage
- Packed representations for token buckets
- Custom 56-bit timestamp implementation
- Efficient atomic slice implementations
Multiple Rate Limiting Strategies:
- TokenBucketLimiter: Classic token bucket algorithm with multiple buckets
- AIMDTokenBucketLimiter: Additive-Increase/Multiplicative-Decrease algorithm inspired by TCP congestion control
Highly Scalable: Designed for high-throughput concurrent systems
- Multiple buckets distribute load across different request IDs
- Low contention design for concurrent access patterns

Installation

go get github.com/webriots/rate

Quick Start

Token Bucket Rate Limiter

package main

import (
	"fmt"
	"time"

	"github.com/webriots/rate"
)

func main() {
	// Create a new token bucket limiter:
	// - 1024 buckets (automatically rounded to nearest power of 2 if not already a power of 2)
	// - 10 tokens burst capacity
	// - 100 tokens per second refill rate
	limiter, err := rate.NewTokenBucketLimiter(1024, 10, 100, time.Second)
	if err != nil {
		panic(err)
	}

	// Try to take a token for a specific ID
	id := []byte("user-123")

	if limiter.TakeToken(id) {
		fmt.Println("Token acquired, proceeding with request")
		// ... process the request
	} else {
		fmt.Println("Rate limited, try again later")
		// ... return rate limit error
	}

	// Check without consuming
	if limiter.Check(id) {
		fmt.Println("Token would be available")
	}
}

Go Playground

AIMD Rate Limiter

package main

import (
	"fmt"
	"time"

	"github.com/webriots/rate"
)

func main() {
	// Create an AIMD token bucket limiter:
	// - 1024 buckets (automatically rounded to nearest power of 2 if not already a power of 2)
	// - 10 tokens burst capacity
	// - Min rate: 1 token/s, Max rate: 100 tokens/s, Initial rate: 10 tokens/s
	// - Increase by 1 token/s on success, decrease by factor of 2 on failure
	limiter, err := rate.NewAIMDTokenBucketLimiter(
		1024,  // numBuckets
		10,    // burstCapacity
		1.0,   // rateMin
		100.0, // rateMax
		10.0,  // rateInit
		1.0,   // rateAdditiveIncrease
		2.0,   // rateMultiplicativeDecrease
		time.Second,
	)
	if err != nil {
		panic(err)
	}

	id := []byte("api-client-123")

	// Try to take a token
	if limiter.TakeToken(id) {
		// Process the request
		success := processRequest()

		if success {
			// If successful, increase the rate
			limiter.IncreaseRate(id)
		} else {
			// If failed (e.g., downstream service is overloaded),
			// decrease the rate to back off
			limiter.DecreaseRate(id)
		}
	} else {
		// We're being rate limited
		fmt.Println("Rate limited, try again later")
	}
}

func processRequest() bool {
	// Your request processing logic
	fmt.Println("Processing request")
	return true
}

Go Playground

Detailed Usage

TokenBucketLimiter

The token bucket algorithm is a common rate limiting strategy that allows for controlled bursting. It maintains a "bucket" of tokens that refills at a constant rate, and each request consumes a token.

limiter, err := rate.NewTokenBucketLimiter(
    numBuckets,     // Number of buckets (automatically rounded to nearest power of 2 if not already a power of 2)
    burstCapacity,  // Maximum number of tokens in each bucket
    refillRate,     // Rate at which tokens are refilled
    refillRateUnit, // Time unit for refill rate
)

Parameters:

numBuckets: Number of token buckets (automatically rounded up to the nearest power of two if not already a power of two)
burstCapacity: Maximum number of tokens that can be consumed at once
refillRate: Rate at which tokens are refilled
refillRateUnit: Time unit for refill rate calculations (e.g., time.Second)

AIMDTokenBucketLimiter

The AIMD (Additive Increase, Multiplicative Decrease) algorithm provides dynamic rate adjustments inspired by TCP congestion control. It gradually increases token rates during successful operations and quickly reduces rates when encountering failures.

limiter, err := rate.NewAIMDTokenBucketLimiter(
    numBuckets,                // Number of buckets (automatically rounded to nearest power of 2 if not already a power of 2)
    burstCapacity,             // Maximum tokens per bucket
    rateMin,                   // Minimum token refill rate
    rateMax,                   // Maximum token refill rate
    rateInit,                  // Initial token refill rate
    rateAdditiveIncrease,      // Amount to increase rate on success
    rateMultiplicativeDecrease, // Factor to decrease rate on failure
    rateUnit,                  // Time unit for rate calculations
)

Parameters:

numBuckets: Number of token buckets (automatically rounded up to the nearest power of two if not already a power of two)
burstCapacity: Maximum number of tokens that can be consumed at once
rateMin: Minimum token refill rate
rateMax: Maximum token refill rate
rateInit: Initial token refill rate
rateAdditiveIncrease: Amount to increase rate by on success
rateMultiplicativeDecrease: Factor to decrease rate by on failure
rateUnit: Time unit for rate calculations (e.g., time.Second)

Performance

The library is optimized for high performance and efficiency:

Core operations complete in 1-15ns with zero allocations
Efficient memory usage through packed representations
Minimal contention in multi-threaded environments

Benchmarks

Run the benchmarks on your machine with:

# Run all benchmarks with memory allocation stats
go test -bench=. -benchmem

# Run specific benchmark
go test -bench=BenchmarkTokenBucketCheck -benchmem

Here are the results from an Apple M1 Pro:

# Core operations
BenchmarkTokenBucketCheck-10                    203429409                5.919 ns/op           0 B/op          0 allocs/op
BenchmarkTokenBucketTakeToken-10                165718064                7.240 ns/op           0 B/op          0 allocs/op
BenchmarkTokenBucketParallel-10                 841380494                1.419 ns/op           0 B/op          0 allocs/op
BenchmarkTokenBucketContention-10               655109293                1.827 ns/op           0 B/op          0 allocs/op
BenchmarkTokenBucketWithRefill-10               165840525                7.110 ns/op           0 B/op          0 allocs/op

# Bucket creation (different sizes)
BenchmarkTokenBucketCreateSmall-10              16929116                71.00 ns/op          192 B/op          2 allocs/op
BenchmarkTokenBucketCreateMedium-10               664831              1820 ns/op            8256 B/op          2 allocs/op
BenchmarkTokenBucketCreateLarge-10                 45537             25471 ns/op          131137 B/op          2 allocs/op

# Internals
BenchmarkTokenBucketPacked-10                   1000000000               0.3103 ns/op          0 B/op          0 allocs/op
BenchmarkTokenBucketUnpack-10                   1000000000               0.3103 ns/op          0 B/op          0 allocs/op
BenchmarkTokenBucketIndex-10                    265611426                4.510 ns/op           0 B/op          0 allocs/op
BenchmarkTokenBucketRefill-10                   591397021                2.023 ns/op           0 B/op          0 allocs/op

# Real-world scenarios
BenchmarkTokenBucketManyIDs-10                  139485549                8.590 ns/op           0 B/op          0 allocs/op
BenchmarkTokenBucketDynamicID-10                93835521                12.87 ns/op            0 B/op          0 allocs/op
BenchmarkTokenBucketRealWorldRequestRate-10     853565757                1.401 ns/op           0 B/op          0 allocs/op
BenchmarkTokenBucketHighContention-10           579507058                2.068 ns/op           0 B/op          0 allocs/op
BenchmarkTokenBucketWithSystemClock-10          459682273                2.605 ns/op           0 B/op          0 allocs/op

Advanced Usage

Choosing the Optimal Number of Buckets

The numBuckets parameter is automatically rounded up to the nearest power of two if not already a power of two (e.g., 256, 1024, 4096) for efficient hashing and is a critical factor in the limiter's performance and fairness.

How Bucket Selection Works

When you call TakeToken(id) or similar methods, the library:

Hashes the ID using Go's built-in hash/maphash algorithm (non-cryptographic, high performance)
Uses the hash value & (numBuckets-1) to determine the bucket index
Takes/checks a token from that specific bucket

// Simplified version of the internal hashing mechanism
func index(id []byte) int {
    return int(maphash.Bytes(seed, id) & (numBuckets - 1))
}

Understanding Collision Probability

A collision occurs when two different IDs map to the same bucket. When this happens:

Those IDs share the same rate limit, which may be undesirable
Higher contention can occur on that bucket when accessed concurrently

The probability of collision depends on:

Number of buckets (numBuckets)
Number of distinct IDs being rate limited

The birthday paradox formula gives us the probability of at least one collision:

Number of Distinct IDs	Buckets=1024	Buckets=4096	Buckets=16384	Buckets=65536
10	4.31%	1.09%	0.27%	0.07%
100	99.33%	70.43%	26.12%	7.28%
1,000	100.00%	100.00%	100.00%	99.95%
10,000	100.00%	100.00%	100.00%	100.00%
100,000	100.00%	100.00%	100.00%	100.00%

However, the probability of collision doesn't tell the whole story. What matters more is the percentage of IDs involved in collisions and the expected number of collisions:

Configuration	Load Factor	Expected Collisions	% of IDs in Collisions
100 IDs in 65,536 buckets	0.0015	~0.08	~0.15%
1,000 IDs in 65,536 buckets	0.0153	~7.6	~1.5%
10,000 IDs in 65,536 buckets	0.1526	~763	~7.3%

Guidelines for Choosing `numBuckets`

It's important to understand that even though the probability of having at least one collision approaches 100% quickly as the number of IDs increases, the percentage of IDs affected by collisions grows much more slowly.

Based on the percentage of IDs involved in collisions:

For small systems (≤100 IDs):
- 4,096 buckets: ~2.4% of IDs will be involved in collisions
- 16,384 buckets: ~0.6% of IDs will be involved in collisions
For medium systems (101-1,000 IDs):
- 16,384 buckets: ~6% of IDs will be involved in collisions
- 65,536 buckets: ~1.5% of IDs will be involved in collisions
For large systems (1,001-10,000 IDs):
- 65,536 buckets: ~7.3% of IDs will be involved in collisions
- 262,144 buckets (2^18): ~1.9% of IDs will be involved in collisions
For very large systems (>10,000 IDs):
- 1,048,576 buckets (2^20): ~4.7% of IDs among 100,000 will be involved in collisions

General Rule of Thumb: To keep collision impact below 5% of IDs, maintain a load factor (IDs/buckets) below 0.1.

Memory Impact:

Each bucket uses 8 bytes (uint64)
65,536 buckets = 512 KB of memory
1,048,576 buckets = 8 MB of memory

For systems with many distinct IDs where even low collision percentages are unacceptable, consider:

Sharding rate limiters by logical partitions (e.g., user type, region)
Using multiple rate limiters with different hashing algorithms
Implementing a consistent hashing approach

Note: While having some collisions is usually acceptable for rate limiting (it just means some IDs might share limits), the key is to keep the percentage of affected IDs low enough for your use case.

Memory usage scales linearly with numBuckets, so there's a trade-off between collision probability and memory consumption.

// Example for a system with ~5,000 distinct IDs
limiter, err := rate.NewTokenBucketLimiter(
    65536,    // numBuckets (will remain 2^16 since it's already a power of 2) to maintain <10% collision probability
    10,       // burstCapacity
    100,      // refillRate
    time.Second,
)

Note: If you observe uneven rate limiting across different IDs, consider increasing the number of buckets to reduce collision probability.

Custom Rate Limiting Scenarios

High-Volume API Rate Limiting

// API rate limiting with 100 requests per second, 10 burst capacity
limiter, _ := rate.NewTokenBucketLimiter(1024, 10, 100, time.Second)

// Take a token for a specific API key
apiKey := []byte("customer-api-key")
if limiter.TakeToken(apiKey) {
    // Process API request
} else {
    // Return 429 Too Many Requests
}

Adaptive Backend Protection with AIMD

// Create AIMD limiter for dynamic backend protection
limiter, _ := rate.NewAIMDTokenBucketLimiter(
    4096,   // numBuckets
    20,     // burstCapacity
    1.0,    // rateMin
    1000.0, // rateMax
    100.0,  // rateInit
    10.0,   // rateAdditiveIncrease
    2.0,    // rateMultiplicativeDecrease
    time.Second,
)

func handleRequest(backendID []byte) {
    if limiter.TakeToken(backendID) {
        err := callBackendService()
        if err == nil {
            // Backend is healthy, increase rate
            limiter.IncreaseRateForID(backendID)
        } else if isOverloadError(err) {
            // Backend is overloaded, decrease rate
            limiter.DecreaseRateForID(backendID)
        }
    } else {
        // Circuit breaking, return error immediately
    }
}

Design Philosophy

The library is designed with these principles in mind:

Performance First: Every operation is optimized for minimal CPU and memory usage
Thread Safety: All operations are thread-safe for concurrent environments without lock contention
Memory Efficiency: Uses packed representation and custom implementations for efficient storage
Simplicity: Provides simple interfaces for common rate limiting patterns
Flexibility: Multiple strategies to handle different rate limiting requirements

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

License

This project is licensed under the MIT License.

Documentation ¶

Overview ¶

Package rate provides rate limiting primitives for controlling request flow, preventing service overload, and implementing backoff strategies. It includes several rate limiters like token buckets and AIMD (Additive Increase Multiplicative Decrease) algorithms with thread-safe operations.

Index ¶

type AIMDTokenBucketLimiter
- func NewAIMDTokenBucketLimiter(numBuckets uint, burstCapacity uint8, rateMin float64, rateMax float64, ...) (*AIMDTokenBucketLimiter, error)
type TokenBucketLimiter
- func NewTokenBucketLimiter(numBuckets uint, burstCapacity uint8, refillRate float64, ...) (*TokenBucketLimiter, error)
- func (t *TokenBucketLimiter) Check(id []byte) bool
- func (t *TokenBucketLimiter) TakeToken(id []byte) bool

Constants ¶

This section is empty.

Variables ¶

This section is empty.

Functions ¶

This section is empty.

Types ¶

type AIMDTokenBucketLimiter ¶

type AIMDTokenBucketLimiter struct {
	// contains filtered or unexported fields
}

AIMDTokenBucketLimiter wraps a TokenBucketLimiter to implement Additive Increase Multiplicative Decrease (AIMD) rate limiting. This provides a dynamic rate limiting strategy that gradually increases token rates during successful operations and quickly reduces rates when encountering failures or congestion. This is similar to the congestion control algorithm used in TCP.

func NewAIMDTokenBucketLimiter ¶

func NewAIMDTokenBucketLimiter(
	numBuckets uint,
	burstCapacity uint8,
	rateMin float64,
	rateMax float64,
	rateInit float64,
	rateAdditiveIncrease float64,
	rateMultiplicativeDecrease float64,
	rateUnit time.Duration,
) (*AIMDTokenBucketLimiter, error)

NewAIMDTokenBucketLimiter creates a new AIMD token bucket limiter with the given parameters:

numBuckets: number of token buckets (automatically rounded up to the nearest power of two if not already a power of two, for efficient hashing)
burstCapacity: max number of tokens that can be consumed at once
rateMin: minimum token refill rate
rateMax: maximum token refill rate
rateInit: initial token refill rate
rateAdditiveIncrease: amount to increase rate by on success
rateMultiplicativeDecrease: factor to decrease rate by on failure
rateUnit: time unit for rate calculations (e.g., time.Second)

All rates are expressed in tokens per rateUnit.

func (*AIMDTokenBucketLimiter) Check ¶

func (a *AIMDTokenBucketLimiter) Check(id []byte) bool

Check returns whether a token would be available for the given ID without actually taking it. This is useful for preemptively checking if an operation would be rate limited before attempting it. Returns true if a token would be available, false otherwise. This method is thread-safe and can be called concurrently from multiple goroutines.

func (*AIMDTokenBucketLimiter) DecreaseRate ¶

func (a *AIMDTokenBucketLimiter) DecreaseRate(id []byte)

DecreaseRate multiplicatively decreases the rate for the bucket associated with the given ID. This implements the "multiplicative decrease" part of the AIMD algorithm, typically called when congestion or failures occur. The rate is decreased by dividing by rateMD, but will not go below the minimum rate (rateMin). This method is thread-safe and uses atomic operations to ensure consistency.

func (*AIMDTokenBucketLimiter) IncreaseRate ¶

func (a *AIMDTokenBucketLimiter) IncreaseRate(id []byte)

IncreaseRate additively increases the rate for the bucket associated with the given ID. This implements the "additive increase" part of the AIMD algorithm, typically called on successful operations. The rate is increased by rateAI up to the maximum rate (rateMax). This method is thread-safe and uses atomic operations to ensure consistency.

func (*AIMDTokenBucketLimiter) TakeToken ¶

func (a *AIMDTokenBucketLimiter) TakeToken(id []byte) bool

TakeToken attempts to take a token for the given ID from the appropriate bucket. It returns true if a token was successfully taken, false otherwise. This method is thread-safe and can be called concurrently from multiple goroutines.

type TokenBucketLimiter ¶

type TokenBucketLimiter struct {
	// contains filtered or unexported fields
}

TokenBucketLimiter implements the token bucket algorithm for rate limiting. It maintains multiple buckets to distribute load and reduce contention. Each bucket has a fixed capacity and refills at a specified rate.

func NewTokenBucketLimiter ¶

func NewTokenBucketLimiter(
	numBuckets uint,
	burstCapacity uint8,
	refillRate float64,
	refillRateUnit time.Duration,
) (*TokenBucketLimiter, error)

NewTokenBucketLimiter creates a new token bucket rate limiter with the specified parameters:

numBuckets: number of token buckets (automatically rounded up to the nearest power of two if not already a power of two, for efficient hashing)
burstCapacity: maximum number of tokens that can be consumed at once
refillRate: rate at which tokens are refilled
refillRateUnit: time unit for refill rate calculations (e.g., time.Second)

Returns a new TokenBucketLimiter instance and any error that occurred during creation. The numBuckets parameter is automatically rounded up to the nearest power of two if not already a power of two, for efficient hashing.

func (*TokenBucketLimiter) Check ¶

func (t *TokenBucketLimiter) Check(id []byte) bool

Check returns whether a token would be available for the given ID without actually taking it. This is useful for preemptively checking if an operation would be rate limited before attempting it. Returns true if a token would be available, false otherwise.

func (*TokenBucketLimiter) TakeToken ¶

func (t *TokenBucketLimiter) TakeToken(id []byte) bool

TakeToken attempts to take a token for the given ID. It returns true if a token was successfully taken, false if the operation should be rate limited. This method is thread-safe and can be called concurrently from multiple goroutines.

Source Files ¶

View all Source files

Directories ¶

Path	Synopsis
time56 Package time56 provides a specialized 56-bit timestamp implementation for space-efficient, high-resolution timing within a limited time range.	Package time56 provides a specialized 56-bit timestamp implementation for space-efficient, high-resolution timing within a limited time range.

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL

README ¶

Rate

Features

Installation

Quick Start

Token Bucket Rate Limiter

AIMD Rate Limiter

Detailed Usage

TokenBucketLimiter

Parameters:

AIMDTokenBucketLimiter

Parameters:

Performance

Benchmarks

Advanced Usage

Choosing the Optimal Number of Buckets

How Bucket Selection Works

Understanding Collision Probability

Guidelines for Choosing numBuckets

Custom Rate Limiting Scenarios

High-Volume API Rate Limiting

Adaptive Backend Protection with AIMD

Design Philosophy

Contributing

License

Documentation ¶

Overview ¶

Index ¶

Constants ¶

Variables ¶

Functions ¶

Types ¶

type AIMDTokenBucketLimiter ¶

func NewAIMDTokenBucketLimiter ¶

func (*AIMDTokenBucketLimiter) Check ¶

func (*AIMDTokenBucketLimiter) DecreaseRate ¶

func (*AIMDTokenBucketLimiter) IncreaseRate ¶

func (*AIMDTokenBucketLimiter) TakeToken ¶

type TokenBucketLimiter ¶

func NewTokenBucketLimiter ¶

func (*TokenBucketLimiter) Check ¶

func (*TokenBucketLimiter) TakeToken ¶

Source Files ¶

Directories ¶

Guidelines for Choosing `numBuckets`