jer/kevo

Fork 0

Jeremy Tregunna ee23a47a74

docs: added idea, plan, and todo docs

2025-04-19 14:06:53 -06:00

6.6 KiB

Raw Blame History

Implementation Plan for Go Storage Engine

Architecture Overview

┌─────────────┐     ┌─────────────┐     ┌─────────────────────────┐
│ Client API  │────▶│  MemTable   │────▶│ Immutable SSTable Files │
└─────────────┘     └─────────────┘     └─────────────────────────┘
       │                   ▲                         ▲
       │                   │                         │
       ▼                   │                         │
┌─────────────┐            │            ┌─────────────────────────┐
│  Write-     │────────────┘            │ Background Compaction   │
│  Ahead Log  │                         │ Process                 │
└─────────────┘                         └─────────────────────────┘
       │                                            │
       │                                            │
       ▼                                            ▼
┌─────────────────────────────────────────────────────────────────┐
│                       Persistent Storage                         │
└─────────────────────────────────────────────────────────────────┘

Package Structure

go-storage/
├── cmd/
│   └── storage-bench/       # Benchmarking tool
│
├── pkg/
│   ├── config/              # Configuration and manifest
│   ├── wal/                 # Write-ahead logging with transaction markers
│   ├── memtable/            # In-memory table implementation
│   ├── sstable/             # SSTable read/write
│   │   ├── block/           # Block format implementation
│   │   └── footer/          # File footer and metadata
│   ├── compaction/          # Compaction strategies
│   ├── iterator/            # Merged iterator implementation
│   ├── transaction/         # Transaction management with Snapshot + WAL
│   │   ├── snapshot/        # Read snapshot implementation
│   │   └── txbuffer/        # Transaction write buffer
│   └── engine/              # Main engine implementation with single-writer architecture
│
└── internal/
    ├── checksum/            # Checksum utilities (xxHash64)
    └── utils/               # Shared internal utilities

Development Phases

Phase A: Foundation (1-2 weeks)

Set up project structure and Go module
Implement config package with serialization/deserialization
Build basic WAL with:
- Append operations (Put/Delete)
- Replay functionality
- Configurable fsync modes
Write comprehensive tests for WAL durability

Phase B: In-Memory Layer (1 week)

Implement MemTable with:
- Skip list data structure
- Sorted key iteration
- Size tracking for flush threshold
Connect WAL replay to MemTable restore
Test concurrent read/write scenarios

Phase C: Persistent Storage (2 weeks)

Design and implement SSTable format:
- Block-based layout with restart points
- Checksummed blocks
- Index and metadata in footer
Build SSTable writer:
- Convert MemTable to blocks
- Generate sparse index
- Write footer with checksums
Implement SSTable reader:
- Block loading and validation
- Binary search through index
- Iterator interface

Phase D: Basic Engine Integration (1 week)

Implement Level 0 flush mechanism:
- MemTable to SSTable conversion
- File management and naming
Create read path that merges:
- Current MemTable
- Immutable MemTables awaiting flush
- Level 0 SSTable files

Phase E: Compaction (2 weeks)

Implement a single, efficient compaction strategy:
- Simple tiered compaction approach
Handle tombstones and key deletion
Manage file obsolescence and cleanup
Build background compaction scheduling

Phase F: Basic Atomicity and Advanced Features (2-3 weeks)

Implement merged iterator across all levels
Add snapshot capability for reads:
- Point-in-time view of the database
- Consistent reads across MemTable and SSTables
Implement simple atomic batch operations:
- Support atomic multi-key writes
- Ensure proper crash recovery for batch operations
- Design interfaces that can be extended for full transactions
Add basic statistics and metrics

Phase G: Optimization and Benchmarking (1 week)

Develop benchmark suite for:
- Random vs sequential writes
- Point reads vs range scans
- Compaction overhead and pauses
Optimize critical paths based on profiling
Tune default configuration parameters

Phase H: Optional Enhancements (as needed)

Add Bloom filters to reduce disk reads
Create monitoring hooks and detailed metrics
Add crash recovery testing

Testing Strategy

Unit Tests: Each component thoroughly tested in isolation
Integration Tests: End-to-end tests for complete workflows
Property Tests: Generate randomized operations and verify correctness
Crash Tests: Simulate crashes and verify recovery
Benchmarks: Measure performance across different workloads

Implementation Notes

Error Handling

Use descriptive error types and wrap errors with context
Implement recovery mechanisms for all critical operations
Validate checksums at every read opportunity

Concurrency

Implement single-writer architecture for the main write path
Allow concurrent readers (snapshots) to proceed without blocking
Use appropriate synchronization for reader-writer coordination
Ensure proper isolation between transactions

Batch Operation Management

Use WAL for atomic batch operation durability
Leverage LSM's natural versioning for snapshots
Provide simple interfaces that can be built upon for transactions
Ensure proper crash recovery for batch operations

Go Idioms

Follow standard Go project layout
Use interfaces for component boundaries
Rely on Go's GC but manage large memory allocations carefully
Use context for cancellation where appropriate

6.6 KiB Raw Blame History

Implementation Plan for Go Storage Engine

Architecture Overview

Package Structure

Development Phases

Phase A: Foundation (1-2 weeks)

Phase B: In-Memory Layer (1 week)

Phase C: Persistent Storage (2 weeks)

Phase D: Basic Engine Integration (1 week)

Phase E: Compaction (2 weeks)

Phase F: Basic Atomicity and Advanced Features (2-3 weeks)

Phase G: Optimization and Benchmarking (1 week)

Phase H: Optional Enhancements (as needed)

Testing Strategy

Implementation Notes

Error Handling

Concurrency

Batch Operation Management

Go Idioms

6.6 KiB

Raw Blame History