Architecture

This document describes Dalva's internal architecture.

System Overview

Dalva is a full-stack application with:

Backend: FastAPI + SQLAlchemy + DuckDB
Frontend: React + TypeScript + Vite
Database: DuckDB (SQLite-like, file-based)

graph TB
    subgraph SDK["Python SDK (dalva package)"]
        Run[Run Class]
    end

    Run -->|HTTP POST| API[REST API]

    subgraph Frontend["Frontend (React)"]
        P[Projects Page]
        R[Runs Page]
        M[Metrics Charts]
        C[Compare Runs Page]
    end

    Frontend --> Q[React Query Cache]
    Q --> API

    subgraph Backend["Backend (FastAPI)"]
        Routes[API Routes]
        LF[Logger Functions]
    end

    API --> Routes
    Routes --> LF

    LF --> DB[(DuckDB)]

    DB --> Projects[projects]
    DB --> Runs[runs]
    DB --> Metrics[metrics]
    DB --> Configs[configs]

Backend Architecture

Key Design Decisions

1. Short-Lived Sessions (DuckDB Compatibility)

DuckDB allows one writer per file across OS processes. The old design held sessions open during training, blocking the web server.

Solution: Every logger function opens a fresh session, writes, commits, and closes immediately:

def log_metrics(run_id, metrics, step=None):
    with session_scope() as db:  # Opens session
        for metric_path, value in metrics.items():
            db.add(Metric(...))
    # Session automatically closed here

2. EAV Model for Metrics

The Metric table uses an Entity-Attribute-Value model for flexibility:

CREATE TABLE metrics (
    id INTEGER PRIMARY KEY,
    run_id INTEGER REFERENCES runs(id),
    attribute_path TEXT,      -- e.g., "train/loss"
    attribute_type TEXT,     -- e.g., "float_series"
    step INTEGER,            -- NULL for summary, int for series
    float_value REAL,
    int_value INTEGER,
    string_value TEXT,
    bool_value BOOLEAN
);

This allows logging arbitrary metrics without schema changes.

3. Series vs Scalar Types via Step

The step parameter determines metric type:

Step Value	Type Suffix	Example
`None`	(none)	`float`, `int`, `string`, `bool`
`0, 1, 2, ...`	`_series`	`float_series`, `int_series`, etc.

This is enforced at write time - attempting to write a different type for the same metric key raises an error.

Database Schema

erDiagram
    projects {
        int id PK
        string name
        string project_id
        datetime created_at
        datetime updated_at
    }

    runs {
        int id PK
        int project_id FK
        string run_id
        string name
        string state
        datetime created_at
        datetime updated_at
    }

    metrics {
        int id PK
        int run_id FK
        string attribute_path
        string attribute_type
        int step
        float float_value
        int int_value
        string string_value
        bool bool_value
    }

    configs {
        int id PK
        int run_id FK
        string key
        string value
    }

    projects ||--o{ runs : "has"
    runs ||--o{ metrics : "logs"
    runs ||--o{ configs : "has"

Frontend Architecture

Data Flow

sequenceDiagram
    User Action->>React Component: Click/Interact
    React Component->>React Query Hook: API call
    React Query Hook->>Backend: HTTP Request
    Backend->>Database: Query
    Database-->>Backend: Result
    Backend-->>React Query Hook: JSON Response
    React Query Hook-->>React Component: Data update
    React Component-->>User: Rendered UI

React Query Configuration

const queryClient = new QueryClient({
  defaultOptions: {
    queries: {
      staleTime: 30_000,      // 30 seconds
      refetchOnWindowFocus: false,
    },
  },
});

Chart Rendering Logic

The MetricViewer component decides how to render a metric based on its type:

const isSeries = attributeType?.endsWith('_series') ?? false;

if (isSeries) {
  // Render interactive chart with Plotly
  return <MetricChart data={values} hasSteps={hasSteps} />;
} else {
  // Render single value card
  return <ValueCard value={values[0].value} />;
}