JSON Message Store | Jazzmine Core

1. Behavior and Context

In the jazzmine architecture, the JSON Message Store serves as the "Zero-Configuration" storage engine.

In-Memory Performance: Upon initialization, the store reads the entire database into memory and builds optimized dictionaries (indexes) for instant lookup by ID. All "Read" operations are served directly from RAM.
Atomic Persistence: To prevent data loss, the store uses an atomic "write-then-rename" strategy. When data is flushed, it writes to a temporary file and performs an operating system-level rename to overwrite the original file.
Concurrency Control: It utilizes an internal asyncio.Lock to ensure that concurrent turns within the same process do not cause data corruption or race conditions.
Reconstructive Indexing: Every time the store is initialized, it rebuilds its secondary indexes (such as the map between messages and traces) from the raw data arrays to ensure absolute consistency.

2. Purpose

Local Development: Enabling developers to start building agents immediately without installing database infrastructure.
Human Inspectability: Allowing developers to open the database file in any text editor to audit agent reasoning or debug message history.
Portability: Providing a single-file database that can be easily moved, shared, or checked into version control for reproducible tests.
Low-Latency Mocking: Serving as a high-speed storage mock for continuous integration (CI) pipelines.

3. High-Level API (Configuration)

The JSON Message Store is initialized through the MessageStore.initialize factory by specifying the json storage type.

Example: Initializing the JSON Store

python

from jazzmine.core.message_store import MessageStore

config = {
    "storage": "json",
    "path": "data/conversation_db.json"
}

# The store loads the file at 'path' into memory immediately
store = await MessageStore.initialize(config)

4. Detailed Functionality

_init() [Internal]

Functionality: Loads the database from disk and populates the in-memory indexes.

How it works:

Checks if the file at the configured path exists.
If it exists, it reads the JSON and reconstructs the _messages, _conversations, and _traces dictionaries.
It builds a secondary index, _trace_by_msg, which allows for O(1) lookup of an execution trace given an assistant message ID.

get_immediate_context(conversation_id, n)

Functionality: Retrieves the most recent messages for the LLM prompt directly from RAM.

How it works:

Filters the in-memory message dictionary for records matching the conversation_id.
Excludes any messages where the is_flagged attribute is True.
Sorts the subset by timestamp.
Returns the final n messages. Because the data is already in memory, this operation is near-instant.

_flush() [Internal]

Functionality: Synchronizes the in-memory state with the physical file on disk.

How it works:

Collects all dictionaries back into a single JSON-serializable object.
Writes the JSON string to a file named {path}.tmp.
Calls os.replace to atomically move the temporary file to the final destination. This ensures that even if the power fails during a write, the original database remains uncorrupted.

delete_conversation(conversation_id)

Functionality: Performs a cascading deletion of a session and all associated history.

How it works:

Removes the record from the _conversations index.
Iterates through the _messages index and identifies all IDs belonging to that conversation.
Purges those messages and their corresponding records in the _traces and _trace_by_msg indexes.
Triggers an immediate _flush() to commit the deletion to disk.

5. Error Handling

File Permissions: If the configured path is not writable, the _flush operation will raise an OSError.
Malformed Data: If the JSON file is manually edited and becomes invalid, the store will raise a JSONDecodeError during _init, preventing the agent from starting with corrupted data.
Pydantic Integration: When retrieving records, data is validated through model_validate. If the file contains fields that are incompatible with the current framework version, a validation error is raised.

6. Remarks

Scalability Limit: Because the entire database is loaded into memory and rewritten to disk on every change, this store is not suitable for production applications with high volumes of data (e.g., thousands of long conversations).
Process Safety: This store is not safe for use by multiple separate OS processes simultaneously. It should only be accessed by a single agent instance at a time.
Cleanup: To ensure the final state is captured, always call await store.close() which triggers a final flush and releases the internal lock.