Mistral Reference Architecture

v20260311

mistral-reference-architecture

Guides setting up production-ready Mistral AI integrations with layered layout, a singleton client, Zod-based config, custom errors, caching-aware service layer, health endpoints, and prompt templates.

Mistral Architecture Best Practices TypeScript Error Handling Service Layer Configuration Health Checks

Get Skill

163 downloads

Overview

Mistral AI Reference Architecture

Overview
Prerequisites
Instructions
Output
Error Handling
Examples
Resources

Overview

Production-ready architecture patterns for Mistral AI integrations. Covers layered project structure, client singleton, configuration management, error handling, service layer with caching, health checks, and prompt templates.

Prerequisites

Understanding of layered architecture
Mistral AI SDK knowledge
TypeScript project setup
Testing framework configured

Layer Architecture

API Layer (Routes, Controllers, Middleware)
    ↓
Service Layer (Business Logic, Orchestration)
    ↓
Mistral Layer (Client, Prompts, Error Handling)
    ↓
Infrastructure Layer (Cache, Queue, Monitoring)

Instructions

Step 1: Create Directory Structure

Organize into src/mistral/ (client, config, types, errors, prompts, handlers), src/services/ai/ (chat, rag, cache), src/api/ (routes, streaming), tests/ (unit, integration), and config/ (per-environment JSON).

Step 2: Implement Client Singleton

Create getMistralClient() factory that lazily initializes a single Mistral instance with config from environment. Include resetMistralClient() for testing.

Step 3: Configure with Zod Validation

Define config schema with z.object() for apiKey, model (default mistral-small-latest), timeout (30s), maxRetries (3), and cache settings. Parse from environment variables.

Step 4: Build Error Handling Layer

Create MistralServiceError class with code, status, retryable flag. Wrap raw errors: 429 = RATE_LIMIT (retryable), 401 = AUTH_ERROR, 500+ = SERVICE_ERROR (retryable).

Step 5: Implement Service Layer

Build ChatService with complete() and stream() methods. Add cache check for deterministic requests (temperature=0). Wrap all calls with retry and error handling.

Step 6: Add Health Check

Implement checkMistralHealth() that calls client.models.list() and returns status (healthy/degraded/unhealthy) with latency measurement.

Step 7: Create Prompt Templates

Define reusable PromptTemplate interface with system prompt and user template function. Include templates for summarize, classify, and codeReview.

Output

Structured project layout with clear layer separation
Client singleton wrapper with retry logic
Zod-validated configuration management
Custom error classes with retryable detection
Service layer with caching and streaming
Health check endpoint
Reusable prompt templates

Error Handling

Issue	Cause	Resolution
Rate limit (429)	Too many requests	Retry with backoff, marked retryable
Auth error (401)	Invalid API key	Check credentials, not retryable
Server error (5xx)	Mistral service issue	Retry with backoff, marked retryable
Config validation	Missing/invalid env vars	Check Zod schema errors

Examples

Client Usage

const client = getMistralClient();
const response = await client.chat.complete({
  model: 'mistral-small-latest',
  messages: [{ role: 'user', content: 'Hello' }],
});

See detailed implementation for advanced patterns.

Resources

Info

Category Development

Name mistral-reference-architecture

Version v20260311

Size 3.41KB

Source jeremylongshore/claude-code-plugins-plus-skills

Updated At 2026-03-12

Mistral Reference Architecture

Mistral AI Reference Architecture

Table of Contents

Overview

Prerequisites

Layer Architecture

Instructions

Step 1: Create Directory Structure

Step 2: Implement Client Singleton

Step 3: Configure with Zod Validation

Step 4: Build Error Handling Layer

Step 5: Implement Service Layer

Step 6: Add Health Check

Step 7: Create Prompt Templates

Output

Error Handling

Examples

Client Usage

Resources