ClickHouse: The Real-Time Analytics Engine and Where WorkingAgents Fits

By James Aspinwall, co-written by Alfred Pennyworth (my trusted AI) — March 7, 2026, 07:22


ClickHouse just raised $400 million at a $15 billion valuation — a 2.5x jump from $6.35 billion nine months earlier. Dragoneer led, with Bessemer, GIC, Index Ventures, Khosla, Lightspeed, and T. Rowe Price participating. ARR on ClickHouse Cloud is growing 250%+ year-over-year across 3,000+ customers. They acquired Langfuse — the leading open-source LLM observability platform (20,000 GitHub stars, 26M+ monthly SDK installs, used by 63 Fortune 500 companies). And they launched an MCP server.

That last point is why WorkingAgents should pay attention.

ClickHouse is not a database. It is a real-time analytical engine that processes billions of rows in milliseconds — and it just built a native interface for AI agents to query it. The convergence of real-time analytics, agent-facing data access, and LLM observability creates a partnership surface with WorkingAgents that neither company could build alone.

What ClickHouse Does

ClickHouse is an open-source, column-oriented OLAP database built for speed. Where traditional row-oriented databases (Postgres, MySQL) store data horizontally — one row at a time — ClickHouse stores data vertically, one column at a time. For analytical queries that scan specific columns across millions or billions of rows, this architecture is 100x faster than row-oriented alternatives.

The numbers are not marketing:

Metric ClickHouse
Query speed Milliseconds at petabyte scale
GitHub stars 46,200+
Contributors 2,800+
Pull requests 70,400+
Releases 746+
Cloud customers 3,000+
ARR growth 250%+ YoY
Valuation $15B
Total funding ~$1B+

Enterprise customers include Meta, eBay, Microsoft, Spotify, Lyft, HubSpot, Cisco, GitLab, Deutsche Bank, IBM, Sony, Tesla, Capital One, Anthropic, and Cursor. Recent AI-focused adopters include Sierra, Poolside, Weights & Biases, LangChain, Lovable, and Decagon.

The Product Suite

ClickHouse Cloud — Fully managed, serverless, auto-scaling. Available on AWS, GCP, and Azure. Three tiers:

Tier Starting Price Features
Basic $50/month Serverless, scales to zero
Scale Custom Dedicated clusters, isolated hardware
Enterprise Custom BYOC, data residency, advanced compliance

ClickHouse Open Source — Self-hosted, free. The same engine that powers the cloud offering. 46,000+ stars on GitHub.

ClickHouse Local — Query CSV, TSV, Parquet files directly without a server. Zero setup.

Postgres Managed by ClickHouse — Native CDC (change data capture) piping Postgres transactions into ClickHouse for up to 100x faster analytics. Unified transactional + analytical stack.

ClickStack — Open-source observability platform. Logs, metrics, and traces stored in ClickHouse.

Langfuse Cloud — LLM observability. Every LLM call traced — cost tracking, quality evaluation, prompt versioning. Langfuse runs on ClickHouse under the hood.

Use Cases

  1. Real-time dashboards — Instant analytics over billions of rows for user-facing products
  2. Observability — Log, metric, and trace storage at massive scale (ClickStack, Langfuse)
  3. Data warehousing — Cost-efficient analytical processing at petabyte scale
  4. ML and GenAI — Vector search, training dataset aggregation, LLM observability
  5. Agent-facing analytics — AI agents querying databases autonomously via MCP

The Agentic Data Stack

This is where ClickHouse’s vision intersects directly with WorkingAgents.

In January 2026, ClickHouse published “The Agentic Data Stack” — a reference architecture for connecting AI agents directly to data. The thesis: traditional analytics pipelines (user → ticket → analyst → dashboard → answer, taking days or weeks) are being replaced by agent-facing systems where AI autonomously discovers, queries, and analyzes data in seconds.

The architecture has three layers:

Chat Layer (LibreChat) — ChatGPT-style interface supporting multiple LLM providers, MCP server connections, inline charts and tables, and code execution.

Data Layer (ClickHouse + MCP Server) — The MCP server exposes three tools to AI agents: list databases, list tables, and execute read-only SELECT queries. Agents iteratively explore schemas and run analytical queries at sub-second speed across billions of rows.

Observability Layer (Langfuse) — Full LLM tracing capturing every call, enabling cost tracking, quality evaluation, and prompt versioning.

Real-world adoption validates the pattern:

The MCP Server

ClickHouse’s remote MCP server is now in public beta for ClickHouse Cloud. It exposes:

Security model: OAuth-based authentication, read-only access only, fully managed on ClickHouse Cloud. The self-managed MCP server (PyPI package) has 220,000+ downloads.

In a demo, Claude conducted 10 sequential queries to analyze dot-com bubble impacts on tech stocks — exploring schemas, running aggregations, detecting patterns — all within seconds, without human intervention between queries.

Why This Matters for Agents

Christian Jensen (Dragoneer partner and ClickHouse board member) put it directly: “As models become more capable, the bottleneck moves to data infrastructure.”

AI agents generating rapid-fire analytical queries need:

ClickHouse’s columnar architecture delivers this. Traditional databases do not.

The Synergy Map

WorkingAgents and ClickHouse serve fundamentally different functions in the AI stack. ClickHouse is the analytical engine — it answers questions about data. WorkingAgents is the operational engine — it schedules actions, manages state, controls permissions, and ensures things get done. Together, they create a complete agent-facing infrastructure.

1. ClickHouse as the Analytics Layer for WorkingAgents

WorkingAgents generates operational data that needs analytical querying:

WorkingAgents currently stores this data in per-user SQLite databases — excellent for isolation and operational queries, but not designed for cross-user analytics at scale. ClickHouse is the analytical complement: pipe operational events from WorkingAgents into ClickHouse, and suddenly you have millisecond dashboards over the entire platform’s activity.

The integration pattern:

WorkingAgents (operational data) → CDC/batch export → ClickHouse (analytics)
                                                          ↓
                                                    Real-time dashboards
                                                    Usage reports
                                                    Anomaly detection
                                                    Billing metering

2. WorkingAgents as the Action Layer for ClickHouse Agents

ClickHouse’s agentic data stack answers questions. WorkingAgents takes action on the answers.

An agent queries ClickHouse: “Show me all customers with declining engagement over the last 30 days.” ClickHouse returns the list in milliseconds. Now what?

ClickHouse tells you what is happening. WorkingAgents decides what to do about it.

This is the missing layer in ClickHouse’s agentic data stack. Their reference architecture has chat (LibreChat), data (ClickHouse), and observability (Langfuse) — but no operational orchestration. No scheduling. No persistent task management. No escalation chains. No access-controlled tool execution. WorkingAgents is the fourth pillar.

3. MCP Server to MCP Server

Both ClickHouse and WorkingAgents expose MCP servers. An AI agent connected to both can:

  1. Query ClickHouse: “What products had the highest return rate last month?”
  2. Get the answer in milliseconds
  3. Call WorkingAgents: “Create a task to review the top 5 products with highest returns”
  4. WorkingAgents creates the task, assigns it, schedules a follow-up
  5. Query ClickHouse again: “What were the return reasons for product X?”
  6. Call WorkingAgents: “Send a push notification to the product manager with this analysis”

Two MCP servers, one agent, seamless data-to-action flow. The agent thinks with ClickHouse and acts with WorkingAgents.

4. Langfuse + WorkingAgents Observability

ClickHouse acquired Langfuse for LLM observability — tracing every LLM call, tracking costs, evaluating quality. WorkingAgents runs LLM-powered agent sessions (ServerChat) that generate exactly the kind of telemetry Langfuse is built to observe.

Integrating Langfuse into WorkingAgents’ chat module would provide:

Langfuse is already used by 19 Fortune 50 and 63 Fortune 500 companies. It runs on ClickHouse. The integration path is clear: WorkingAgents sends LLM traces to Langfuse, Langfuse stores them in ClickHouse, dashboards display operational AI health in real time.

5. Real-Time Monitoring Integration

WorkingAgents has a Monitor module that tracks system health. ClickHouse is built for exactly this kind of high-frequency time-series data:

WorkingAgents’ Monitor currently stores results in SQLite. For a single-user system, this works. For an enterprise deployment with hundreds of monitored endpoints, ClickHouse’s columnar engine provides the analytical horsepower SQLite cannot.

6. Enterprise Customer Overlap

ClickHouse’s customer list overlaps significantly with companies that need operational AI orchestration:

Every ClickHouse customer building AI agents is a potential WorkingAgents customer. The pitch: “You have the analytics. Here is the orchestration.”

The Agentic Data Stack — Extended

ClickHouse’s reference architecture with WorkingAgents as the fourth layer:

┌─────────────────────────────────────────────────┐
│  Chat Layer (LibreChat / Custom UI)             │
│  Natural language → agent reasoning             │
├─────────────────────────────────────────────────┤
│  Data Layer (ClickHouse + MCP Server)           │
│  Analytical queries at sub-second speed         │
├─────────────────────────────────────────────────┤
│  Action Layer (WorkingAgents + MCP Server)  ◄── NEW
│  Scheduling, tasks, CRM, notifications,         │
│  access control, persistent state               │
├─────────────────────────────────────────────────┤
│  Observability (Langfuse on ClickHouse)         │
│  LLM tracing, cost tracking, quality eval       │
└─────────────────────────────────────────────────┘

The data layer tells the agent what is true. The action layer tells the agent what to do. The observability layer tells you whether the agent did it well. The chat layer is the human interface. All four connected via MCP.

The Partnership Path

Phase 1: ClickHouse as Analytics Backend

Pipe WorkingAgents operational events (alarm firings, task completions, tool calls, permission checks) into ClickHouse. Build real-time dashboards showing platform health, usage patterns, and cost metrics. This gives WorkingAgents enterprise clients the analytical visibility they expect.

Phase 2: Dual MCP Integration

Document and publish the pattern: one agent, two MCP servers — ClickHouse for data, WorkingAgents for actions. Build a reference demo showing a complete data-to-action workflow. This is the most compelling partnership demo at any AI conference.

Phase 3: Langfuse Integration

Add Langfuse tracing to WorkingAgents’ ServerChat module. Every LLM call, every tool invocation, every token — traced and stored in ClickHouse. Enterprise clients get full AI observability without additional infrastructure.

Phase 4: Joint Reference Architecture

Extend ClickHouse’s published “Agentic Data Stack” to include WorkingAgents as the action layer. Co-publish the architecture with deployment guides. Position the combined stack as the complete open infrastructure for enterprise AI agents.

The Numbers

ClickHouse Value
Valuation $15B
Series D $400M (Jan 2026)
Total funding ~$1B+
Cloud ARR growth 250%+ YoY
Cloud customers 3,000+
GitHub stars 46,200+
Contributors 2,800+
Langfuse SDK installs 26M+/month
Fortune 500 on Langfuse 63
Key investors Dragoneer, Bessemer, GIC, Index, Khosla, Lightspeed, T. Rowe Price
Open source Yes (Apache 2.0)
Query speed Milliseconds at petabyte scale

The Bottom Line

ClickHouse is where data goes to be understood fast. WorkingAgents is where decisions go to be executed reliably. ClickHouse answers “what is happening” in milliseconds across billions of rows. WorkingAgents answers “what should we do about it” with scheduled actions, persistent state, and crash-recoverable workflows.

The agentic data stack needs both. An agent that can query a database but cannot schedule a follow-up is half a solution. An agent that can schedule actions but cannot analyze data is the other half. Together — ClickHouse for the analytical brain, WorkingAgents for the operational hands — you get a complete autonomous system.

ClickHouse already has the MCP server. WorkingAgents already has the MCP server. The integration is two configuration lines in an agent’s tool list. The technical barrier is near zero. The business case — analytics plus orchestration for every enterprise AI deployment — is the entire market.

Sources: