HVAC AI Agents ships a production-grade MCP server that exposes your BMS, VRF fleet, or thermostat network to Claude, ChatGPT, or any MCP-aware agent. Every read is sub-second. Every write is audit-logged and gated behind configurable safety interlocks.
The Model Context Protocol (MCP) is the emerging standard for connecting LLMs to external systems with structured, tool-callable interfaces. Instead of writing custom function-call glue for each AI provider, you expose your HVAC data once via an MCP server and every MCP-aware client — Claude Desktop, the Anthropic agent SDK, Cursor, a custom GPT-4o pipeline — picks it up without rework. Our server handles the protocol translation so your team writes HVAC logic, not boilerplate.
Read calls return live BMS points — zone temperatures, damper positions, chilled water valve status, fault codes — in a normalized schema regardless of whether the underlying equipment speaks BACnet, Modbus, or a proprietary brand cloud. Response time for a read call is under one second at the 95th percentile. The LLM sees a coherent, brand-agnostic building model rather than a patchwork of vendor APIs.
Write-back is the hard part, and it's where most LLM-BMS integrations fail. An LLM that can hallucinate a setpoint change that trips a safety interlock is not a deployable system. Our MCP server gates every write through a configurable interlock layer: point whitelists, value range constraints, rate-of-change limits, and time-of-day windows. Every LLM-initiated write is logged with the originating prompt, the LLM's response, the proposed value, the interlock check result, and the actual BMS write. If a write is blocked or rolled back, the audit log shows exactly why.
Authentication and tenant scoping are first-class. Each MCP client credential is bound to one or more site scopes — a credential issued for Site A cannot read or write Site B points. Multi-tenant SaaS builders can issue per-customer credentials with per-customer point whitelists. Self-hosted deployments can run the MCP server as a Docker container on the customer's LAN, with brand cloud traffic staying inside the perimeter.