OpenClaw Release Notes: Latest Features & Fixes
Empowering the Next Generation of AI Development
Welcome to the latest and most comprehensive update for OpenClaw! As the landscape of artificial intelligence continues its rapid, exhilarating evolution, so too does our commitment to providing developers, researchers, and enterprises with the most powerful, flexible, and efficient tools available. This release represents a monumental leap forward, consolidating months of dedicated engineering, meticulous user feedback integration, and a relentless pursuit of innovation. We understand that in the fast-paced world of AI, staying ahead means not just adapting, but actively shaping the future. With this philosophy at our core, we're thrilled to unveil a suite of enhancements designed to streamline your workflows, unlock unprecedented capabilities, and fundamentally transform how you interact with intelligent systems.
This update isn't merely a collection of minor tweaks; it’s a re-imagination of what's possible. We've focused on three foundational pillars: establishing OpenClaw as a leading Unified API platform, dramatically expanding our Multi-model support, and introducing groundbreaking features for Cost optimization. Each of these areas has received significant attention, resulting in robust, user-centric solutions that address the most pressing challenges faced by AI practitioners today. From simplifying complex integrations to ensuring your projects remain economically viable at scale, OpenClaw is evolving to be the indispensable backbone of your AI initiatives. Prepare to explore a more intelligent, agile, and powerful development experience.
I. OpenClaw's Evolution Towards a Unified API Platform: Simplifying Complexity
The explosion of large language models (LLMs) and specialized AI services has undoubtedly opened up a universe of possibilities. However, it has also introduced a significant challenge: fragmentation. Developers often find themselves juggling multiple APIs, each with its own authentication mechanisms, data formats, rate limits, and idiosyncratic behaviors. This creates a labyrinth of integration complexities, consumes valuable development time, and frequently leads to brittle, difficult-to-maintain applications. Recognizing this critical pain point, OpenClaw has made an emphatic commitment to delivering a robust Unified API platform, and this release marks a significant milestone in that journey.
Our vision for a Unified API is not just about abstracting away endpoint differences; it's about creating a harmonious ecosystem where any AI model, regardless of its provider or underlying architecture, can be accessed, managed, and utilized through a single, coherent interface. This release introduces a significantly enhanced core API layer that acts as an intelligent proxy and harmonizer for a multitude of AI services. Instead of requiring developers to write bespoke code for OpenAI, Anthropic, Google, Hugging Face, or specialized fine-tuned models, OpenClaw now provides a single, consistent entry point. This means a developer can swap out a text generation model from one provider for another with minimal code changes, facilitating rapid experimentation and helping you avoid vendor lock-in.
The architecture supporting this Unified API has been meticulously re-engineered for resilience, scalability, and performance. We've implemented an intelligent routing layer that dynamically directs requests to the optimal backend service based on configured preferences, model availability, and performance metrics. This ensures that your applications benefit from the best available model for a given task, while you maintain control over the underlying infrastructure. Furthermore, our unified request and response schemas standardize data formats, eliminating the need for cumbersome data transformations between different model outputs. Imagine building a chatbot that seamlessly switches between models to handle different types of queries – a powerful LLM for complex reasoning, a specialized embedding model for semantic search, and a smaller, faster model for simple greetings – all orchestrated through a single OpenClaw API call. This level of abstraction not only accelerates development but also enhances the robustness and adaptability of your AI applications.
Key advancements in our Unified API capabilities include:
- Standardized Request/Response Formats: We've introduced a universal JSON schema for model interactions, ensuring that inputs and outputs from various providers conform to a single, predictable structure. This significantly reduces parsing complexity and error rates.
- Intelligent Routing and Fallback Mechanisms: Our new routing engine allows you to define preferred models and providers, with automatic fallback to alternative models or providers if a primary service experiences downtime or performance degradation. This enhances application reliability and uptime.
- Centralized Authentication & Authorization: Manage all your AI service API keys and access controls from a single OpenClaw dashboard. No more scattering credentials across various environment variables or configuration files. This improves security and simplifies credential management.
- Unified Rate Limiting & Quota Management: OpenClaw now consolidates rate limits and quotas across all integrated services, providing a holistic view of your usage and preventing individual service overages.
- Integrated Observability: Logs, metrics, and traces from all integrated AI services are now aggregated and standardized within OpenClaw, offering a single pane of glass for monitoring your AI operations. This simplifies debugging and performance analysis.
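To make the routing and fallback behavior above concrete, here is a minimal sketch in Python. All names (`UnifiedRequest`, the provider callables, the response shape) are illustrative assumptions, not OpenClaw's actual SDK: the idea is simply one request schema, an ordered preference list, and automatic failover when a provider errors out.

```python
# Illustrative sketch of unified routing with fallback. Not OpenClaw's API;
# names and shapes here are invented for demonstration.
from dataclasses import dataclass, field


@dataclass
class UnifiedRequest:
    prompt: str
    preferred_models: list = field(default_factory=list)


def route(request, providers):
    """Try each preferred model in order; fall back on failure.

    `providers` maps model name -> callable that may raise on downtime.
    """
    errors = {}
    for model in request.preferred_models:
        try:
            return {"model_used": model, "output": providers[model](request.prompt)}
        except Exception as exc:  # downtime, rate limiting, etc.
            errors[model] = str(exc)
    raise RuntimeError(f"All providers failed: {errors}")


# Usage: the primary "provider" is down, so the router falls back.
def flaky(prompt):
    raise ConnectionError("503 from upstream")


def healthy(prompt):
    return f"echo: {prompt}"


result = route(UnifiedRequest("hi", ["gpt-x", "claude-y"]),
               {"gpt-x": flaky, "claude-y": healthy})
```

Because the response shape is the same regardless of which backend answered, the caller never needs provider-specific parsing, which is the core of the unified-schema argument above.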
The long-term implications of a truly Unified API are profound. It democratizes access to state-of-the-art AI, allowing smaller teams and individual developers to leverage the power of multiple cutting-edge models without the overhead of extensive integration work. It fosters innovation by reducing the friction associated with experimenting with new models and technologies. Moreover, it creates a future-proof foundation for AI development, where new models can be integrated into OpenClaw's platform, becoming immediately accessible to all users through the same familiar interface, significantly reducing migration efforts. This commitment to unification is not just a feature; it's a strategic direction that positions OpenClaw at the forefront of the AI development ecosystem. By simplifying the underlying complexity, we empower you to focus on what truly matters: building revolutionary AI applications.
II. Expanded Multi-Model Support for Unparalleled Flexibility
In the rapidly expanding universe of artificial intelligence, no single model reigns supreme for all tasks. Some models excel at creative writing, others at precise data extraction, while still others are optimized for speed or cost efficiency. The ability to seamlessly integrate and leverage a diverse array of models is no longer a luxury but a fundamental necessity for building sophisticated, adaptable AI applications. This release of OpenClaw introduces dramatically expanded Multi-model support, providing developers with an unprecedented level of flexibility and power to choose the right tool for every job.
Our enhanced Multi-model support capabilities mean that OpenClaw now acts as a comprehensive gateway to an ever-growing library of AI models, spanning various modalities and providers. This isn't just about adding more names to a list; it's about deeply integrating these models in a way that makes them easily discoverable, configurable, and deployable within your OpenClaw workflows. We've significantly broadened our partnerships and integration frameworks to include a wider range of foundational models, specialized fine-tuned models, and open-source alternatives. This comprehensive approach ensures that whether you're building a hyper-personalized recommendation engine, a nuanced sentiment analysis tool, or a complex multi-agent system, you have immediate access to the optimal AI model to achieve your objectives.
One of the most exciting aspects of this expanded Multi-model support is the ability to orchestrate complex tasks by chaining different models together. Imagine a workflow where an initial LLM summarizes a long document, then another specialized model extracts key entities, and finally, a third model generates a report based on the extracted information – all managed within a single OpenClaw pipeline. Our platform now facilitates such intricate model interactions with intuitive configuration options, allowing you to define dependencies, transform outputs between models, and manage the overall flow with ease. This multi-model orchestration capability opens up entirely new paradigms for AI application development, enabling solutions that are far more sophisticated and nuanced than those achievable with a single model.
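The summarize-then-extract-then-report workflow described above can be sketched as a simple pipeline of stages, each standing in for a model call. The stage functions below are stubs invented for illustration; in a real OpenClaw pipeline each would be an API call to a different model.

```python
# Illustrative multi-model chaining: each stage transforms the previous
# stage's output. Stage functions are stubs, not real model calls.
def run_pipeline(document, stages):
    data = document
    for name, fn in stages:
        data = fn(data)  # thread one stage's output into the next
    return data


# Stub "models" standing in for real API calls:
summarize = lambda text: text[:20]                     # LLM summary stub
extract = lambda summary: summary.split()              # entity-extraction stub
report = lambda entities: f"{len(entities)} entities"  # report-generation stub

result = run_pipeline("OpenClaw unifies many AI providers behind one API.",
                      [("summarize", summarize),
                       ("extract", extract),
                       ("report", report)])
```

The value of orchestration at the platform level is that the glue code above (output transformation, stage ordering, error handling) becomes configuration rather than application code.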
Furthermore, our commitment to Multi-model support extends to performance and reliability. Each integrated model undergoes rigorous testing to ensure compatibility, stability, and optimal performance when accessed via OpenClaw. We provide detailed performance benchmarks and usage statistics for each model, empowering you to make informed decisions based on your specific latency, throughput, and accuracy requirements. Our platform also includes built-in versioning for integrated models, allowing you to pin your applications to specific model versions for reproducibility, while still having the flexibility to upgrade when newer, improved versions become available.
To illustrate the breadth of our expanded Multi-model support, consider the following examples of newly integrated categories and providers:
- Advanced Text Generation Models: Access to the latest iterations of large language models from leading providers, offering superior coherence, factual accuracy, and contextual understanding. Includes specialized models for code generation, creative writing, and summarization.
- Cutting-Edge Embedding Models: Enhanced selection of embedding models for semantic search, retrieval-augmented generation (RAG), and content recommendation systems. These models are crucial for building applications that understand context and meaning.
- Vision Models (Preview): Early access to a selection of robust vision models for image classification, object detection, and visual question answering. This marks OpenClaw's initial foray into multimodal AI capabilities, with more to come.
- Speech-to-Text & Text-to-Speech: Integrations with high-accuracy speech processing models for transcribing audio and generating natural-sounding speech, perfect for voice assistants and accessibility features.
- Open-Source Integrations: Simplified deployment and management of popular open-source models (e.g., from Hugging Face) directly within your OpenClaw environment, giving you greater control and transparency.
This expanded Multi-model support is not just about quantity; it's about quality and utility. It's about empowering developers to architect AI solutions that are truly intelligent, adaptive, and capable of addressing a wide spectrum of real-world problems. By providing a unified, performant, and reliable gateway to diverse AI models, OpenClaw ensures that your applications are always leveraging the best available technology, driving innovation without the burden of complex, fragmented integrations.
III. Advanced Cost Optimization Strategies and Tools
In the burgeoning world of AI, the promise of transformative technology often comes with a significant caveat: cost. Running large language models, especially at scale, can quickly become an exorbitant expenditure, challenging project budgets and limiting innovation. At OpenClaw, we believe that powerful AI should also be economically viable. This release introduces a comprehensive suite of advanced Cost optimization strategies and tools, meticulously designed to help you manage, monitor, and significantly reduce your AI infrastructure expenses without compromising on performance or capability.
Our approach to Cost optimization is multi-faceted, addressing various aspects of AI consumption from model selection to request routing and resource management. We understand that effective cost management requires transparency, control, and intelligent automation. This update provides you with unprecedented visibility into your AI spending, granular control over how and where your resources are allocated, and smart features that automatically guide you towards more economical choices.
One of the core pillars of our Cost optimization efforts is intelligent model routing. Leveraging our Unified API and Multi-model support, OpenClaw can now dynamically route your requests to the most cost-effective model available that still meets your performance and accuracy requirements. For instance, if a less expensive, smaller model can adequately handle 80% of your requests (e.g., simple FAQ queries), OpenClaw can automatically direct those requests to that model, reserving more powerful and expensive models for truly complex tasks. This smart routing can lead to substantial savings, especially for applications with high request volumes. Our new routing configuration interface allows you to define these rules with fine-grained control, setting thresholds for latency, accuracy, and, crucially, cost.
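The "cheapest model that still meets the bar" rule described above can be expressed in a few lines. The catalog values below (costs, quality scores) are invented for illustration and do not reflect real provider pricing:

```python
# Sketch of cost-aware model selection: pick the cheapest model whose
# quality score meets the task's threshold. Catalog values are made up.
CATALOG = [
    # (model, cost per 1K tokens in USD, rough quality score 0-1)
    ("small-fast", 0.0005, 0.70),
    ("mid-tier",   0.0030, 0.85),
    ("frontier",   0.0150, 0.97),
]


def cheapest_adequate(min_quality):
    candidates = [m for m in CATALOG if m[2] >= min_quality]
    if not candidates:
        raise ValueError("no model meets the quality bar")
    return min(candidates, key=lambda m: m[1])[0]


# Simple FAQ traffic can take the cheap model; hard reasoning needs the big one.
faq_model = cheapest_adequate(0.6)
hard_model = cheapest_adequate(0.9)
```

If 80% of traffic clears the low bar, the blended per-request cost falls sharply even though the expensive model is still available for the hard 20%.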
Furthermore, we've introduced advanced caching mechanisms that significantly reduce redundant API calls. For frequently asked questions or common prompts, OpenClaw can now store and serve responses from a high-performance cache, bypassing the need to query the underlying LLM again. This not only saves money on API usage but also drastically improves response times, enhancing the user experience. You have full control over caching policies, including time-to-live (TTL) and invalidation strategies, allowing you to tailor caching to your application's specific needs.
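A toy version of the TTL caching described above, purely for illustration (OpenClaw's actual cache is a managed, distributed component; this in-memory sketch just shows the contract):

```python
# Minimal TTL response cache: identical prompts within `ttl` seconds are
# served from memory instead of re-querying the model. Illustrative only.
import time


class TTLCache:
    def __init__(self, ttl_seconds):
        self.ttl = ttl_seconds
        self.store = {}  # prompt -> (expiry_timestamp, response)

    def get_or_call(self, prompt, model_fn):
        now = time.monotonic()
        hit = self.store.get(prompt)
        if hit and hit[0] > now:
            return hit[1], True            # (response, served_from_cache)
        response = model_fn(prompt)
        self.store[prompt] = (now + self.ttl, response)
        return response, False


calls = []


def fake_model(p):
    calls.append(p)  # track how many real "API calls" happen
    return p.upper()


cache = TTLCache(ttl_seconds=60)
first, cached1 = cache.get_or_call("hello", fake_model)   # real call
second, cached2 = cache.get_or_call("hello", fake_model)  # cache hit
```

The TTL is the knob mentioned above: a long TTL maximizes savings for stable FAQ-style content, while a short TTL keeps answers fresh for volatile data.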
To provide you with clearer insights into your spending, we've overhauled our billing and analytics dashboard. The new dashboard offers real-time usage metrics broken down by model, provider, project, and even specific API endpoints. You can visualize cost trends, identify spending hotspots, and forecast future expenses with greater accuracy. This transparency empowers you to make data-driven decisions about your AI resource allocation and identify areas for potential savings. We've also introduced customizable alerts and notifications that can warn you when usage approaches predefined budget limits, helping you prevent unexpected overages.
Consider the following table demonstrating potential savings through OpenClaw's Cost optimization features:
| Optimization Strategy | Description | Estimated Savings (up to) | Impact Areas |
|---|---|---|---|
| Intelligent Model Routing | Automatically directs requests to the most cost-effective model meeting performance criteria. | 40% | API costs, resource allocation |
| Advanced Caching | Stores and reuses responses for common queries, reducing redundant API calls. | 30% | API costs, latency, throughput |
| Batch Processing (New) | Groups multiple smaller requests into a single, more efficient API call where supported by providers. | 25% | API costs, request overhead |
| Usage-based Quota Management | Set daily/monthly spending caps per project or team to prevent overages. | 15% | Budget control, expense predictability |
| Provider Fallback | Leverages less expensive providers as a fallback if the primary provider's costs surge or it becomes unavailable. | 10% | Resilience, flexible pricing |
| Token Optimization | New tools to analyze and optimize prompt length, reducing token consumption. | 5-15% | API costs (per-token models) |
Note: Savings are estimates and depend on specific usage patterns and model configurations.
Furthermore, OpenClaw now supports sophisticated token optimization techniques. For models that charge per token, reducing the input and output token count directly translates to savings. Our new prompt engineering helper tools provide insights into token usage and suggest ways to condense prompts without losing critical context. We also offer options for stripping unnecessary whitespace or metadata from prompts before they are sent to the underlying models.
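The whitespace-stripping idea above is easy to demonstrate. The 4-characters-per-token figure below is a common rough heuristic, not any model's real tokenizer, so treat the estimate as a ballpark only:

```python
# Sketch of prompt trimming for per-token billing: collapse redundant
# whitespace, then estimate tokens with the rough ~4 chars/token heuristic.
import re


def strip_whitespace(prompt):
    return re.sub(r"\s+", " ", prompt).strip()


def estimate_tokens(prompt):
    return max(1, len(prompt) // 4)  # heuristic, not a real tokenizer


raw = "Please    summarize \n\n  the following   document:   "
lean = strip_whitespace(raw)
saved = estimate_tokens(raw) - estimate_tokens(lean)
```

On large prompts assembled from templates or scraped documents, whitespace and boilerplate trimming alone can shave a meaningful fraction of input tokens before any semantic condensing is attempted.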
This comprehensive suite of Cost optimization features ensures that OpenClaw is not just a platform for building cutting-edge AI, but also a smart partner in managing your operational expenses. By empowering you with visibility, control, and intelligent automation, we help you keep your AI projects sustainable and scalable, allowing you to innovate freely without the constant worry of runaway costs.
IV. Detailed Release Notes - OpenClaw v3.5.0 "Hydra"
This section delves into the specific features, improvements, and fixes rolled out in OpenClaw v3.5.0, codenamed "Hydra," signifying its multi-headed approach to AI integration and problem-solving.
A. New Features
- Dynamic Model Gateway (DMG): At the heart of our Unified API advancements is the new Dynamic Model Gateway. This intelligent layer automatically detects and routes incoming requests to the most appropriate AI model based on a sophisticated set of criteria including cost, latency, model capability, and your pre-defined preferences.
- Configuration: Users can now define routing policies via a user-friendly YAML or JSON configuration, allowing for complex conditional logic (e.g., "if prompt contains 'code generation', use Model X; otherwise, use Model Y, prioritizing the cheapest option").
- Health Checks: The DMG continuously monitors the health and availability of all integrated backend AI services, ensuring requests are never routed to an unresponsive endpoint, with automatic failover to healthy alternatives.
- Versioning: Supports routing to specific model versions from providers, allowing for controlled rollouts and A/B testing of different model iterations.
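The conditional-routing example above ("if prompt contains 'code generation', use Model X") can be sketched as policy evaluation over a small rule list. The policy shape below is invented for illustration; OpenClaw's actual YAML/JSON schema may differ:

```python
# Sketch of DMG-style policy evaluation: first matching keyword rule wins,
# otherwise fall through to the default. Policy shape is hypothetical.
POLICY = {
    "rules": [
        {"contains": "code generation", "model": "model-x"},
        {"contains": "translate", "model": "model-y"},
    ],
    "default": "cheapest-available",
}


def select_model(prompt, policy):
    text = prompt.lower()
    for rule in policy["rules"]:
        if rule["contains"] in text:
            return rule["model"]
    return policy["default"]


m1 = select_model("Help with code generation in Go", POLICY)
m2 = select_model("What's the weather?", POLICY)
```

First-match-wins semantics keep policies predictable: rule order, not rule specificity, decides ties.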
- Expanded Multi-modal Input/Output Support (Beta): Building on our Multi-model support, OpenClaw v3.5.0 introduces preliminary capabilities for handling multi-modal inputs and outputs.
- Image-to-Text Integration: Initial integrations with leading image-to-text models (e.g., Vision Transformers), allowing you to send image URLs or base64 encoded images as part of your prompt to supported LLMs. The API now accepts an array of content types within the message object.
- Audio Transcription API: A dedicated endpoint for high-accuracy audio transcription, integrating with specialized speech-to-text models. Supports various audio formats (MP3, WAV, FLAC).
- Text-to-Speech API: Generate natural-sounding speech from text inputs, configurable with different voices and languages.
- Real-time Cost Monitoring and Alerting: A significant enhancement for Cost optimization.
- Granular Cost Dashboard: The new analytics dashboard provides real-time expenditure tracking down to the request level, allowing filtering by project, user, model, and provider.
- Customizable Budget Alerts: Set up email or webhook notifications when daily, weekly, or monthly spending thresholds are approached or exceeded for any project or the entire organization.
- Cost Forecasting: Basic forecasting models are now integrated, predicting future spending based on historical usage patterns, helping you plan your budgets more effectively.
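The budget-alert behavior above boils down to comparing spend against threshold fractions of a budget. The 80% warning level below is an assumption for illustration, not a documented OpenClaw default:

```python
# Toy version of budget alerting: classify spend relative to a budget.
# The 80% "approaching" threshold is an illustrative assumption.
def check_budget(spent_usd, budget_usd, warn_fraction=0.8):
    if spent_usd >= budget_usd:
        return "exceeded"
    if spent_usd >= budget_usd * warn_fraction:
        return "approaching"
    return "ok"


statuses = [check_budget(s, 100.0) for s in (10.0, 85.0, 120.0)]
```

In practice the "approaching" state would fire the email or webhook notification described above, giving teams time to react before a hard cap is hit.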
- Token Optimization Tools Suite: New utilities integrated into the OpenClaw SDK and CLI for enhanced Cost optimization.
- Prompt Token Estimator: A local SDK function to estimate token usage for a given prompt and model before making an API call, helping developers craft more efficient prompts.
- Context Stripper: An optional pre-processing filter that intelligently removes redundant introductory phrases, excessive whitespace, or specific stop words from prompts to reduce token count without losing meaning.
- Summarization Filter (Pre-request): For very long documents, an optional filter can be applied to automatically summarize content before sending it to an LLM, dramatically reducing input tokens for downstream tasks.
- Project-level Access Control (RBAC Enhancements): Further refinements to Role-Based Access Control.
- Fine-grained Permissions: Admins can now define highly specific permissions for users and teams at the project level, controlling access to specific models, API endpoints, or even spending limits.
- Audit Logs for Admin Actions: Comprehensive audit trails for all administrative actions (e.g., changing API keys, modifying routing rules, adjusting budgets), enhancing security and compliance.
B. Improvements & Enhancements
- Unified API Request/Response Schema v2.0:
- Consistency: Standardized error codes and message formats across all integrated models, reducing parsing complexity for developers.
- Extended Metadata: Response objects now include more detailed metadata, such as `model_used`, `provider_latency`, `openclaw_latency`, and `estimated_cost`, providing deeper insights into each API call. This aids in both debugging and Cost optimization.
- Streaming API Improvements: Enhanced support for server-sent events (SSE) with more resilient connections and clearer termination signals, crucial for building real-time applications like chatbots.
- Expanded Model Integrations:
- Latest Foundation Models: Integrated the newest stable versions of models from OpenAI, Anthropic, Google AI, and Mistral AI, ensuring users have access to the latest advancements.
- Specialized Models: Added support for several specialized models focused on areas like medical text analysis, legal document processing, and financial sentiment analysis, significantly bolstering Multi-model support.
- Hugging Face Hub Integration (Public Preview): Simplified the process of deploying and querying models directly from the Hugging Face Hub, with built-in mechanisms for authentication and resource provisioning (currently supporting a curated list of models).
- Performance Optimizations:
- Reduced API Latency: Optimized internal routing and proxy layers, resulting in an average 15% reduction in end-to-end API latency for most models.
- Increased Throughput: Improved connection pooling and request parallelization allows for higher concurrent request volumes, especially beneficial for high-traffic applications.
- Optimized Resource Utilization: OpenClaw's internal infrastructure has been fine-tuned to use fewer compute resources per request, contributing to our own operational Cost optimization and allowing us to pass on potential savings.
- Developer Experience (DX) Improvements:
- Enhanced SDKs: Updated Python, Node.js, and Go SDKs with full support for new features, improved type hinting, and more comprehensive examples.
- Interactive API Documentation: Our API reference now includes an interactive sandbox where you can test API calls directly within the browser, with real-time response previews.
- CLI Tool Enhancements: The OpenClaw CLI now offers more commands for managing projects, users, and routing configurations, making automation easier.
C. Bug Fixes
- Fixed: Intermittent `502 Bad Gateway` errors during high-volume streaming requests across specific LLM providers.
- Fixed: Incorrect token usage reporting for certain models when specific non-ASCII characters were present in prompts. (Impacts Cost optimization accuracy).
- Fixed: UI rendering issues in the analytics dashboard on smaller screen sizes.
- Fixed: A rare bug where cached responses were not being invalidated correctly under specific cache policy configurations.
- Fixed: Issues with project invitation links expiring prematurely.
- Fixed: Security vulnerability related to insufficient input sanitization in a niche API endpoint. (High priority fix).
- Fixed: Improved robustness of API key rotation mechanism to prevent temporary service interruptions.
D. Security & Compliance Updates
OpenClaw remains committed to the highest standards of security and data privacy. This release includes:
- Enhanced Data Encryption: All data at rest and in transit within OpenClaw's infrastructure now utilizes stronger encryption protocols (TLS 1.3, AES-256).
- Regular Security Audits: Completion of our quarterly security audit with no critical vulnerabilities found, further strengthening the platform's integrity.
- GDPR and CCPA Compliance Tools: New features to assist organizations in meeting their GDPR and CCPA obligations, including improved data deletion workflows and audit logging capabilities for sensitive data access.
V. Driving Innovation with OpenClaw: A Future Perspective
The release of OpenClaw v3.5.0 "Hydra" represents far more than just a collection of new features and fixes; it's a clear statement of our long-term vision. We are building a future where the incredible power of AI is not constrained by complexity, cost, or fragmentation, but rather unleashed through intelligent, unified, and developer-centric tools. Our commitment to the Unified API, expansive Multi-model support, and robust Cost optimization will continue to guide our development roadmap, ensuring that OpenClaw remains at the cutting edge of AI innovation.
We recognize that the AI ecosystem is constantly evolving, with new models, techniques, and challenges emerging almost daily. Our platform is designed to be agile, adaptable, and forward-looking, ready to integrate the next wave of AI breakthroughs seamlessly into your workflows. We are actively researching and developing solutions for even more advanced multi-modal capabilities, highly personalized AI agents, and sophisticated autonomous AI systems. Your feedback and engagement are invaluable in shaping this future, and we encourage you to join our community forums, share your ideas, and contribute to making OpenClaw an even more powerful tool.
As you embark on building the next generation of intelligent applications, remember that the complexity of managing disparate AI services is a problem that has been largely addressed by innovative platforms. For instance, companies seeking a comprehensive unified API platform to streamline access to large language models (LLMs) from over 20 active providers might look to solutions like XRoute.AI. XRoute.AI offers a single, OpenAI-compatible endpoint that simplifies integration, focusing on low latency AI, cost-effective AI, and developer-friendly tools. This commitment to abstraction and optimization is precisely the kind of vision that OpenClaw shares, striving to empower developers to build intelligent solutions without the complexity of managing multiple API connections. Our continuous drive for high throughput, scalability, and flexible pricing models aligns perfectly with the goal of enabling projects of all sizes to leverage the full potential of AI.
We are incredibly excited to see what you will build with the enhanced capabilities of OpenClaw v3.5.0. Dive in, experiment, and transform your AI ideas into reality.
Frequently Asked Questions (FAQ)
Q1: What is the primary benefit of OpenClaw's new Unified API? A1: The primary benefit of OpenClaw's new Unified API is the significant reduction in integration complexity. Instead of managing multiple APIs from different AI providers, developers can now interact with a single, consistent OpenClaw endpoint. This simplifies development, accelerates experimentation, and makes it much easier to swap out models or providers without extensive code changes, thereby avoiding vendor lock-in and boosting development efficiency.
Q2: How does OpenClaw's Multi-model support help developers? A2: OpenClaw's expanded Multi-model support provides developers with unparalleled flexibility. It allows them to choose the optimal AI model for each specific task, drawing from a diverse library of models from various providers. This means you can leverage specialized models for nuanced tasks, combine different models in complex workflows, and build more sophisticated and adaptable AI applications that truly excel in performance and accuracy.
Q3: What are the key features for Cost optimization in this release? A3: This release introduces several powerful Cost optimization features, including the Dynamic Model Gateway for intelligent, cost-aware routing of requests to the cheapest suitable model, advanced caching mechanisms to reduce redundant API calls, and a comprehensive real-time cost monitoring dashboard with customizable budget alerts. We also provide token optimization tools to help reduce token consumption, all designed to make your AI projects more economically sustainable.
Q4: Is the Multi-modal Input/Output support production-ready? A4: The Multi-modal Input/Output support introduced in v3.5.0 is currently in public preview (Beta). While it offers robust initial capabilities for image-to-text, audio transcription, and text-to-speech, we recommend thorough testing for your specific use cases before deploying to production. We are actively gathering feedback and will continue to enhance its stability and feature set in upcoming releases.
Q5: How can I provide feedback or get support for OpenClaw v3.5.0? A5: We highly value your feedback! You can engage with our community through the official OpenClaw forums, submit bug reports or feature requests via our GitHub repository, or contact our dedicated support team directly through the OpenClaw dashboard's help section. Our documentation portal also contains comprehensive guides and tutorials to help you get started and troubleshoot any issues.
🚀 You can securely and efficiently connect to over 60 large language models with XRoute in just two steps:
Step 1: Create Your API Key
To start using XRoute.AI, the first step is to create an account and generate your XRoute API KEY. This key unlocks access to the platform’s unified API interface, allowing you to connect to a vast ecosystem of large language models with minimal setup.
Here’s how to do it:
1. Visit https://xroute.ai/ and sign up for a free account.
2. Upon registration, explore the platform.
3. Navigate to the user dashboard and generate your XRoute API KEY.
This process takes less than a minute, and your API key will serve as the gateway to XRoute.AI’s robust developer tools, enabling seamless integration with LLM APIs for your projects.
Step 2: Select a Model and Make API Calls
Once you have your XRoute API KEY, you can select from over 60 large language models available on XRoute.AI and start making API calls. The platform’s OpenAI-compatible endpoint ensures that you can easily integrate models into your applications using just a few lines of code.
Here’s a sample configuration to call an LLM:
```shell
curl --location 'https://api.xroute.ai/openai/v1/chat/completions' \
  --header "Authorization: Bearer $apikey" \
  --header 'Content-Type: application/json' \
  --data '{
    "model": "gpt-5",
    "messages": [
      {
        "content": "Your text prompt here",
        "role": "user"
      }
    ]
  }'
```
Note that the Authorization header uses double quotes so the shell expands the `$apikey` variable; with single quotes the literal string `$apikey` would be sent.
With this setup, your application can instantly connect to XRoute.AI’s unified API platform, leveraging low latency AI and high throughput (handling 891.82K tokens per month globally). XRoute.AI manages provider routing, load balancing, and failover, ensuring reliable performance for real-time applications like chatbots, data analysis tools, or automated workflows. You can also purchase additional API credits to scale your usage as needed, making it a cost-effective AI solution for projects of all sizes.
Note: Explore the documentation on https://xroute.ai/ for model-specific details, SDKs, and open-source examples to accelerate your development.
