Amazon has launched its next generation of OpenSearch Serverless, a fully managed search and vector database designed specifically for agentic workloads. This new system can instantly scale up when AI tasks trigger and scale back down to zero when idle, reflecting a growing realization that current cloud infrastructure isn't equipped for the machine-driven future.
While AI agents still represent a small portion of internet activity, machine-generated traffic is already significant and poised to grow. Cloudflare reports that bots accounted for 31% of overall HTTP traffic over the last six months, with AI crawlers making up roughly a quarter of all bot requests during that period.
Google's I/O developer conference highlighted the shift by allowing users to start delegating tasks to AI systems like researching purchases and booking travel. Enterprises are increasingly deploying agents internally and for customers, creating new kinds of machine-generated traffic behind the scenes.
The key technical change with this new generation is that it decouples compute from storage, allowing compute to scale up in seconds to accommodate agent traffic bursts and to scale down to zero, so customers pay $0 when agents are idle. This means the infrastructure can adapt to the constant and autonomous retrieval of information by AI agents.
The shift is emerging across the cloud industry, with Databricks and Snowflake repositioning themselves as AI memory and retrieval systems for enterprise data. Microsoft has rolled out updates to Azure designed to handle AI agent bursts and share memory between agents. Cloudflare recently introduced infrastructure aimed at giving agents persistent environments and instant scalability.







