UC Berkeley's PixelRAG renders pages as screenshots instead of parsing text, boosting RAG accuracy by up to 18.1% and cutting AI agent token costs 10x.
Kimi K2.7-Code claims 30% fewer thinking tokens and a drop-in API swap path, but independent benchmarks show kernel ...
This ensures that all agent activity adheres to the company’s specific commercial licenses, internal security policies, ...
Microsoft's SkillOpt brings deep-learning discipline to AI agent skills, replacing manual prompt tweaking with mathematically ...
The persistent memory system addresses a real and widely felt pain point in agentic development workflows — one that ...
LCLMs compress LLM context before decode — 8.8x faster at 16x compression, beating every KV cache method tested. Open-sourced by NYU and Columbia.
Enterprise AI teams have spent years solving for compute, securing GPU allocations, negotiating cloud capacity, and ...
Google's open-source diffusion language model generates 256 tokens in parallel and self-corrects, hitting 4x speed on one GPU ...
As AI continues to evolve, leaders should invest not only in tools, but also in R&D processes and cultural foundations that ...
Sapient researchers trained a 1B reasoning model on just 40B tokens — scoring competitively with 2B-7B models at a fraction ...
The victory of GPT-5.5 aligns with recent third-party analysis suggesting that OpenAI's models are currently superior at ...
MassMutual caps vendor contracts at 12 months and runs a multi-model architecture — cutting contact center resolution times ...