Sol and Terra set new high benchmark scores, while Luna performs near GPT-5.5 levels on several tests despite being ...
AI compressed the build. Fundamentals matter more, not less, and the product funnel is now where engineers earn their keep.
VentureBeat delivers news, analysis, and insights on AI, data, and security—helping business leaders stay ahead in the rapidly evolving tech landscape.
NUS researchers' MRAgent framework reduces LLM agent memory retrieval to 118K tokens per query — vs. 3.26M for LangMem — using step-by-step reasoning.
OpenAI is moving away from models that require heavy hand-holding and toward systems that can better infer the user’s goal, ...
LFM2.5-230M proves that while 3-billion-parameter models like VibeThinker are solving advanced calculus, a ...
Xiaomi's HarnessX autonomously rewrites AI agent harnesses mid-execution, delivering +14.5% avg performance gains — and +44% ...
Anthropic has launched Claude Tag, a persistent AI agent for Slack that lets enterprise teams delegate work, automate tasks, ...
Mistral AI's OCR 4 delivers structured document intelligence with bounding boxes, confidence scores, and self-hosted ...
Real environments can't inject edge cases on demand. Alibaba's Qwen-AgentWorld simulates them — and outperformed ...
Because Krea relinquishes centralized control over the downstream deployment of its open weights, the contract legally binds ...
AI has made it easy to ship code faster — but incidents-to-PR ratio is up 242.7% and bugs per developer up 54%. Here's what a real software factory actually requires.