Sol and Terra set new high benchmark scores, while Luna performs near GPT-5.5 levels on several tests despite being ...
AI compressed the build. Fundamentals matter more, not less, and the product funnel is now where engineers earn their keep.
VentureBeat delivers news, analysis, and insights on AI, data, and security—helping business leaders stay ahead in the rapidly evolving tech landscape.
OpenAI is moving away from models that require heavy hand-holding and toward systems that can better infer the user’s goal, ...
NUS researchers' MRAgent framework reduces LLM agent memory retrieval to 118K tokens per query — vs. 3.26M for LangMem — using step-by-step reasoning.
LFM2.5-230M proves that while 3-billion-parameter models like VibeThinker are solving advanced calculus, a ...
Xiaomi's HarnessX autonomously rewrites AI agent harnesses mid-execution, delivering +14.5% avg performance gains — and +44% ...
Because Krea relinquishes centralized control over the downstream deployment of its open weights, the contract legally binds ...
Mistral AI's OCR 4 delivers structured document intelligence with bounding boxes, confidence scores, and self-hosted ...
Anthropic has launched Claude Tag, a persistent AI agent for Slack that lets enterprise teams delegate work, automate tasks, ...
Real environments can't inject edge cases on demand. Alibaba's Qwen-AgentWorld simulates them — and outperformed ...
The companies attributed this speed to a deep software-hardware co-development process that actively used OpenAI’s own models ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results