Prompt injection remains the most effective way to compromise enterprise AI systems because it exploits the fundamental way ...
AI compressed the build. Fundamentals matter more, not less, and the product funnel is now where engineers earn their keep.
Sol and Terra set new high benchmark scores, while Luna performs near GPT-5.5 levels on several tests despite being ...
Xiaomi's HarnessX autonomously rewrites AI agent harnesses mid-execution, delivering +14.5% avg performance gains — and +44% ...
LFM2.5-230M proves that while 3-billion-parameter models like VibeThinker are solving advanced calculus, a ...
OpenAI is moving away from models that require heavy hand-holding and toward systems that can better infer the user’s goal, ...
Anthropic has launched Claude Tag, a persistent AI agent for Slack that lets enterprise teams delegate work, automate tasks, ...
Because Krea relinquishes centralized control over the downstream deployment of its open weights, the contract legally binds ...
Mistral AI's OCR 4 delivers structured document intelligence with bounding boxes, confidence scores, and self-hosted ...
Real environments can't inject edge cases on demand. Alibaba's Qwen-AgentWorld simulates them — and outperformed ...
The companies attributed this speed to a deep software-hardware co-development process that actively used OpenAI’s own models ...
Drug discovery is notoriously inefficient. Pharmaceutical projects span years, moving from one specialized human team to the ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results