Inception, the company behind the first commercial diffusion large language models (dLLMs), today announced the launch of Mercury 2, the fastest reasoning LLM and first reasoning dLLM. Mercury 2 ...
SambaNova today introduced its SN50 AI chip, which boasts a max speed that's 5X faster than competitive chips. The company also announced a planned collaboration with Intel to deliver ...
Taalas has launched an AI accelerator that puts the entire AI model into silicon, delivering 1-2 orders of magnitude greater ...
SUNNYVALE, Calif.--(BUSINESS WIRE)--Meta has teamed up with Cerebras to offer ultra-fast inference in its new Llama API, bringing together the world’s most popular open-source models, Llama, with the ...
PALO ALTO, Calif.--(BUSINESS WIRE)--SambaNova, the AI inference company delivering fast, efficient AI chips and high-performance models, today announced that its AI platform is now available in AWS ...
Anthropic has launched Claude Fast Mode for Opus 4.6, delivering 2.5x faster responses at 6x the standard price for time-sensitive development tasks.
Sometimes, a demo is all you need to understand a product. And that’s the case with Runware. If you head over to Runware’s website, enter a prompt and hit enter to generate an image, you’ll be ...
Most of the investment buzz in AI hardware concentrates on the amazing accelerator chips that crunch the math required for neural networks, like Nvidia’s GPUs. But what about the rest of the story?
AI compute company Cerebras Systems today announced what it said is the fastest AI inference solution. Cerebras Inference delivers 1,800 tokens per second for Llama3.1 8B and 450 tokens per second for ...
OpenAI on Thursday announced a lightweight version of its agentic coding tool Codex, releasing GPT-5.3-Codex-Spark as a smaller model designed for faster inference. The company said Spark is built for ...
Artificial intelligence inference startup Simplismart, officially known as Verute Technologies Pvt Ltd., said today it has closed on $7 million in funding to build out its infrastructure platform and ...
It all started because I heard great things about Kimi K2 (the latest open-source model by Chinese lab Moonshot AI) and its performance with agentic tool calls. The folks at Moonshot AI specifically ...