Training Process Model

14d

Microsoft's new AI training method eliminates bloated system prompts without sacrificing model performance

Microsoft researchers have developed On-Policy Context Distillation (OPCD), a training method that permanently embeds enterprise system prompt instructions into model weights, reducing inference ...

GeekWire

Ai2’s new Tulu 3 model rivals tech giants in breakthrough for open-source AI post-training

The Allen Institute for AI (Ai2) is releasing a new set of open-source AI models and related resources in an effort to shine a light on a critical but previously mysterious corner of the artificial ...

Science News

A look under the hood of DeepSeek’s AI models doesn’t provide all the answers

It’s been almost a year since DeepSeek made a major AI splash. In January, the Chinese company reported that one of its large language models rivaled an OpenAI counterpart on math and coding ...

VentureBeat

Google’s new AI training method helps small models tackle complex reasoning

Researchers at Google Cloud and UCLA have proposed a new reinforcement learning framework that significantly improves the ability of language models to learn very challenging multi-step reasoning ...

The Atlantic

Things Get Strange When AI Starts Training Itself

ChatGPT exploded into the world in the fall of 2022, sparking a race toward ever more advanced artificial intelligence: GPT-4, Anthropic’s Claude, Google Gemini, and so many others. Just yesterday, ...

CoinTelegraph

Training vs. testing data in machine learning

Machine learning (ML) is a subset of artificial intelligence (AI) that involves using algorithms and statistical models to enable computer systems to learn from data and improve performance on a ...

ZDNet

Beware of AI 'model collapse': How training on synthetic data pollutes the next generation

To feed the endless appetite of generative artificial intelligence (gen AI) for data, researchers have in recent years increasingly tried to create "synthetic" data, which is similar to the ...

Microsoft open-sources multimodal reasoning model with 15B parameters

The company mainly trained Phi-4-reasoning-vision-15B on open-source data. The data included images and text-based descriptions of the objects depicted in those images. Before it started training the ...

Windows Report

Microsoft’s New Phi-4 Vision 15B Model Decides When to Activate Deep Reasoning

Microsoft releases Phi-4 Reasoning Vision 15B, a multimodal AI model that activates its own thinking mode and handles ...

Results that may be inaccessible to you are currently showing.

Hide inaccessible results