Large Train Models - Search News

Researchers find you don’t need a ton of data to train LLMs for reasoning tasks

Large language models (LLMs) can learn complex reasoning tasks without relying on large datasets, according to a new study by researchers at Shanghai Jiao Tong University. Their findings show that ...

14d

AI drug startup Insilico Medicine launches an AI ‘gym’ to help models like GPT and Qwen be good at science

With the “gym,” Insilico is now targeting other biotech and pharmaceutical companies, offering to train new AI models for them.

Science Daily

Like human brains, large language models reason about diverse data in a general way

Researchers find large language models process diverse types of data, like different languages, audio inputs, images, etc., similarly to how humans reason about complex problems. Like humans, LLMs ...

Geeky Gadgets

Easily Fine-Tune AI Models by Training Them with Your Own Data Using Encord

Training AI or large language models (LLMs) with your own data—whether for personal use or a business chatbot—often feels like navigating a maze: complex, time-consuming, and resource-intensive. If ...

VentureBeat

Nvidia researchers unlock 4-bit LLM training that matches 8-bit performance

Researchers at Nvidia have developed a novel approach to train large language models (LLMs) in 4-bit quantized format while maintaining their stability and accuracy at the level of high-precision ...

MIT Technology Review

Anthropic can now track the bizarre inner workings of a large language model

What the firm found challenges some basic assumptions about how this technology really works. The AI firm Anthropic has developed a way to peer inside a large language model and watch what it does as ...

MIT Technology Review

Anthropic has a new way to protect large language models against jailbreaks

This line of defense could be the strongest yet. But no shield is perfect. AI firm Anthropic has developed a new line of defense against a common kind of attack called a jailbreak. A jailbreak tricks ...

Wired

Small Language Models Are the New Rage, Researchers Say

The original version of this story appeared in Quanta Magazine. Large language models work well because they’re so large. The latest models from OpenAI, Meta, and DeepSeek use hundreds of billions of ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results