Meta unveils a new large language model that can run on a single GPU

LLaMA-13B reportedly outperforms ChatGPT-like tech despite being 10x smaller.

Enlarge (credit: Benj Edwards / Ars Technica)

On Friday, Meta announced a new AI-powered large language model (LLM) called LLaMA-13B that it claims can outperform OpenAI’s GPT-3 model despite being “10x smaller.” Smaller-sized AI models could lead to running ChatGPT-style language assistants locally on devices such as PCs and smartphones. It’s part of a new family of language models called “Large Language Model Meta AI,” or LLAMA for short.

The LLaMA collection of language models range from 7 billion to 65 billion parameters in size. By comparison, OpenAI’s GPT-3 model—the foundational model behind ChatGPT—has 175 billion parameters.

Meta trained its LLaMA models using publicly available datasets, such as Common Crawl, Wikipedia, and C4, which means the firm can potentially release the model and the weights open source. That’s a dramatic new development in an industry where, up until now, the Big Tech players in the AI race have kept their most powerful AI technology to themselves.

Read 6 remaining paragraphs | Comments

ars-rss

Recent Posts

Recent Comments

This devious two-step phishing campaign uses Microsoft tools to bypass email security

I reviewed over 30 pairs of headphones in 2024 and here’s the one I keep coming back to

The Rise and Future Fall of MicroStrategy

Categories

Archives

Recent Posts

Recent Comments

This devious two-step phishing campaign uses Microsoft tools to bypass email security

I reviewed over 30 pairs of headphones in 2024 and here’s the one I keep coming back to

The Rise and Future Fall of MicroStrategy

Categories

Archives

Meta unveils a new large language model that can run on a single GPU

Leave a Reply Cancel reply

Archives

Categories