DeepSeek: everything you need to know about the AI that dethroned ChatGPT

A year-old startup out of China is taking the AI industry by storm after releasing a chatbot that rivals the performance of ChatGPT while using a fraction of the power, cooling, and training budget that systems from OpenAI, Google, and Anthropic demand. Here’s everything you need to know about DeepSeek’s V3 and R1 models and why the company could fundamentally upend America’s AI ambitions.

What is DeepSeek?

DeepSeek (technically, “Hangzhou DeepSeek Artificial Intelligence Basic Technology Research Co., Ltd.”) is a Chinese AI startup that was originally founded as an AI lab for its parent company, High-Flyer, in April 2023. That May, DeepSeek was spun off into its own company (with High-Flyer remaining on as an investor), and in May 2024 it released its DeepSeek-V2 model. V2 offered performance on par with models from other leading Chinese AI firms, such as ByteDance, Tencent, and Baidu, but at a much lower operating cost.

The company followed up with the release of V3 in December 2024. V3 is a 671 billion-parameter model that reportedly took less than two months to train. What’s more, according to a recent analysis from Jefferies, DeepSeek had a “training cost of only US$5.6m (assuming $2/H800 hour rental cost). That is less than 10% of the cost of Meta’s Llama.” That’s a tiny fraction of the hundreds of millions to billions of dollars that US firms like Google, Microsoft, xAI, and OpenAI have spent training their models.
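The Jefferies figure is easy to sanity-check. The following is a minimal Python sketch, assuming only the numbers quoted above (a US$5.6 million total, a $2 per H800 GPU-hour rental rate, and the "less than 10%" comparison with Llama). The function names and the 60-day wall-clock figure are hypothetical stand-ins chosen for illustration, not values reported by DeepSeek or Jefferies.

```python
def implied_gpu_hours(total_cost_usd: float, rate_usd_per_gpu_hour: float) -> float:
    """GPU-hours implied by a total rental bill at a flat hourly rate."""
    return total_cost_usd / rate_usd_per_gpu_hour


def implied_cost_floor(cost_usd: float, max_share: float) -> float:
    """If cost_usd is at most max_share of another cost, that other cost is at least this."""
    return cost_usd / max_share


def min_gpus_for_deadline(gpu_hours: float, days: float) -> float:
    """GPUs needed to burn gpu_hours within `days`, running around the clock."""
    return gpu_hours / (days * 24)


TOTAL_COST_USD = 5.6e6   # from the Jefferies quote
RATE_USD = 2.0           # per H800 GPU-hour, from the same quote
ASSUMED_DAYS = 60        # assumption: "less than 2 months" read as a 60-day ceiling

gpu_hours = implied_gpu_hours(TOTAL_COST_USD, RATE_USD)
llama_floor = implied_cost_floor(TOTAL_COST_USD, 0.10)
min_gpus = min_gpus_for_deadline(gpu_hours, ASSUMED_DAYS)

print(f"Implied GPU-hours: {gpu_hours / 1e6:.1f} million")
print(f"Implied Llama training cost: more than ${llama_floor / 1e6:.0f} million")
print(f"GPUs needed to finish in {ASSUMED_DAYS} days: about {min_gpus:,.0f}")
```

Under those assumptions, the quoted budget corresponds to roughly 2.8 million GPU-hours, which is why a training run of under two months implies a cluster of around two thousand GPUs rather than tens of thousands.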

🚀 Introducing DeepSeek-V3!

Biggest leap forward yet:
⚡ 60 tokens/second (3x faster than V2!)
💪 Enhanced capabilities
🛠 API compatibility intact
🌍 Fully open-source models & papers

DeepSeek (@deepseek_ai), December 26, 2024

Benchmark tests put V3’s performance on par with GPT-4o and Claude 3.5 Sonnet. A December 2024 op-ed in The Hill characterized DeepSeek’s success as America’s “Sputnik Moment.”

DeepSeek released its R1-Lite-Preview model in November 2024, claiming that the new model could outperform OpenAI’s o1 family of reasoning models (and do so at a fraction of the price). The company estimates that the R1 model is between 20 and 50 times less expensive to run, depending on the task, than OpenAI’s o1. DeepSeek subsequently released DeepSeek-R1 and DeepSeek-R1-Zero in January 2025. The R1 model, unlike its o1 rival, is open source, which means that any developer can use it.

As such, V3 and R1 have exploded in popularity since their release, with DeepSeek’s V3-powered AI Assistant displacing ChatGPT at the top of the app stores. Venture capitalist Marc Andreessen, in a recent social media post, called DeepSeek’s chatbot “one of the most amazing and impressive breakthroughs I’ve ever seen” and a “profound gift to the world.”

What can DeepSeek do?

Because they are built on open-source large language models, DeepSeek’s chatbots can do essentially everything that ChatGPT, Gemini, and Claude can. That includes text, audio, image, and video generation. What’s more, DeepSeek’s newly released family of multimodal models, dubbed Janus Pro, reportedly outperforms DALL-E 3 as well as PixArt-alpha, Emu3-Gen, and Stable Diffusion XL on a pair of industry benchmarks. DeepSeek-R1, the company’s rival to o1, is specifically designed to perform complex reasoning tasks, generating step-by-step solutions and establishing “logical chains of thought” in which it explains its reasoning as it works through a problem.

oh boy #deepseek

Alexios Mantzarlis (@mantzarlis.com), January 27, 2025

What DeepSeek’s products can’t do is talk about Tiananmen Square. Or the Yellow Umbrella protests. Or President Xi Jinping’s likeness to Winnie the Pooh. Basically, if it’s a subject considered verboten by the Chinese Communist Party, DeepSeek’s chatbot will not address it or engage in any meaningful way.

Who can use DeepSeek?

Why is DeepSeek suddenly such a big deal?

Since the release of ChatGPT in November 2022, American AI companies have been laser-focused on building bigger, more powerful, more expansive, and more power- and resource-intensive large language models. Rather than seek to build more cost-effective and energy-efficient LLMs, companies like OpenAI, Microsoft, Anthropic, and Google instead saw fit to brute force the technology’s advancement by, in the American tradition, throwing absurd amounts of money and resources at the problem. In 2024 alone, xAI CEO Elon Musk was expected to personally spend upwards of $10 billion on AI initiatives. OpenAI and its partners just announced a $500 billion Project Stargate initiative that would drastically accelerate the construction of green energy utilities and AI data centers across the US. Google plans to prioritize scaling the Gemini platform throughout 2025, according to CEO Sundar Pichai, and is expected to spend billions this year in pursuit of that goal. Meta announced in mid-January that it would spend as much as $65 billion this year on AI development.

DeepSeek just showed the world that none of that is actually necessary: the “AI Boom” that has helped spur on the American economy in recent months, and that has made GPU companies like Nvidia exponentially more wealthy than they were in October 2023, may be nothing more than a sham, and the nuclear power “renaissance” along with it. This revelation also calls into question just how much of a lead the US actually has in AI, despite repeatedly banning shipments of leading-edge GPUs to China over the past year.

For an example, one need only look at how much market capitalization Nvidia lost in the week following R1’s release. The company’s stock value dropped 17% and it shed $600 billion (with a B) in a single trading session. That’s the largest single-day loss by a company in the history of the U.S. stock market, per Forbes, topping the company’s (and the stock market’s) previous record of $279 billion, which was set in September 2024. Nvidia literally lost a valuation equal to that of the entire ExxonMobil corporation in one day.
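To put those numbers in proportion, here is a minimal Python sketch that works only from the rounded figures quoted above (a 17% drop, a $600 billion loss, and the $279 billion previous record). The function name is hypothetical, and because the inputs are rounded, the results are rough approximations rather than official market-capitalization figures.

```python
def implied_value_before(loss_usd: float, drop_fraction: float) -> float:
    """Market value before a drop, given the dollar loss and the fractional decline."""
    return loss_usd / drop_fraction


LOSS_USD = 600e9             # single-session loss quoted above
DROP_FRACTION = 0.17         # 17% decline quoted above
PREVIOUS_RECORD_USD = 279e9  # September 2024 record quoted above

before = implied_value_before(LOSS_USD, DROP_FRACTION)
after = before - LOSS_USD
ratio_to_record = LOSS_USD / PREVIOUS_RECORD_USD

print(f"Implied value before the drop: about ${before / 1e12:.2f} trillion")
print(f"Implied value after the drop:  about ${after / 1e12:.2f} trillion")
print(f"Loss relative to the previous record: about {ratio_to_record:.2f}x")
```

On these rounded inputs, the loss works out to a little over twice the previous record, and it implies a company worth roughly $3.5 trillion going into the session.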

“The bottom line is the US outperformance has been driven by tech and the lead that US companies have in AI,” Keith Lerner, an analyst at Truist, told CNN. “The DeepSeek model rollout is leading investors to question the lead that US companies have and how much is being spent and whether that spending will lead to profits (or overspending).”

In short, DeepSeek just beat the American AI industry at its own game, showing that the current mantra of “growth at all costs” is no longer valid. “DeepSeek clearly doesn’t have access to as much compute as U.S. hyperscalers and somehow managed to develop a model that appears highly competitive,” Srini Pajjuri, semiconductor analyst at Raymond James, told CNBC. If a Chinese startup can build an AI model that works just as well as OpenAI’s latest and greatest, and do so in under two months and for less than $6 million, then what use is Sam Altman anymore?

“Time will tell if the DeepSeek threat is real — the race is on as to what technology works and how the big Western players will respond and evolve,” Michael Block, market strategist at Third Seven Capital, told CNN. “Markets had gotten too complacent on the beginning of the Trump 2.0 era and may have been looking for an excuse to pull back — and they got a great one here.”

What are the Americans going to do about it?

We’ve already seen the rumblings of a response from American firms, as well as the White House. “The release of DeepSeek, an AI from a Chinese company, should be a wake-up call for our industries that we need to be laser-focused on competing to win,” Donald Trump said, per the BBC. “We always have the ideas, we’re always first. I would say that it could be very much a positive development. Instead of spending billions and billions, you’ll spend less, and you’ll come up with, hopefully, the same solution.”

For his part, Meta CEO Mark Zuckerberg has “assembled four war rooms of engineers” tasked solely with figuring out DeepSeek’s secret sauce. As Fortune reports, two of the teams are investigating how DeepSeek manages its level of capability at such low costs, while another seeks to uncover the datasets DeepSeek utilizes. The final team is responsible for restructuring Llama, presumably to copy DeepSeek’s functionality and success.


xAI CEO Elon Musk simply went online and started trolling DeepSeek’s performance claims. His firm is currently attempting to build “the most powerful AI training cluster in the world” just outside Memphis, Tennessee. Conversely, OpenAI CEO Sam Altman welcomed DeepSeek to the AI race, stating “r1 is an impressive model, particularly around what they’re able to deliver for the price,” in a recent post on X. “We will obviously deliver much better models and also it’s legit invigorating to have a new competitor! we will pull up some releases.”

Even the U.S. Navy is getting involved. The armed service issued a warning to shipmates in January that DeepSeek was not to be used “in any capacity” because of “potential security and ethical concerns associated with the model’s origin and usage.” It’s “imperative,” the email memo read, that service members not use DeepSeek “for any work-related tasks or personal use.”
