DeepSeek
DeepSeek-V3 achieves a significant breakthrough in inference speed over previous models. It tops the leaderboard among open-source models and rivals the most advanced closed-source models globally.
DeepSeek-V3 & DeepSeek-R1 Technical Reports - Graphcore …
Jan 30, 2025 · With their V3 and R1 models, DeepSeek sets a new state of the art in open-weight models and trades blows benchmark for benchmark with the best models from Anthropic, Google and OpenAI.
deepseek-ai/DeepSeek-R1 - GitHub
DeepSeek-R1 achieves performance comparable to OpenAI-o1 across math, code, and reasoning tasks. To support the research community, we have open-sourced DeepSeek-R1-Zero, DeepSeek-R1, and six dense models distilled from DeepSeek-R1 based on Llama and Qwen. DeepSeek-R1-Distill-Qwen-32B outperforms OpenAI-o1-mini across various benchmarks ...
Putting DeepSeek to the test: how its performance compares …
Feb 4, 2025 · DeepSeek claims its models perform comparably to OpenAI’s offerings, even exceeding the o1 model in certain benchmark tests. However, benchmarks that use Massive Multitask Language Understanding ...
How DeepSeek stacks up against popular AI models, in three charts
Jan 28, 2025 · DeepSeek released its buzziest large language model, R1, on Jan. 20. The AI assistant hit No. 1 on the Apple App Store in recent days, bumping OpenAI’s long-dominant ChatGPT down to No. 2.
DeepSeek R1: Features, o1 Comparison, Distilled Models & More
Jan 31, 2025 · In mathematics benchmarks, DeepSeek-R1 demonstrates strong performance. On AIME 2024, which evaluates advanced multi-step mathematical reasoning, DeepSeek-R1 scores 79.8%, slightly ahead of OpenAI o1-1217 at 79.2%.
We present DeepSeek-V3, a strong Mixture-of-Experts (MoE) language model with 671B total parameters, of which 37B are activated for each token. To achieve efficient inference and cost-effective training, DeepSeek-V3 adopts Multi-head Latent Attention (MLA) and the DeepSeekMoE architecture, which were thoroughly validated in DeepSeek-V2.
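The abstract's "671B total / 37B activated" gap comes from MoE routing: each token is sent to only a few experts, so only a fraction of the weights run per token. A toy sketch of top-k expert routing (illustrative only; all sizes here are hypothetical, and DeepSeek-V3's actual DeepSeekMoE design adds shared experts and auxiliary-loss-free load balancing, which this omits):

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy sizes, not DeepSeek-V3's real dimensions.
d_model, n_experts, top_k = 8, 16, 2
experts = [rng.standard_normal((d_model, d_model)) for _ in range(n_experts)]
gate_w = rng.standard_normal((d_model, n_experts))   # router weights

def moe_forward(x):
    """Route token x to its top_k experts; only those experts compute."""
    logits = x @ gate_w
    top = np.argsort(logits)[-top_k:]        # indices of selected experts
    weights = np.exp(logits[top])
    weights /= weights.sum()                 # softmax over the selected gates
    out = sum(w * (x @ experts[i]) for w, i in zip(weights, top))
    return out, top

x = rng.standard_normal(d_model)
y, chosen = moe_forward(x)
print(len(chosen), "of", n_experts, "experts activated")  # → 2 of 16 experts activated
```

Here 2 of 16 expert matrices run per token, the same reason only ~5.5% (37B of 671B) of DeepSeek-V3's parameters are active on any given token.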
deepseek-ai/DeepSeek-V3 - Hugging Face
DeepSeek-V3 achieves the best performance on most benchmarks, especially on math and code tasks. For more evaluation details, please check our paper. On the Needle In A Haystack (NIAH) context-window evaluation, DeepSeek-V3 performs well across all context window lengths up to 128K.
⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B ... - Hugging Face
Jan 2, 2025 · DeepSeek-V3 is THE new open-weights star, and it's a heavyweight at 671B, with 37B active parameters in its Mixture-of-Experts architecture. I tested it through the official DeepSeek API and it was quite fast (~50 tokens/s) and …
DeepSeek V3 LLM NVIDIA H200 GPU Inference Benchmarking
Jan 8, 2025 · Our benchmarks highlight the potential for improved throughput and scalability in LLMs. Moving forward, further tuning of kernels and inference engines like SGLang will likely amplify these benefits, setting a new standard for high-performance LLM deployment.