deepseek-llm:7b-chat-q8_0

152.6K 1 year ago

An advanced language model crafted with 2 trillion bilingual tokens.

7b 67b

1 year ago

83e5a9a28d09 · 7.3GB

llama
·
6.91B
·
Q8_0
{ "num_ctx": 4096 }
{{ .System }} User: {{ .Prompt }} Assistant:

Readme

DeepSeek LLM is an advanced language model available in both 7 billion and 67 billion parameters. Both a chat and base variation are available.

  • Superior General Capabilities: DeepSeek LLM 67B Base outperforms Llama2 70B Base in areas such as reasoning, coding, math, and Chinese comprehension.

  • Proficient in Coding and Math: DeepSeek LLM 67B Chat exhibits outstanding performance in coding (using the HumanEval benchmark) and mathematics (using the GSM8K benchmark).

References

GitHub

HuggingFace