How to use from SGLang
Install from pip and serve model
# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "WizardLMTeam/WizardLM-13B-V1.0" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "WizardLMTeam/WizardLM-13B-V1.0",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'
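The same request can be sent from Python. The following is a minimal sketch using only the standard library; it assumes the server started above is reachable at `http://localhost:30000` and that the response follows the OpenAI-style completions layout (generated text under `choices[0].text`). The helper names `build_payload` and `complete` are illustrative and not part of SGLang.

```python
import json
import urllib.request

# Values taken from the curl example above.
SERVER_URL = "http://localhost:30000/v1/completions"
MODEL_ID = "WizardLMTeam/WizardLM-13B-V1.0"


def build_payload(prompt, max_tokens=512, temperature=0.5):
    """Build the JSON body used by the curl example (hypothetical helper)."""
    return {
        "model": MODEL_ID,
        "prompt": prompt,
        "max_tokens": max_tokens,
        "temperature": temperature,
    }


def complete(prompt, url=SERVER_URL, timeout=120):
    """POST the prompt to the completions endpoint and return the generated text."""
    body = json.dumps(build_payload(prompt)).encode("utf-8")
    request = urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        result = json.load(response)
    # Assumption: OpenAI-style responses place the text under choices[0].text.
    return result["choices"][0]["text"]
```

With the server running, `complete("Once upon a time,")` should return the model's continuation as a string.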
Use Docker images
docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "WizardLMTeam/WizardLM-13B-V1.0" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "WizardLMTeam/WizardLM-13B-V1.0",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

This repository contains the diff (delta) weights for WizardLM-13B V1.0.
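Diff weights are usually combined with the weights of a base model before use. The toy sketch below illustrates that idea only, under the assumption that each diff tensor equals the fine-tuned tensor minus the base tensor, so that adding the two recovers the fine-tuned model. It is not the project's conversion script: the function name `apply_diff` and the use of plain Python lists in place of real tensors are illustrative, and the Project Repo should be consulted for the actual procedure.

```python
def apply_diff(base_weights, diff_weights):
    """Return base + diff for every named parameter.

    Assumption: diff = fine_tuned - base, element-wise. Parameters are
    represented as flat lists of floats to keep the sketch dependency-free.
    """
    if base_weights.keys() != diff_weights.keys():
        raise ValueError("base and diff must contain the same parameter names")
    merged = {}
    for name, base in base_weights.items():
        diff = diff_weights[name]
        if len(base) != len(diff):
            raise ValueError(f"size mismatch for parameter {name!r}")
        # Element-wise addition recovers the fine-tuned values.
        merged[name] = [b + d for b, d in zip(base, diff)]
    return merged
```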

Project Repo: https://github.com/nlpxucan/WizardLM

NOTE: WizardLM-13B-1.0 and WizardLM-7B use different prompts at the beginning of the conversation.

For WizardLM-13B-1.0, the prompt should be as follows:

A chat between a curious user and an artificial intelligence assistant. The assistant gives helpful, detailed, and polite answers to the user's questions. USER: hello, who are you? ASSISTANT:

For WizardLM-7B, the prompt should be as follows:

{instruction}\n\n### Response:
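The two prompt formats above can be wrapped in small helpers so that the right template is applied per model. This is a minimal sketch: the template strings are copied from the note above, while the names `SYSTEM_13B`, `build_prompt_13b`, and `build_prompt_7b` are illustrative, and only the single-turn case shown in the note is covered.

```python
# System text copied from the WizardLM-13B-1.0 prompt in the note above.
SYSTEM_13B = (
    "A chat between a curious user and an artificial intelligence assistant. "
    "The assistant gives helpful, detailed, and polite answers to the user's questions."
)


def build_prompt_13b(user_message: str) -> str:
    """Single-turn prompt in the WizardLM-13B-1.0 format."""
    return f"{SYSTEM_13B} USER: {user_message} ASSISTANT:"


def build_prompt_7b(instruction: str) -> str:
    """Prompt in the WizardLM-7B format: instruction, blank line, response marker."""
    return f"{instruction}\n\n### Response:"


if __name__ == "__main__":
    print(build_prompt_13b("hello, who are you?"))
    print(build_prompt_7b("Write a haiku about autumn."))
```

The resulting string is what goes in the `prompt` field of a completions request.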

🤗 HF Repo • 🐦 Twitter • 📃 [WizardLM] • 📃 [WizardCoder] • 📃 [WizardMath]

👋 Join our Discord

| Model | Checkpoint | Paper | HumanEval | MBPP | Demo | License |
| --- | --- | --- | --- | --- | --- | --- |
| WizardCoder-Python-34B-V1.0 | 🤗 HF Link | 📃 [WizardCoder] | 73.2 | 61.2 | Demo | Llama2 |
| WizardCoder-15B-V1.0 | 🤗 HF Link | 📃 [WizardCoder] | 59.8 | 50.6 | -- | OpenRAIL-M |
| WizardCoder-Python-13B-V1.0 | 🤗 HF Link | 📃 [WizardCoder] | 64.0 | 55.6 | -- | Llama2 |
| WizardCoder-3B-V1.0 | 🤗 HF Link | 📃 [WizardCoder] | 34.8 | 37.4 | -- | OpenRAIL-M |
| WizardCoder-1B-V1.0 | 🤗 HF Link | 📃 [WizardCoder] | 23.8 | 28.6 | -- | OpenRAIL-M |

| Model | Checkpoint | Paper | GSM8k | MATH | Online Demo | License |
| --- | --- | --- | --- | --- | --- | --- |
| WizardMath-70B-V1.0 | 🤗 HF Link | 📃 [WizardMath] | 81.6 | 22.7 | Demo | Llama 2 |
| WizardMath-13B-V1.0 | 🤗 HF Link | 📃 [WizardMath] | 63.9 | 14.0 | Demo | Llama 2 |
| WizardMath-7B-V1.0 | 🤗 HF Link | 📃 [WizardMath] | 54.9 | 10.7 | Demo | Llama 2 |

| Model | Checkpoint | Paper | MT-Bench | AlpacaEval | GSM8k | HumanEval | License |
| --- | --- | --- | --- | --- | --- | --- | --- |
| WizardLM-70B-V1.0 | 🤗 HF Link | 📃 Coming Soon | 7.78 | 92.91% | 77.6% | 50.6 pass@1 | Llama 2 License |
| WizardLM-13B-V1.2 | 🤗 HF Link | | 7.06 | 89.17% | 55.3% | 36.6 pass@1 | Llama 2 License |
| WizardLM-13B-V1.1 | 🤗 HF Link | | 6.76 | 86.32% | | 25.0 pass@1 | Non-commercial |
| WizardLM-30B-V1.0 | 🤗 HF Link | | 7.01 | | | 37.8 pass@1 | Non-commercial |
| WizardLM-13B-V1.0 | 🤗 HF Link | | 6.35 | 75.31% | | 24.0 pass@1 | Non-commercial |
| WizardLM-7B-V1.0 | 🤗 HF Link | 📃 [WizardLM] | | | | 19.1 pass@1 | Non-commercial |

Github Repo: https://github.com/nlpxucan/WizardLM/tree/main/WizardMath

Twitter: https://twitter.com/WizardLM_AI/status/1689998428200112128

Discord: https://discord.gg/VZjjHtWrKs

Inference WizardLM Demo Script

We provide the WizardLM inference demo code here.
