Welcome to Qwen!
================

.. figure:: https://qianwen-res.oss-accelerate-overseas.aliyuncs.com/assets/logo/qwen2.5_logo.png
   :width: 60%
   :align: center
   :alt: Qwen2.5
   :class: no-scaled-link

Qwen is the large language model and large multimodal model series of the Qwen Team, Alibaba Group.
The large language models have now been upgraded to Qwen2.5.
Both language models and multimodal models are pretrained on large-scale multilingual and multimodal data and post-trained on high-quality data to align with human preferences.
Qwen is capable of natural language understanding, text generation, vision understanding, audio understanding, tool use, role play, acting as an AI agent, and more.

The latest version, Qwen2.5, has the following features:

- Dense, easy-to-use, decoder-only language models, available in **0.5B**, **1.5B**, **3B**, **7B**, **14B**, **32B**, and **72B** sizes, in both base and instruct variants.
- Pretrained on our latest large-scale dataset, encompassing up to **18T** tokens.
- Significant improvements in instruction following, generating long texts (over 8K tokens), understanding structured data (e.g., tables), and generating structured outputs, especially JSON.
- More resilient to the diversity of system prompts, enhancing role-play implementation and condition-setting for chatbots.
- Context lengths of up to **128K** tokens, with generation of up to **8K** tokens.
- Multilingual support for over **29** languages, including Chinese, English, French, Spanish, Portuguese, German, Italian, Russian, Japanese, Korean, Vietnamese, Thai, Arabic, and more.

For more information, please visit our:

* `Blog `__
* `GitHub `__
* `Hugging Face `__
* `ModelScope `__
* `Qwen2.5 Collection `__

Join our community through our `Discord `__ and `WeChat `__ groups. We look forward to seeing you there!

.. toctree::
   :maxdepth: 1
   :caption: Getting Started
   :hidden:

   getting_started/quickstart
   getting_started/concepts

.. toctree::
   :maxdepth: 1
   :caption: Inference
   :hidden:

   inference/chat

.. toctree::
   :maxdepth: 1
   :caption: Run Locally
   :hidden:

   run_locally/ollama
   run_locally/mlx-lm
   run_locally/llama.cpp

.. toctree::
   :maxdepth: 1
   :caption: Web UI
   :hidden:

   web_ui/text_generation_webui

.. toctree::
   :maxdepth: 1
   :caption: Quantization
   :hidden:

   quantization/awq
   quantization/gptq
   quantization/llama.cpp

.. toctree::
   :maxdepth: 1
   :caption: Deployment
   :hidden:

   deployment/vllm
   deployment/tgi
   deployment/skypilot
   deployment/openllm

.. toctree::
   :maxdepth: 2
   :caption: Training
   :hidden:

   training/SFT/index
   training/RL/index

.. toctree::
   :maxdepth: 1
   :caption: Framework
   :hidden:

   framework/function_call
   framework/qwen_agent
   framework/LlamaIndex
   framework/Langchain

.. toctree::
   :maxdepth: 1
   :caption: Benchmark
   :hidden:

   benchmark/quantization_benchmark
   benchmark/speed_benchmark