Qwen

Getting Started

  • Quickstart
  • Key Concepts

Inference

  • Using Transformers to Chat

Run Locally

  • Ollama
  • MLX-LM
  • llama.cpp

Web UI

  • Text Generation Web UI

Quantization

  • AWQ
  • GPTQ
  • llama.cpp

Deployment

  • vLLM
  • TGI
  • SkyPilot

Training

  • SFT
    • LLaMA-Factory

Framework

  • Function Calling
  • Qwen-Agent
  • LlamaIndex
  • LangChain

Benchmark

  • Performance of Quantized Models
  • Speed Benchmark
Copyright © 2024, Qwen Team