moeviz

Visualize token routing in mixture-of-experts models.

Setup

The visualization client uses Vite and D3, which require Node.js 18+ or 20+. Download Node.js from https://nodejs.org/en/download (v20.19.4 recommended).
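
As a quick sanity check before installing the client, the version requirement above can be tested against the output of `node --version`. This is only a sketch: the helper name `node_version_ok` is hypothetical and not part of moeviz, and the minimum major version of 18 is taken from the requirement stated above.

```python
import re

# Minimum major version, taken from the requirement stated in this README.
MIN_NODE_MAJOR = 18


def node_version_ok(version: str, min_major: int = MIN_NODE_MAJOR) -> bool:
    """Hypothetical helper: check a `node --version` string such as 'v20.19.4'."""
    match = re.match(r"v?(\d+)\.(\d+)\.(\d+)", version.strip())
    if match is None:
        raise ValueError(f"unrecognized Node.js version string: {version!r}")
    # Only the major version is compared against the minimum.
    return int(match.group(1)) >= min_major
```

For example, `node_version_ok("v20.19.4")` passes the check, while a string such as `"v16.20.2"` does not.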

Then, serve the client:

cd client
npm install
npm run dev

Use Poetry to set up and activate a Python environment. Then:

# Install in editable mode
poetry install

# Start the server
poetry run python3 moeviz/server.py

Adding new models manually

A model can be made available in one of two ways. Either download it from Hugging Face (via huggingface-cli), in which case model_id should be the repo id (e.g. ``), or keep it locally as a cloned Hugging Face repo, in which case model_id is the relative path to the directory containing the checkpoint you want to target (e.g. models/mixtral_5_6gpu/last-checkpoint).
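
The two forms of model_id can be told apart by checking whether the value names an existing directory. The sketch below illustrates that distinction only; `classify_model_id` and its `base_dir` parameter are hypothetical names, and moeviz may resolve model ids differently.

```python
from pathlib import Path


def classify_model_id(model_id: str, base_dir: str = ".") -> str:
    """Hypothetical helper: return 'local' or 'hub' for a given model_id.

    A model_id that resolves to an existing directory (relative to base_dir)
    is treated as a local checkpoint; anything else is assumed to be a
    Hugging Face repo id.
    """
    if (Path(base_dir) / model_id).is_dir():
        return "local"
    return "hub"
```

With a cloned repo in place, `classify_model_id("models/mixtral_5_6gpu/last-checkpoint")` would return `"local"`, while an id with no matching directory falls through to `"hub"`.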

When downloading repositories from HuggingFace, make sure you are using git lfs to pull large files:

# Make sure git-lfs is installed (https://git-lfs.com)
git lfs install

# When prompted for a password, use an access token with write permissions.
# Generate one from your settings: https://huggingface.co/settings/tokens
git clone https://huggingface.co/i-be-snek/mixtral_5_6gpu

To add a new model to the interface, add its configuration to both moeviz/config.py and client/src/config.js.

Configuration

moeviz can be configured using environment variables:

Environment Variable	Description	Default
`MOEVIZ_SERVER_HOST`	Server host address	`0.0.0.0`
`MOEVIZ_SERVER_PORT`	Server port	`8000`
`MOEVIZ_BASE_URL`	Base URL for client connections	`http://{host}:{port}`
`MOEVIZ_ENABLE_CORS`	Enable CORS for API	`true`
`MOEVIZ_MAX_NEW_TOKENS`	Max tokens for generation	`128`
`MOEVIZ_THREAD_POOL_WORKERS`	Number of worker threads	`1`
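
To make the defaults in the table concrete, here is a minimal sketch of how these variables could be read in Python. It is not the actual moeviz/config.py: the function name `load_settings`, the dictionary keys, and the set of strings accepted as "true" for `MOEVIZ_ENABLE_CORS` are all assumptions made for illustration.

```python
import os


def load_settings(env=None) -> dict:
    """Hypothetical loader mirroring the variables and defaults in the table."""
    env = os.environ if env is None else env
    host = env.get("MOEVIZ_SERVER_HOST", "0.0.0.0")
    port = int(env.get("MOEVIZ_SERVER_PORT", "8000"))
    # Assumption: these strings count as "true"; the real parsing may differ.
    cors_raw = env.get("MOEVIZ_ENABLE_CORS", "true").strip().lower()
    return {
        "host": host,
        "port": port,
        # The base URL default is derived from host and port, as in the table.
        "base_url": env.get("MOEVIZ_BASE_URL", f"http://{host}:{port}"),
        "enable_cors": cors_raw in ("1", "true", "yes"),
        "max_new_tokens": int(env.get("MOEVIZ_MAX_NEW_TOKENS", "128")),
        "thread_pool_workers": int(env.get("MOEVIZ_THREAD_POOL_WORKERS", "1")),
    }
```

Passing a plain dictionary instead of the process environment, for example `load_settings({"MOEVIZ_SERVER_PORT": "9000"})`, makes it easy to see how overriding one variable changes the derived base URL.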

Examples

# Run on port 9000
MOEVIZ_SERVER_PORT=9000 poetry run python3 moeviz/server.py

# Custom server base URL for client (e.g., when behind a proxy)
MOEVIZ_BASE_URL=https://moeviz.example.com poetry run python3 moeviz/server.py
