UI guide - Weights & Biases Documentation

Learn how to use the Serverless Inference service through the web UI. This guide shows you how to access the service, try models in the Playground, compare models side by side, and monitor billing and usage. Use the UI when you want to experiment with models interactively without writing code. Before you use the UI, complete the prerequisites.

Access the Inference service

The following sections describe three ways to access the Inference service. Choose whichever entry point best matches your current workflow.

Direct link

Navigate to https://wandb.ai/inference.

From the Inference tab

Navigate to your W&B account at https://wandb.ai/.
Select Inference from the project sidebar. A page appears with available models and model information.

Using an Inference model in the Playground

From the Playground tab

Select Playground from the project sidebar. The Playground chat UI appears.
Hover over Serverless Inference in the LLM dropdown list. A dropdown with available models appears on the right.
From the models dropdown, you can:
- Click any model name to try it in the Playground.
- Compare multiple models

The Inference models dropdown in Playground

Try a model in the Playground

After you select a model, you can test it in the Playground to evaluate its responses and tune its behavior before you integrate it elsewhere. Available actions include:

Compare multiple models

You can compare Inference models side by side in the Playground to evaluate quality, latency, and output style for the same prompt. You can access the Compare view from two places, described in the following sections.

From the Inference tab

Select Inference from the project sidebar. The available models page appears.
Click anywhere on a model card (except the model name) to select it. The card is highlighted to show it’s selected.
Repeat for each model you want to compare.
Click Compare [N] models in the Playground on any selected card, where [N] is the number of models you selected. The comparison view opens.

Now you can compare models and use all features from Try a model in the Playground.

Select multiple models to compare in Playground

From the Playground tab

Select Playground from the project sidebar. The Playground chat UI appears.
Hover over Serverless Inference in the LLM dropdown list. The models dropdown appears on the right.
Select Compare from the dropdown. The Inference tab appears.
Click anywhere on a model card (except the model name) to select it. The card is highlighted to show it’s selected.
Repeat for each model you want to compare.
Click Compare [N] models in the Playground on any selected card. The comparison view opens.

Now you can compare models and use all features from Try a model in the Playground.

View billing and usage information

Organization admins can track credit balance, usage history, and upcoming bills from the W&B UI. Use this view to monitor spend and forecast upcoming charges.

Navigate to the W&B Billing page in the UI.
Find the Inference billing information card.
From here you can:
- Click View usage to see your usage over time.
- View upcoming inference charges (for paid plans).

Visit the Inference pricing page for per-model pricing details.

Next steps

Review available models to find the best one for your needs.
Try the API for programmatic access.
See usage examples for code samples.

Documentation Index

​Access the Inference service

​Direct link

​From the Inference tab

​From the Playground tab

​Try a model in the Playground

​Compare multiple models

​From the Inference tab

​From the Playground tab

​View billing and usage information

​Next steps

Access the Inference service

Direct link

From the Inference tab

From the Playground tab

Try a model in the Playground

Compare multiple models

From the Inference tab

From the Playground tab

View billing and usage information

Next steps