We built Truss because we were frustrated with the long feedback loops in ML model deployment. When you have to wait for your server to rebuild every time you make a change, iteration is painful. Meanwhile, web developers have enjoyed live reload workflows for years, where changes are patched onto a running server and available almost instantly. With Truss, you get the same live reload workflow for serving ML models.Documentation Index
Fetch the complete documentation index at: https://baseten-philip-copy-0226.mintlify.app/llms.txt
Use this file to discover all available pages before exploring further.

Create a Truss
truss init creates those files in a target directory.
To package a model with Truss, follow the quickstart, a step-by-step example, or an example from GitHub.
See the CLI reference for more information on truss init.
Spin up model server
truss push to spin up a model server with your model and config.
See the CLI reference for more information on truss init.
Test the model
truss predict.
See the CLI reference for more information on truss predict.
Watch for changes
truss watch running, it will automatically attempt to patch that change onto the model server. Most changes to model.py and config.yaml can be patched.
See the CLI reference for more information on truss watch.
Publish your model
Once you’re happy with your model, stoptruss watch and run: