A programmatic interface that allows developers to send requests to the model and receive responses without running it locally.