The computing power available to run an AI model on behalf of the user. In AI, inference is the process of generating answers from a trained model, and the software that performs it is called the inference engine. The inference "engine" uses an AI model that has been ...