I am on the AWS SageMaker team. For “Real-Time Inference” you are only charged for:
- usage of the instance types you choose (instance hours)
- storage attached to those instances (GB storage hours)
- data in and out of your Endpoint (Bytes in/out)
See “Pricing Example #6: Real-Time Inference” as well.
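As a rough illustration of how those three charges add up, here is a minimal sketch in Python. The names `EndpointUsage`, `Rates`, and `estimate_cost` are hypothetical helpers, not part of any AWS SDK, and the rates in the usage lines are placeholders rather than real prices. It also assumes data transfer is billed per GB, with separate rates for in and out. Take the actual rates for your region and instance type from the SageMaker pricing page.

```python
from dataclasses import dataclass


@dataclass
class EndpointUsage:
    """Usage for one billing period (hypothetical helper type)."""
    instance_hours: float    # hours the endpoint instances were running
    storage_gb_hours: float  # GB of attached storage multiplied by hours
    data_in_gb: float        # data sent to the endpoint, in GB
    data_out_gb: float       # data returned by the endpoint, in GB


@dataclass
class Rates:
    """Per-unit prices; fill these in from the pricing page."""
    per_instance_hour: float
    per_storage_gb_hour: float
    per_gb_in: float
    per_gb_out: float


def estimate_cost(usage: EndpointUsage, rates: Rates) -> dict:
    """Return the three charge components and their total."""
    instance = usage.instance_hours * rates.per_instance_hour
    storage = usage.storage_gb_hours * rates.per_storage_gb_hour
    data = (usage.data_in_gb * rates.per_gb_in
            + usage.data_out_gb * rates.per_gb_out)
    return {
        "instance": instance,
        "storage": storage,
        "data": data,
        "total": instance + storage + data,
    }


# Placeholder numbers only; these are NOT real AWS prices.
usage = EndpointUsage(instance_hours=720, storage_gb_hours=720 * 5,
                      data_in_gb=10, data_out_gb=10)
rates = Rates(per_instance_hour=0.25, per_storage_gb_hour=0.0002,
              per_gb_in=0.016, per_gb_out=0.016)
print(estimate_cost(usage, rates))
```

Note that instance hours accrue for as long as the endpoint is running, regardless of how many requests it serves, so that term usually dominates.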