Secure LLM inference API running in GPU Trusted Execution Environment (TEE) for confidential AI computing.
GPU TEE
checkbox:
https://api.red-pill.ai/v1/chat/completions
. A simple request could be like:
signing_address
is the account address generated inside TEE that will be used to sign the chat message later.
The all_attestations
is the list of all the attestations of all GPU nodes since we add more TEE nodes to serve the inference requests. You can utilize the signing_address
from the all_attestations
to select the appropriate TEE node for verifying its integrity.
intel_quote
at TEE Attestation Explorer.
The signing_address
is the account address generated inside TEE that will be used to sign the chat response. You can go to https://etherscan.io/verifiedSignatures, click Verify Signature, and paste the signing_address
and message response to verify it.
nvidia_payload
and intel_quote
are the attestation report from NVIDIA TEE and Intel TEE respectively. You can use them to verify the integrity of the TEE. See Verify the Attestation for more details.
Note: The trust chain works as follows: when you verify the attestation report, you trust the model provider (Redpill) and the TEE providers (NVIDIA and Intel). You then trust the open-source, reproducible code by verifying the source code here. Finally, you trust the cryptographic key derived inside the TEE. This is why we only need to verify the signature of the message during chat.
id
which you can use to get the chat Signature later.
Sample Request
id
is calculated by sha256sum(sha256sum(request_body) + sha256sum(response_body)).
https://api.red-pill.ai/v1/signature/{request_id}?model={model_id}&signing_algo=ecdsa
For example, the response in the previous section, the id is chatcmpl-0cdf7629fcfa4135bbdb9936e737e95c
:
Response