// inferenceport relay

sends trimmed conversation + system prompt → api → reply

reply