Error using function callbacks on custom inference endpoint

devz92 · August 2, 2025, 9:56am

Hi guys, I’m having some problems related to the tool calling
testing it with my custom inference endpoint

using this example:

const testTools = [
      {
        "type": "function",
        "function": {
          "name": "get_number",
          "description": "Get the number",
          "parameters": {
            "type": "object",
            "properties": {
              "number": {
                "type": "string",
                "description": "the number to get",
              },
            },
            "required": ["number"],
          },
        },
      },
    ]

I’m not able to make it work
without the tool, it works perfectly, but adding it to the request payload makes the response something like this:

"error\\\": \\\"\\\\\\\"Tool error: Failed to parse generated text: expected value at line 1 column 1 \\\\\\\\\\\\\\\"<|python_tag

this is the call I’m doing

    const response = await inferenceClient.chatCompletion({
      endpointUrl: endpointURL,
      messages: messages.map((message) => ({
        role: message.role,
        content: message.content,
      })),
      max_tokens: xxx,
      temperature: xxx,
      tools: testTools,
      tool_choice: 'auto'
    });

Maybe I’m missing something? or is anyone having the same problem?
thanks in advance for your help!

John6666 · August 2, 2025, 11:17am

There seems to be some incompatibility between TGI and Llama’s Function Calling, so you may need to use a slightly hacky workaround.

Topic		Replies	Views
Tool calling gets stuck in an infinite loop Inference Endpoints on the Hub	2	712	April 12, 2025
Function calling not working with inference clients on (seemingly) any model Beginners	10	1032	February 8, 2025
Inference Endpoints - No working code examples Inference Endpoints on the Hub	3	232	January 29, 2025
Endpoint issue with GPTQ Inference Endpoints on the Hub	0	238	January 23, 2024
Unable to get inference results after deploying model to Inferende Endpoints Inference Endpoints on the Hub	0	31	May 8, 2025

Error using function callbacks on custom inference endpoint

Resources

Related topics