[Apibench] How to evaluate a model by openai API? #592

djstrong · 2024-08-21T15:12:59Z

I have remotely hosted vllm models. How to evaluate them?

HuanzhiMao · 2024-08-23T20:14:37Z

Hey @djstrong, just want to double-check, are you referring to the evaluation in Apibench or the Berkeley Function Calling Leaderboard (BFCL)?

djstrong · 2024-08-24T11:21:55Z

Sorry, I mean BFCL.

HuanzhiMao · 2024-08-24T23:50:54Z

Take a look at the instructions here. Let me know if you have more questions!

djstrong added the apibench-data APIBench data label Aug 21, 2024

HuanzhiMao added BFCL-General General BFCL Issue and removed apibench-data APIBench data labels Aug 22, 2024

Provide feedback