News
The zero-shot performance concerning medical evidence summarization was evaluated using two models, GPT-3.5 and ChatGPT. Two experimental setups were designed to assess the models' capabilities.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results