Variable Arguments Python

How to choose the best LLM using R and vitals

Use the vitals package with ellmer to evaluate and compare the accuracy of LLMs, including writing evals to test local models ...

GitHub

KorMedMCQA-V: A Multimodal Benchmark for Evaluating Vision-Language Models on the Korean Medical Licensing Examination

We introduce KorMedMCQA-V, a Korean medical licensing-exam-style multimodal multiple-choice question answering benchmark for evaluating vision-language models (VLMs). The dataset consists of 1,534 ...

GitHub

MNN Python Model Inference

A ready-to-use Python pipeline for running machine learning model inference using MNN (Mobile Neural Network). It handles the complete flow from loading an image, preprocessing it, executing the model ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

How to choose the best LLM using R and vitals

KorMedMCQA-V: A Multimodal Benchmark for Evaluating Vision-Language Models on the Korean Medical Licensing Examination

MNN Python Model Inference

Trending now