Publications

Papers and published research

DatBench leads the page as the current benchmark story, followed by earlier papers in multimodal representation learning and medical imaging.

Featured research

Current work on making vision-language model evaluation more discriminative, more faithful, and practical enough to run inside real research loops.

Research archive

Earlier papers

Earlier work is presented as a quieter reading list, with links out to the paper, code, or publisher page where available.

UniCat
arXiv 2023

UniCat: Crafting a Stronger Fusion Baseline for Multimodal Re-Identification

J. Crawford, H. Yin, L. McDermott, D. Cummings

A stronger multimodal fusion baseline for re-identification, paired with an open-source release.

GraFT
arXiv 2023

GraFT: Gradual Fusion Transformer for Multimodal Re-Identification

H. Yin, J. Li, E. Schiller, L. McDermott, D. Cummings

Transformer-based multimodal re-identification focused on stronger fusion and cleaner representation learning.

SpecReFlow
Journal of Medical Imaging 2024

SpecReFlow: an algorithm for specular reflection restoration using flow-guided video completion

H. Yin, R. Eimen, D. Moyer, A. K. Bowden

Reflection-aware restoration of medical video that combines optical-flow guidance with video completion to remove specular highlights.