Benchmark Dataloader
A benchmarking setup for multimodal dataloaders, built to surface throughput bottlenecks before they become training-time surprises.
Member of Technical Staff at Datology
At Datology, I work across evaluation, curation, distributed training, and launch infrastructure. I try to keep the path from a data choice to a trustworthy comparison short.
The interesting cases are the ones where the benchmark and the training run finally line up with what the data pipeline is doing.I like research surfaces that stay rigorous without feeling bureaucratic.

Research engineer working across multimodal data, evaluation, and launch systems.

Public work on VLM evaluation and benchmark design.
Read paperOpen systems work for finding dataloader bottlenecks before they burn training time.
View repoThree operating lanes keep the loop moving. They cover benchmark design, data handling, and launch.
Built evaluation paths that keep model comparisons useful for curation decisions and model iteration.
Built ingestion and export paths that make large multimodal corpora easier to train on and inspect.
Added vLLM eval support and hardened multi-node launch plus checkpoint behavior for faster experimental turnover.
Open projects that show the same systems taste at a smaller scale.