Data Foundry
curation log
📖 Guidelines
✕ Clear filters

Verify dataset

Reviewer:

About — Data Foundry curation

This page is the curation log behind TabArena and BeyondArena: one record per candidate tabular dataset, tracking whether it belongs in the benchmark, why, how it should be split, and how it is processed. Curators triage the backlog here (edit → commit → PR); an AI assistant can draft a provisional triage (🤖) that a human then verifies.

Goal: assemble a high-quality, representative collection of real-world tabular ML tasks for an open, living benchmark — and keep the curation reasoning transparent and reproducible.

Links