Run Benchmark for custom model #2

AnFreTh · 2024-11-14T14:14:28Z

Hi,

First of all, great code and paper!
I wanted to run a simple model and compare it with your results. Is there an easy way to run, for example, a simple sklearn model so that I can directly compare it to the results reported in your paper?

puhsu · 2024-11-14T20:14:45Z

Hi! I'm glad to hear that you are interested in the benchmark!

We plan to add a more streamlined way to set up and process the datasets.

Until then, the steps to use TabReD are as follows:

1. Create an environment (following the README instructions).
2. Run `mkdir data`.
3. Run `python preprocessing/<dataset-name>.py` for each dataset (it should be quick; the longest parts are the downloads).
4. Modify any of the bin scripts as appropriate. I suggest looking at bin/xgboost.py and switching the model for the one you would like to test (the GBDT implementations are sklearn-API compatible).
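The last step works because any object with sklearn-style `fit` and `predict` methods can stand in for another. The sketch below illustrates only that idea and is not code from the repository: `MeanBaseline`, `rmse`, `evaluate`, and the toy data are hypothetical names made up for illustration, RMSE is an assumed metric, and the actual structure of bin/xgboost.py may differ.

```python
import math


class MeanBaseline:
    """Hypothetical stand-in model exposing the sklearn-style fit/predict interface."""

    def fit(self, X, y):
        # Remember the mean training target; the features are ignored.
        self.mean_ = sum(y) / len(y)
        return self

    def predict(self, X):
        # Predict the stored mean for every row.
        return [self.mean_ for _ in X]


def rmse(y_true, y_pred):
    """Root mean squared error (an assumed metric, chosen for illustration)."""
    squared = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared) / len(squared))


def evaluate(model, X_train, y_train, X_test, y_test):
    """Fit any fit/predict-compatible model and score it on the test split."""
    model.fit(X_train, y_train)
    return rmse(y_test, model.predict(X_test))


# Toy data standing in for a preprocessed dataset (assumption: the real
# scripts load the splits from the data directory in their own format).
X_train = [[0.0], [1.0], [2.0], [3.0]]
y_train = [1.0, 2.0, 3.0, 4.0]
X_test = [[4.0], [5.0]]
y_test = [3.5, 1.5]

score = evaluate(MeanBaseline(), X_train, y_train, X_test, y_test)
print(f"test RMSE: {score:.4f}")
```

With scikit-learn installed, an estimator such as `sklearn.linear_model.Ridge()` could be passed to `evaluate` in place of `MeanBaseline()` without changing anything else, which is the kind of swap suggested for bin/xgboost.py.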

I'll keep the issue open for now, until we have a more streamlined setup for dataset preparation.
