Hello. First of all, thank you for your work and for sharing it!
I am implementing the evaluation process of OWOD (defined in `PascalVOCDetectionEvaluator.evaluate`) and have run into several problems while trying to apply the benchmark correctly to my own work:

1. For every task and every metric (even WI), I assume you always evaluate against `all_task_test.txt`. Is that right?
2. For the Wilderness Impact, WI values are reported at different recall levels. Which one do you choose?
3. Are instances marked as "difficult" used for the metrics?
4. Does the reported number of test instances for T1 in the image below from your paper refer only to the KNOWN classes, or does it account for both KNOWN and UNKNOWN objects?
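For context on question 2, here is a minimal sketch of how I currently compute WI, assuming the definition WI = P_K / P_{K∪U} − 1 evaluated at a single recall level (the function names and the default level R = 0.8 are my assumptions, not taken from your code):

```python
def precision_at_recall(precisions, recalls, level):
    """Return the precision at the first point where recall reaches `level`.

    `precisions` and `recalls` are parallel sequences from a PR curve,
    with `recalls` non-decreasing. Returns 0.0 if `level` is never reached.
    """
    for p, r in zip(precisions, recalls):
        if r >= level:
            return p
    return 0.0


def wilderness_impact(prec_known, rec_known, prec_open, rec_open, level=0.8):
    """Sketch of WI = P_K / P_{K+U} - 1 at a given recall level.

    `prec_known`/`rec_known`: PR curve when evaluated on known classes only.
    `prec_open`/`rec_open`:   PR curve when unknown-class instances are
                              also present (open-world evaluation).
    """
    p_k = precision_at_recall(prec_known, rec_known, level)
    p_ku = precision_at_recall(prec_open, rec_open, level)
    return p_k / p_ku - 1.0


# Toy example: precision drops from 0.8 to 0.6 at recall 0.8 when
# unknowns are added, giving WI = 0.8 / 0.6 - 1 = 1/3.
wi = wilderness_impact(
    prec_known=[0.9, 0.8, 0.7], rec_known=[0.5, 0.8, 1.0],
    prec_open=[0.8, 0.6, 0.5], rec_open=[0.5, 0.8, 1.0],
)
```

Is this roughly what `PascalVOCDetectionEvaluator.evaluate` does, and if so, which recall level corresponds to the numbers in the paper?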