Evaluate Own EvallWeb Portlet
Choose the task you want to consider
![]() Classification Classification evaluation scenario consists in comparing the labels produced by systems for each item with the value provided by the gold. | ![]() Ranking The Ranking evaluation scenario focuses on the priority relationships between items in both the gold and system outputs. | |
![]() Clustering Clustering can be interpreted as the problem of predicting if two items belong or not to the same group. | ![]() Diversification Diversification consists of ranking items according to their relevance while capturing diverse relevance aspects. | |
Select the configuration for the evaluation
![]() |
![]() | |||
Default | Customized | |||
Recommended configuration. EvALL will select the appropriate settings for you. | Choose the set of metrics you want to consider, or set the parameters of the metrics. |
Select the set of metrics for the evaluation
![]() |
![]() | |||
Full set of metrics | Customized set of metrics | |||
Including official evaluation metrics and also all metrics recommended by the EvALL toolkit. | Choose the set of metrics you want to consider, or set the parameters of the metrics. |
Select the settings for the evaluation report
![]() |
![]() |
![]() |
![]() | |||||
Generate pdf/latex report | Generate tsv report | Add metric descriptions | Add output verifications | |||||
Generate pdf/lates report. | Generate tsv report. | Include explanations and definitions for each of the metrics. | Include the results of the verification step for each of the inputs you provide (with warnings in case of inconsistent format). |
Results of the evaluation for the selected configuration
No preview
![]() | ![]() | ![]() | ||||
Report | Latex project | TSV files |