How scoring works

Per task score

Every task defines its own metric, which is used to score your submissions. Only your best score per task counts on the leaderboard. A short cooldown applies between submissions, typically five minutes.

Public and final scores

While the hackathon is live, the leaderboard shows your public score, computed on a public subset of the evaluation set. After submissions close, the displayed score switches to the final score, computed on the held out subset. That final score determines the overall ranking. The split is fixed and identical for every team. The ratio between the public and held out subsets depends on the task:

Overall ranking

For each task, scores are normalized to the [0, 1] range using the lowest and highest scores across all teams. For tasks where lower raw scores are better (such as DUCI), the normalization is inverted so that 1.0 always represents the best result on that task. A team's overall rank is the average of its normalized scores across every task. Teams missing a score on any task are not ranked overall.

CISPA Hackathon Warsaw

Final leaderboard across all tasks. Official results locked.

Final leaderboard
Tasks: DUCI · Memorization in LMMs · LLM Watermark Detection
Powered by CISPA · Warsaw Edition
RankTeamScore
1 BatchNormies3d 0.039300
2 CyberDzik Syndicate 0.041277
3 SPQR 0.047673
4 zer0_day 0.054026
5 ParmaGo 0.070000
6 Ascari 0.070648
7 Świta Znachora 0.074167
8 Syntax Terror 0.086667
9 Sakura 0.098799
10 Team Nepal 0.105606
11 Cyberpros 0.109000
12 outcasts 0.116333
13 ADK 0.121286
14 GradientLabs 0.153317
15 zakaz somsa 0.179028
16 4aufKind 0.236471
17 Czumpers 0.248961
18 Advanced Persistent Thinkers 0.251536
19 TQ2 0.254343
20 S.P.Q.L. 0.276000
21 KNUM MIMUW 0.298750
22 DSC_A/B 0.306150
23 Seal 0.338167
24 KAMP 0.472333
RankTeamScore
1 Czumpers 0.532625
2 zer0_day 0.511518
3 Advanced Persistent Thinkers 0.488446
4 Sakura 0.482405
5 Syntax Terror 0.478029
6 DSC_A/B 0.452852
7 TQ2 0.442570
8 4aufKind 0.418443
9 outcasts 0.414348
10 CyberDzik Syndicate 0.409447
11 Świta Znachora 0.403509
12 Cyberpros 0.388725
13 SPQR 0.384496
14 GradientLabs 0.384219
15 BatchNormies3d 0.383580
16 KNUM MIMUW 0.375768
17 S.P.Q.L. 0.370659
18 Ascari 0.320593
19 ParmaGo 0.291723
20 ADK 0.286566
21 Team Nepal 0.255757
22 zakaz somsa 0.226546
23 Seal 0.226546
RankTeamScore
1 Syntax Terror 0.356397
2 Advanced Persistent Thinkers 0.319843
3 zer0_day 0.285901
4 S.P.Q.L. 0.261097
5 4aufKind 0.258486
6 SPQR 0.255875
7 Czumpers 0.250653
8 GradientLabs 0.249347
9 outcasts 0.194517
10 BatchNormies3d 0.187990
11 TQ2 0.161880
12 ParmaGo 0.148825
13 Ascari 0.148825
14 CyberDzik Syndicate 0.139687
15 Cyberpros 0.131854
16 DSC_A/B 0.120104
17 ADK 0.112272
18 Team Nepal 0.105744
19 Sakura 0.099217
20 Świta Znachora 0.090078
21 Seal 0.088773
22 zakaz somsa 0.080940
23 KAMP 0.078329
24 KNUM MIMUW 0.066580
RankTeamAvg Rank
1 Syntax Terror 0.90
2 zer0_day 0.88
3 Advanced Persistent Thinkers 0.75
4 Czumpers 0.72
5 SPQR 0.72
6 BatchNormies3d 0.64
7 GradientLabs 0.63
8 outcasts 0.63
9 CyberDzik Syndicate 0.62
10 4aufKind 0.61
11 Sakura 0.60
12 S.P.Q.L. 0.53
13 Cyberpros 0.53
14 Świta Znachora 0.53
15 TQ2 0.51
16 Ascari 0.51
17 ParmaGo 0.48
18 DSC_A/B 0.44
19 ADK 0.39
20 Team Nepal 0.36
21 KNUM MIMUW 0.30
22 zakaz somsa 0.24
23 Seal 0.13
Scores are final. You may refresh to re-render this page if needed.