ICU Time-Series Prediction Benchmark
Comprehensive comparison of LLMs and baseline models on ICU prediction tasks
Task
All Tasks
Dataset
All Datasets
Model Type
All Models
LLMs Only
Baselines Only
Primary Metric
AUROC
AUPRC
MCC
F1 Score
-
Total Models
-
Tasks
-
Datasets
-
Best Model
Model Performance Comparison
Task-wise Performance
LLM vs Baseline Comparison
Detailed Results Table
Model
Type
Task
Dataset
AUROC
AUPRC
MCC
F1 Score
Load your JSON data to see results