Metric Definitions
FUNCTION_PURPOSE: Function Purpose Identification
PARAMETER_CHECK: Parameter Order and Type Verification
RETURN_TYPE: Return Type Recognition
GENERAL_KNOWLEDGE: General Spatial Knowledge
| Rank | Model | Category | FUNCTION_PURPOSE | PARAMETER_CHECK | RETURN_TYPE | GENERAL_KNOWLEDGE | Average |
|---|---|---|---|---|---|---|---|
Metric Definitions
Execution Pass Rate: Query execution success rate (Level 2)
Syntactic Validity: SQL syntax correctness (Level 2)
FNR: Function Name Hit Rate (Level 3)
AMA: Argument Match Accuracy (Level 3)
ACC_ALL: Overall Accuracy (Level 4)
ACC_Geo: Geometry Query Accuracy (Level 4)
ACC_Other: Non-Geometry Query Accuracy (Level 4)
pass@k: Success rate within k attempts, reported for k = 1, 3, 5 (Level 5)
CV: Coefficient of Variation (Level 5)
SA: Semantic Alignment Score (Level 5)
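As a rough illustration of the two Level 5 stability metrics, the sketch below computes pass@k and CV from per-attempt outcomes. It assumes pass@k is the fraction of tasks solved by at least one of the first k attempts, and that CV is the population standard deviation divided by the mean. The leaderboard does not state its exact formulas, so both definitions, the function names, and the sample data are assumptions for illustration only.

```python
from statistics import mean, pstdev


def pass_at_k(attempts_per_task, k):
    """Fraction of tasks with at least one success among the first k attempts.

    Assumed definition; the leaderboard may use a different estimator.
    """
    if not attempts_per_task:
        raise ValueError("need at least one task")
    solved = sum(1 for attempts in attempts_per_task if any(attempts[:k]))
    return solved / len(attempts_per_task)


def coefficient_of_variation(scores):
    """Population standard deviation divided by the mean (assumed definition)."""
    mu = mean(scores)
    if mu == 0:
        raise ValueError("CV is undefined when the mean is zero")
    return pstdev(scores) / mu


# Hypothetical outcomes: one row per task, one boolean per attempt.
results = [
    [True, False, True, True, False],
    [False, False, True, False, False],
    [False, False, False, False, False],
    [False, True, False, False, False],
]

p1 = pass_at_k(results, 1)  # only the first task succeeds on attempt 1
p3 = pass_at_k(results, 3)  # three of four tasks succeed within 3 attempts
p5 = pass_at_k(results, 5)  # the third task never succeeds

# Hypothetical per-run accuracies for one model.
cv = coefficient_of_variation([0.4, 0.6])
```

Under these assumptions a lower CV means the model's scores vary less from run to run, while pass@k can only stay the same or rise as k increases.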
| Rank | Model | Category | Exec Rate | Syntax Valid | FNR | AMA | ACC_ALL | ACC_Geo | ACC_Other | pass@1 | pass@3 | pass@5 | CV | SA | ACC_EXPL | ACC_UNDER | Time | Tokens |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Metric Definitions
THR: Table Hit Rate (Level 3)
FHR: Field Hit Rate (Level 3)
FNR: Function Name Hit Rate (Level 3)
ACC_ALL: Overall Accuracy (Level 4)
ACC_Geo: Geometry Query Accuracy (Level 4)
Resource Consumption: Time (s) and Tokens
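The Level 3 hit rates (THR, FHR, FNR) can be read as set-overlap scores. The sketch below assumes a hit rate is the share of reference names (tables, fields, or functions) that also appear in the predicted query, compared case-insensitively. This definition, the helper name `hit_rate`, and the sample names are assumptions, not the leaderboard's documented scoring.

```python
def hit_rate(predicted, gold):
    """Share of reference names that also appear in the prediction.

    Assumed definition: case-insensitive set overlap divided by the
    number of distinct reference names.
    """
    gold_set = {name.lower() for name in gold}
    if not gold_set:
        # Nothing was required, so treat the prediction as a full hit.
        return 1.0
    pred_set = {name.lower() for name in predicted}
    return len(gold_set & pred_set) / len(gold_set)


# Hypothetical names extracted from a reference query and a model's query.
thr = hit_rate(predicted=["roads"], gold=["roads", "parcels"])
fnr = hit_rate(
    predicted=["st_intersects", "ST_Area", "ST_Buffer"],
    gold=["ST_Intersects", "ST_Area"],
)
```

Note that this variant does not penalize extra names in the prediction; a stricter score would also divide by the size of the predicted set.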
| Rank | Model | Category | Exec Rate | Syntax Valid | THR | FHR | FNR | ACC_ALL | ACC_Geo | pass@1 | pass@3 | pass@5 | CV | SA | ACC_EXPL | ACC_UNDER | Time | Tokens |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|