Criteria Tree¶

What it is¶

The Criteria Tree is the rubric model used by the grader. It defines what is evaluated and how much each part contributes.

Top-level categories:

base (required)
bonus (optional)
penalty (optional)

Each category can contain nested subjects and tests.

Why it matters¶

The criteria tree is where educational policy becomes executable logic:

Transparency: students see what was expected.
Fairness: every submission is graded with the same rubric structure.
Maintainability: rubric changes are config updates, not code rewrites.

How scoring works¶

Tests produce scores in the 0-100 range.
Subjects aggregate child tests/subjects by weight.
Categories aggregate their children by weight.
Final score is computed in ResultTree root:

final = base + bonus_contribution - penalty_contribution

The root score is capped to [0, 100].

Key concepts you should model explicitly¶

1. Weight hierarchy¶

Weights are local to sibling groups. A test's final impact depends on all ancestor weights.

2. `subjects_weight`¶

When a node has both direct tests and child subjects, subjects_weight splits influence between the two groups.

3. Educational intent¶

Use subject names that represent real learning objectives ("Input validation", "API contracts", "Accessibility"), not implementation jargon.

Concrete configuration example¶

{
  "base": {
    "weight": 100,
    "subjects": [
      {
        "subject_name": "Correctness",
        "weight": 70,
        "tests": [
          {"name": "expect_output", "weight": 70, "parameters": {"expected_output": "42", "inputs": ["10"]}},
          {"name": "dont_fail", "weight": 30, "parameters": {"user_input": "10"}}
        ]
      },
      {
        "subject_name": "Code Quality",
        "weight": 30,
        "tests": [
          {"name": "forbidden_import", "weight": 100, "parameters": {"forbidden_imports": ["os.system"]}}
        ]
      }
    ]
  }
}

Common mistakes¶

Setting arbitrary weights without tying them to learning objectives
Overusing deeply nested subjects that are hard for students to interpret
Mixing "must-have" and "nice-to-have" requirements in the same weighted group