F1 Strategy RL Environment

Python, RL environments, deterministic verification, evaluation design

F1 Strategy RL Environment is an evaluation and training environment for race-strategy reasoning. It uses scenario-based tasks and deterministic verification to test whether an agent can make coherent strategic decisions under constrained conditions.

What it includes

  • OpenF1-derived scenarios for race strategy decisions
  • Deterministic verifiers and tool-use rubrics for grading outputs
  • Baseline comparisons with confidence intervals, ablations, and stress tests
  • Support for deeper reasoning modes and hosted RL training workflows

Project goal

The environment is designed to make strategy evaluation concrete. Rather than relying on subjective judgments, it turns domain-specific reasoning into tasks with explicit rules, testable outputs, and comparable baselines.

Code