Add files using upload-large-folder tool

6ba2f9b verified 4 months ago

2.06 kB

library_name: stable-baselines3
tags:
  - reinforcement-learning
  - stable-baselines3
  - deep-reinforcement-learning
  - fluidgym
  - active-flow-control
  - fluid-dynamics
  - simulation
  - RBC2D-easy-v0
model-index:
  - name: SAC-RBC2D-easy-v0
    results:
      - task:
          type: reinforcement-learning
          name: reinforcement-learning
        dataset:
          name: FluidGym-RBC2D-easy-v0
          type: fluidgym
        metrics:
          - type: mean_reward
            value: 0.65
            name: mean_reward

SAC on RBC2D-easy-v0 (FluidGym)

This repository is part of the FluidGym benchmark results. It contains trained Stable Baselines3 agents for the specialized RBC2D-easy-v0 environment.

Evaluation Results

Global Performance (Aggregated across 5 seeds)

Mean Reward: 0.65 ± 0.09

Per-Seed Statistics

Run	Mean Reward	Std Dev
Seed 0	0.63	0.19
Seed 1	0.70	0.24
Seed 2	0.77	0.25
Seed 3	0.66	0.28
Seed 4	0.49	0.15

About FluidGym

FluidGym is a benchmark for reinforcement learning in active flow control.

Usage

Each seed is contained in its own subdirectory. You can load a model using:

from stable_baselines3 import SAC
model = SAC.load("0/ckpt_latest.zip")

Important: The models were trained using fluidgym==0.0.2. In order to use them with newer versions of FluidGym, you need to wrap the environment with a FlattenObservation wrapper as shown below:

import fluidgym
from fluidgym.wrappers import FlattenObservation
from stable_baselines3 import SAC

env = fluidgym.make("RBC2D-easy-v0")
env = FlattenObservation(env)
model = SAC.load("path_to_model/ckpt_latest.zip")

obs, info = env.reset(seed=42)

action, _ = model.predict(obs, deterministic=True)
obs, reward, terminated, truncated, info = env.step(action)

safe-autonomous-systems
/

ma-sac-RBC2D-easy-v0