Qwen3-0.6B โ€” Fine-tuned with GRPO on Agent Safety

Trained using OpenEnv + TRL GRPO on the Agent Safety environment as part of the Meta PyTorch OpenEnv Hackathon.

Training Details

  • Environment: agent_safety_env
  • Algorithm: GRPO (Group Relative Policy Optimization)
  • Episodes: 16
  • Reward: Partial credit per safety criterion

Environment

  • HF Space: https://huggingface.co/spaces/amulyalakku/agent-safety-env
Downloads last month
39
Safetensors
Model size
0.6B params
Tensor type
F32
ยท
Video Preview
loading