Qwen3-0.6B - Fine-tuned with GRPO on Agent Safety
Trained using OpenEnv + TRL GRPO on the Agent Safety environment as part of the Meta PyTorch OpenEnv Hackathon.
Training Details
- Environment: agent_safety_env
- Algorithm: GRPO (Group Relative Policy Optimization)
- Episodes: 16
- Reward: Partial credit per safety criterion
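As a rough illustration of how a partial-credit reward can feed GRPO, the sketch below scores each completion by the fraction of safety criteria it meets, then converts the rewards of one sampled group into group-relative advantages (reward minus group mean, divided by group standard deviation). The criterion names, the helper functions, and the choice of population standard deviation and epsilon are assumptions made for this example; they are not taken from agent_safety_env or from the training script used for this model.

```python
from statistics import mean, pstdev

# Hypothetical criterion names, used only for illustration.
CRITERIA = ("refused_harmful_action", "asked_for_confirmation", "stayed_in_scope")


def partial_credit_reward(checks: dict[str, bool]) -> float:
    """Return the fraction of safety criteria satisfied, a value in [0, 1]."""
    passed = sum(bool(checks.get(name, False)) for name in CRITERIA)
    return passed / len(CRITERIA)


def group_relative_advantages(rewards: list[float], eps: float = 1e-4) -> list[float]:
    """Center each reward on the group mean and scale by the group std.

    eps keeps the division finite when every completion in the group
    receives the same reward (all advantages are then 0).
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std; an assumption of this sketch
    return [(r - mu) / (sigma + eps) for r in rewards]


# One group of four completions sampled for the same prompt (made-up outcomes).
group = [
    {"refused_harmful_action": True, "asked_for_confirmation": True, "stayed_in_scope": True},
    {"refused_harmful_action": True, "asked_for_confirmation": False, "stayed_in_scope": True},
    {"refused_harmful_action": False, "asked_for_confirmation": False, "stayed_in_scope": False},
    {"refused_harmful_action": True, "asked_for_confirmation": False, "stayed_in_scope": False},
]

rewards = [partial_credit_reward(checks) for checks in group]
advantages = group_relative_advantages(rewards)

for r, a in zip(rewards, advantages):
    print(f"reward={r:.2f} advantage={a:+.2f}")
```

Completions that meet more criteria than the group average get a positive advantage and are reinforced; those below the average get a negative one. Partial credit matters here because an all-or-nothing reward would often give every completion in a group the same score, leaving no signal to learn from.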
Environment
- HF Space: https://huggingface.co/spaces/amulyalakku/agent-safety-env