LlamaGym

Author: community

Open source post

What was done

LlamaGym simplifies fine-tuning LLM agents with online reinforcement learning. It provides a framework for iterating and experimenting across Gym environments, so that agent prompting and hyperparameter tuning can be tested efficiently.
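To make the online reinforcement learning loop concrete, here is a minimal sketch of an agent interacting with a Gym-style environment. It does not use LlamaGym's actual API. `TargetEnv`, `StubAgent`, `run_training`, and every method name below are hypothetical, and a small preference table stands in for the LLM weights that a real setup would fine-tune.

```python
class TargetEnv:
    """Toy stand-in for a Gym environment (hypothetical, not a real Gym env).

    The agent earns reward 1.0 whenever its action equals a hidden target.
    """

    def __init__(self, target=1, horizon=5):
        self.target = target
        self.horizon = horizon
        self.steps = 0

    def reset(self):
        self.steps = 0
        return "Choose action 0 or 1.", {}

    def step(self, action):
        self.steps += 1
        reward = 1.0 if action == self.target else 0.0
        terminated = self.steps >= self.horizon
        # Same 5-tuple shape as a Gymnasium step: obs, reward, terminated, truncated, info
        return "Choose action 0 or 1.", reward, terminated, False, {}


class StubAgent:
    """Hypothetical agent; a preference table replaces the LLM being fine-tuned."""

    def __init__(self, learning_rate=0.5):
        # Optimistic initial values so each action gets tried at least once.
        self.prefs = [1.0, 1.0]
        self.learning_rate = learning_rate
        self.last_action = None
        self.episode = []  # (action, reward) pairs collected during the episode

    def act(self, observation):
        # Greedy choice; ties resolve to the lowest action index.
        self.last_action = max(range(len(self.prefs)), key=lambda a: self.prefs[a])
        return self.last_action

    def assign_reward(self, reward):
        self.episode.append((self.last_action, reward))

    def terminate_episode(self):
        # "Training step": move each preference toward the observed reward.
        for action, reward in self.episode:
            self.prefs[action] += self.learning_rate * (reward - self.prefs[action])
        self.episode.clear()


def run_training(episodes=4):
    env = TargetEnv()
    agent = StubAgent()
    returns = []
    for _ in range(episodes):
        observation, info = env.reset()
        done = False
        total = 0.0
        while not done:
            action = agent.act(observation)
            observation, reward, terminated, truncated, info = env.step(action)
            agent.assign_reward(reward)
            total += reward
            done = terminated or truncated
        agent.terminate_episode()  # update happens online, after each episode
        returns.append(total)
    return agent, returns


if __name__ == "__main__":
    agent, returns = run_training()
    print(returns, agent.prefs)
```

In this toy setup the first episode earns no reward, and after one update the agent switches to the rewarded action. A real agent would replace the preference table with prompt formatting, LLM generation, and a policy-gradient update.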

Stack

Llama

Similar use cases

Pieces (0 votes)

Continue (0 votes)

Cody (0 votes)

Jan (0 votes)
