freshcrate

GEA

Group Evolving Agents: Open-Ended Self-Improvement via Experience Sharing

Description

Group Evolving Agents: Open-Ended Self-Improvement via Experience Sharing

README

Group Evolving Agents:
Open-Ended Self-Improvement via Experience Sharing

Repository for Group Evolving Agents (GEA), a new paradigm for open-ended self-improvements, which treats a group of agents as the fundamental evolutionary unit, enabling explicit experience sharing and reuse within the group throughout evolution.

Setup

# API keys, add to ~/.bashrc
export OPENAI_API_KEY='...'
export ANTHROPIC_API_KEY='...'
# Verify that Docker is properly configured in your environment.
docker run hello-world
 
# If a permission error occurs, add the user to the Docker group
sudo usermod -aG docker $USER
newgrp docker
# Install dependencies
python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt

# Optional: for running analysis
sudo apt-get install graphviz graphviz-dev
pip install -r requirements_dev.txt
# Clone SWE-bench
cd swe_bench
git clone https://github.com/princeton-nlp/SWE-bench.git
cd SWE-bench
git checkout dc4c087c2b9e4cefebf2e3d201d27e36
pip install -e .
cd ../../

# Prepare Polyglot
# Make sure git is properly configured in your environment with username and email
python -m polyglot.prepare_polyglot_dataset

Running the GEA

python GEA_outer.py

File Structure

  • analysis/ scripts used for plotting and analysis
  • initial/ SWE-bench logs and performance of the initial agent
  • initial_polyglot/ Polyglot logs and performance of the initial agent
  • swe_bench/ code needed for SWE-bench evaluation
  • polyglot/ code needed for Polyglot evaluation
  • prompts/ prompts used for foundation models
  • tools/ tools available to the foundation models
  • coding_agent.py main implementation of the initial coding agent
  • GEA_outer.py entry point for running the GEA algorithm

Acknowledgement

This codebase is built upon the Darwin GΓΆdel Machine (DGM). We sincerely thank the authors for their inspiring and impactful work.

The evaluation framework implementations are based on the SWE-bench and polyglot-benchmark repositories.

Citing

If you find this project useful, please consider citing:

@article{weng2026group,
  title={Group-Evolving Agents: Open-Ended Self-Improvement via Experience Sharing},
  author={Weng, Zhaotian and Antoniades, Antonis and Nathani, Deepak and Zhang, Zhen and Pu, Xiao and Wang, Xin Eric},
  journal={arXiv preprint arXiv:2602.04837},
  year={2026}
}

Release History

VersionChangesUrgencyDate
0.0.0No release found β€” using repo HEADHigh4/7/2026
main@2026-04-07Latest activity on main branchHigh4/7/2026
main@2026-04-07Latest activity on main branchMedium4/7/2026
main@2026-04-07Latest activity on main branchMedium4/7/2026
main@2026-04-07Latest activity on main branchMedium4/7/2026
main@2026-04-07Latest activity on main branchMedium4/7/2026
main@2026-04-07Latest activity on main branchMedium4/7/2026
main@2026-04-07Latest activity on main branchMedium4/7/2026
main@2026-04-07Latest activity on main branchMedium4/7/2026
main@2026-04-07Latest activity on main branchMedium4/7/2026

Dependencies & License Audit

Loading dependencies...

Similar Packages

ai-dataset-generatorπŸ€– Generate tailored AI training datasets quickly and easily, transforming your domain knowledge into essential training data for model fine-tuning.main@2026-04-21
dopEffectCSharpπŸš€ Maximize your C# productivity with advanced techniques in strings, LINQ, and clean code, inspired by the book "Produtivo com C#."master@2026-04-21
antonMost advanced AI coworkerv2.0.2
geon-decoderGEON: Structure-first decoding via equivalence classes and field closuremain@2026-04-11
cyber-pilotCyber Pilot is a traceable delivery system for requirements, design, plans, and code.v3.7.0-beta