Skip to content

MLSysOps/MLE-agent

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

739 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

MLE-Agent: Your intelligent companion for seamless AI engineering and research.

kaia-llama MLSysOps%2FMLE-agent | Trendshift

๐Ÿ’Œ Fathers' love for Kaia ๐Ÿ’Œ

PyPI - Version Downloads GitHub License Join our Discord community

๐Ÿ“š Docs | ๐Ÿž Report Issues | ๐Ÿ‘‹ Join us on Discord

Overview

MLE-Agent is designed as a pairing LLM agent for machine learning engineers and researchers. It is featured by:

  • ๐Ÿค– Autonomous Baseline: Automatically builds ML/AI baselines and solutions based on your requirements.
  • ๐Ÿ…End-to-end ML Task: Participates in Kaggle competitions and completes tasks independently.
  • ๐Ÿ” Arxiv and Papers with Code Integration: Access best practices and state-of-the-art methods.
  • ๐Ÿ› Smart Debugging: Ensures high-quality code through automatic debugger-coder interactions.
  • ๐Ÿ“‚ File System Integration: Organizes your project structure efficiently.
  • ๐Ÿงฐ Comprehensive Tools Integration: Includes AI/ML functions and MLOps tools for a seamless workflow.
  • โ˜• Interactive CLI Chat: Enhances your projects with an easy-to-use chat interface.
  • ๐Ÿง  Smart Advisor: Provides personalized suggestions and recommendations for your ML/AI project.
  • ๐Ÿ“Š Weekly Report: Automatically generates detailed summaries of your weekly works.
Workflow Agent
Screenshot 2025-06-18 at 2 54 55 PM Screenshot 2025-06-18 at 2 55 04 PM

Video Demo

mle_v030.mp4

Milestones

  • ๐Ÿš€ 09/24/2024: Release the 0.4.2 with enhanced Auto-Kaggle mode to complete an end-to-end competition with minimal effort.
  • ๐Ÿš€ 09/10/2024: Release the 0.4.0 with new CLIs like MLE report, MLE kaggle, MLE integration and many new models like Mistral.
  • ๐Ÿš€ 07/25/2024: Release the 0.3.0 with huge refactoring, many integrations, etc. (v0.3.0)
  • ๐Ÿš€ 07/11/2024: Release the 0.2.0 with multiple agents interaction (v0.2.0)
  • ๐Ÿ‘จโ€๐Ÿผ 07/03/2024: Kaia is born
  • ๐Ÿš€ 06/01/2024: Release the first rule-based version of MLE agent (v0.1.0)

Get started

Installation

From PyPI

# With pip:
pip install -U mle-agent

# With uv:
uv pip install -U mle-agent

From source

  1. Clone the repo
    git clone https://github.com/MLSysOps/MLE-agent.git
    cd MLE-agent
  2. Create & activate a virtual env
    uv venv .venv
    source .venv/bin/activate      # Linux/macOS
  3. Editable install
    pip install -e .               # or: pip install -e .

Usage

mle new <project name>

And a project directory will be created under the current path, you need to start the project under the project directory.

cd <project name>
mle start

You can also start an interactive chat in the terminal under the project directory:

mle chat

Use cases

๐Ÿงช Prototype an ML Baseline

MLE agent can help you prototype an ML baseline with the given requirements, and test the model on the local machine. The requirements can be vague, such as "I want to predict the stock price based on the historical data".

cd <project name>
mle start

๐Ÿ“Š Generate Work Report

MLE agent can help you summarize your weekly report, including development progress, communication notes, reference, and to-do lists.

Mode 1: Web Application to Generate Report from GitHub

cd <project name>
mle report

Then, you can visit http://localhost:3000/ to generate your report locally.

Mode 2: CLI Tool to Generate Report from Local Git Repository

cd <project name>
mle report-local --email=<git email> --start-date=YYYY-MM-DD --end-date=YYYY-MM-DD <path_to_git_repo>
  • --start-date and --end-date are optional parameters. If omitted, the command will generate a report for the default date range of the last 7 days.
  • Replace <git email> with your Git email and <path_to_git_repo> with the path to your local Git repository.

๐Ÿ† Start with Kaggle Competition

MLE agent can participate in Kaggle competitions and finish coding and debugging from data preparation to model training independently. Here is the basic command to start a Kaggle competition:

cd <project name>
mle kaggle

Or you can let the agents finish the Kaggle task without human interaction if you have the dataset and submission file ready:

cd <project name>
mle kaggle --auto \
--datasets "<path_to_dataset1>,<path_to_dataset2>,..." \
--description "<description_file_path_or_text>" \
--submission "<submission_file_path>" \
--sub_example "<submission_example_file_path>" \ 
--comp_id "<competition_id>"

Please make sure you have joined the competition before running the command. For more details, see the MLE-Agent Tutorials.

Roadmap

The following is a list of the tasks we plan to do, welcome to propose something new!

๐Ÿ”จ General Features
  • Understand users' requirements to create an end-to-end AI project
  • Suggest the SOTA data science solutions by using the web search
  • Plan the ML engineering tasks with human interaction
  • Execute the code on the local machine/cloud, debug and fix the errors
  • Leverage the built-in functions to complete ML engineering tasks
  • Interactive chat: A human-in-the-loop mode to help improve the existing ML projects
  • Kaggle mode: to finish a Kaggle task without humans
  • Summary and reflect the whole ML/AI pipeline
  • Integration with Cloud data and testing and debugging platforms
  • Local RAG support to make personal ML/AI coding assistant
  • Function zoo: generate AI/ML functions and save them for future usage
โญ More LLMs and Serving Tools
  • Ollama LLama3
  • OpenAI GPTs
  • Anthropic Claude 3.5 Sonnet
๐Ÿ’– Better user experience
  • CLI Application
  • Web UI
  • Discord
๐Ÿงฉ Functions and Integrations
  • Local file system
  • Local code exectutor
  • Arxiv.org search
  • Papers with Code search
  • General keyword search
  • Hugging Face
  • SkyPilot cloud deployment
  • Snowflake data
  • AWS S3 data
  • Databricks data catalog
  • Wandb experiment monitoring
  • MLflow management
  • DBT data transform

Contributing

We welcome contributions from the community. We are looking for contributors to help us with the following tasks:

  • Benchmark and Evaluate the agent
  • Add more features to the agent
  • Improve the documentation
  • Write tests

Please check the CONTRIBUTING.md file if you want to contribute.

Support and Community

Citation

@misc{zhang2024mleagent,
title = {MLE-Agent: Your Intelligent Companion for Seamless AI Engineering and Research},
author = {Huaizheng Zhang*, Yizheng Huang*, Lei Zhang},
year = {2024},
note = {\url{https://github.com/MLSysOps/MLE-agent}},
}

License

Check MIT License file for more information.

About

๐Ÿค– MLE-Agent: Your intelligent companion for seamless AI engineering and research. ๐Ÿ” Integrate with arxiv and paper with code to provide better code/research plans ๐Ÿงฐ OpenAI, Anthropic, Gemini, Ollama, etc supported. ๐ŸŽ† Code RAG

Topics

Resources

License

Contributing

Stars

Watchers

Forks

Packages

 
 
 

Contributors