TPOT is built on top of several existing Python libraries, including NumPy, SciPy, scikit-learn, pandas, DEAP, update_checker, tqdm, and stopit.

Most of the necessary Python packages can be installed via the Anaconda Python distribution, which we strongly recommend. We also strongly recommend using Python 3 over Python 2 if you have the choice.

NumPy, SciPy, scikit-learn, and pandas can be installed in Anaconda via the command:

conda install numpy scipy scikit-learn pandas

DEAP, update_checker, tqdm and stopit can be installed with pip via the command:

pip install deap update_checker tqdm stopit
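
After running both commands, you may want to confirm that the required packages can actually be found by your Python interpreter. The snippet below is a minimal sketch and not part of TPOT: the `REQUIRED_MODULES` mapping and the `find_missing` helper are hypothetical names introduced for illustration, and the import names (for example `sklearn` for scikit-learn) are assumed to be the usual ones for these packages.

```python
import importlib.util

# Assumed mapping from package name (as installed) to import name.
# Note that scikit-learn is imported as "sklearn".
REQUIRED_MODULES = {
    "numpy": "numpy",
    "scipy": "scipy",
    "scikit-learn": "sklearn",
    "pandas": "pandas",
    "deap": "deap",
    "update_checker": "update_checker",
    "tqdm": "tqdm",
    "stopit": "stopit",
}


def find_missing(modules):
    """Return the package names whose import module cannot be located."""
    return [
        package
        for package, module in modules.items()
        if importlib.util.find_spec(module) is None
    ]


missing = find_missing(REQUIRED_MODULES)
if missing:
    print("Missing packages:", ", ".join(missing))
else:
    print("All required packages were found.")
```

The check only locates the modules without importing them, so it is fast but will not catch a package that is present yet broken.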

For Windows users: the pywin32 module is required if Python is NOT installed via the Anaconda Python distribution. It can be installed with pip for Python versions <= 3.3, or with conda (e.g. Miniconda) for any Python version:

conda install pywin32

Optionally, you can install XGBoost if you would like TPOT to use eXtreme Gradient Boosting models. TPOT will still function normally if XGBoost is not installed. Windows users: pip installation may not work in some Windows environments and may cause unexpected errors.

pip install xgboost

If you have issues installing XGBoost, check the XGBoost installation documentation.

If you plan to use Dask for parallel training, make sure to install dask[delayed] and dask-ml:

pip install dask[delayed] dask-ml

If you plan to use the TPOT-MDR configuration, make sure to install scikit-mdr and scikit-rebate:

pip install scikit-mdr skrebate
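
Because XGBoost, Dask, and the TPOT-MDR packages are all optional, it can help to see at a glance which of these extras are present in the current environment. The following is a rough sketch under stated assumptions: `OPTIONAL_FEATURES` and `feature_available` are hypothetical names used only for illustration, and the import names `dask_ml`, `mdr` (for scikit-mdr), and `skrebate` are assumed to be the ones these packages provide.

```python
import importlib.util

# Assumed import names for each optional feature described above.
OPTIONAL_FEATURES = {
    "XGBoost models": ["xgboost"],
    "Dask parallel training": ["dask", "dask_ml"],
    "TPOT-MDR configuration": ["mdr", "skrebate"],
}


def feature_available(modules):
    """Return True only if every listed module can be located."""
    return all(importlib.util.find_spec(name) is not None for name in modules)


for feature, modules in OPTIONAL_FEATURES.items():
    status = "available" if feature_available(modules) else "not installed"
    print(f"{feature}: {status}")
```

A feature reported as "not installed" is not an error; it only means the corresponding optional packages are absent.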

Finally, to install TPOT itself, run the following command:

pip install tpot
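
To confirm that the installation succeeded, one option is to ask Python which version of TPOT it can see. This is a minimal sketch, assuming Python 3.8 or later (for the standard-library `importlib.metadata` module); `installed_version` is a hypothetical helper name, not a TPOT function.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(distribution):
    """Return the installed version string, or None if it is not installed."""
    try:
        return version(distribution)
    except PackageNotFoundError:
        return None


tpot_version = installed_version("tpot")
if tpot_version is None:
    print("TPOT does not appear to be installed in this environment.")
else:
    print(f"TPOT version {tpot_version} is installed.")
```

If the script reports that TPOT is missing even though pip succeeded, pip and the Python interpreter you are running may belong to different environments.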

Please file a new issue if you run into installation problems.