Installing and Using Anaconda for Data Science - Beginner's Guide

Installing and Using Anaconda for Data Science - Beginner's Guide


Data Science is an intriguing field that leverages scientific methods, processes, algorithms, and system knowledge to extract insights from structured and unstructured data. Crucial to every step of this analysis, and making the lives of data analysts, scientists, and machine learning engineers easier, is Anaconda, one of the most preferred tools in the data science domain.

Anaconda is an open-source distribution of Python and R programming languages used extensively for scientific computing (data science, machine learning applications, large-scale data processing, and predictive analytics). In this tutorial, we will delve into how to install and use Anaconda effectively for your data science endeavors.

Installation of Anaconda


Before installing Anaconda, you need to verify whether your computer has an existing install of Python. Although Anaconda comes with its version of Python, it's best to ensure compatibility. A clear installation process is essential to avoid any software conflicts or errors in the future.

Here are the steps to install Anaconda in specific Operating Systems:

For Windows:


1. Go to https://www.anaconda.com/distribution/

2. Download the Python 3.7 version (as it’s the latest stable release).

3. Run the installer, and during installation, check the option "Add Anaconda to my PATH environment variable".

4. Once installed, you will find Anaconda Navigator in your apps, which is a GUI to manage Anaconda.

For macOS:


1. Go to https://www.anaconda.com/distribution/

2. Download the Python 3.7 version.

3. Open the terminal, install the package with bash by running the following command on terminal:

bash ~/Downloads/Anaconda3-2018.12-MacOSX-x86_64.sh

4. Follow the prompts on the installer screens.

For Linux:


1. Go to https://www.anaconda.com/distribution/.

2. Download the Python 3.7 version.

3. Open terminal, navigate to the directory with the .sh file and run the bash install:

bash Anaconda3-2018.12-Linux-x86_64.sh

4. Follow the prompts on the installer screens.

After installation, let confirm the installation by checking the Anaconda version:

conda --version

Once you see the Anaconda version, that means Anaconda is successfully installed.

Using Anaconda


Once Anaconda is successfully installed, you use Anaconda Navigator, an easy, point-and-click way to work with packages and environments without needing to key in command lines.

You can create an environment in Anaconda using the following command:

conda create -n env_name python=X.X anaconda

Replace 'env_name' with the name for your environment, and 'X.X' with the Python version you wish to use.

To verify the environment creation, use the following command:

conda info --envs

To activate your environment, use:

source activate env_name

h2>Conclusion

With Anaconda, you have a robust system ready for your data science and machine learning projects. This beginner’s guide attempts to give you a basic understanding and hands-on experience of how to install and use Anaconda. Happy exploring in the world of data science!