Our process machine learning process will follow these steps:
1- Import Data
2- Clean the Data
3- Split the Data into Training/Test Sets
4- Create a Model
5- Train the Model
6- Make Predictions
7- Evaluate and Improve
When we train a model, we give it two separate data sets: the input set and the output set. Output set, contains the predictions. So, we train our model.
The CSV file used in this tutorial: <a href="https://bit.ly/3muqqta" class="autolinkedURL autolinkedURL-url" target="_blank">bit.ly/3muqqta</a>
In this tutorial, we have elements of age, gender and music genre. So, we will eventually try to make predictions according to our data.
<pre><code>import pandas as pd
music=pd.read_csv('music.csv')
music
</code></pre>Now, we should use the "drop" method to prepare our data.
It works like this:
<pre><code>import pandas as pd
music=pd.read_csv('music.csv')
X = music.drop(columns=['genre'])
X
</code></pre><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1652719482012/Pr6GGqCOu.png" alt="data set me.png" />

Our process machine learning process will follow these steps:

1- Import Data
2- Clean the Data
3- Split the Data into Training/Test Sets
4- Create a Model
5- Train the Model
6- Make Predictions
7- Evaluate and Improve


When we train a model, we give it two separate data sets: the input set and the output set. Output set, contains the predictions. So, we train our model.

The CSV file used in this tutorial: https://bit.ly/3muqqta

In this tutorial, we have elements of age, gender and music genre. So, we will eventually try to make predictions according to our data.

```
import pandas as pd
music=pd.read_csv('music.csv')
music

```

Now, we should use the "drop" method to prepare our data.
It works like this:

```
import pandas as pd
music=pd.read_csv('music.csv')
X = music.drop(columns=['genre'])
X

```


![data set me.png](https://cdn.hashnode.com/res/hashnode/image/upload/v1652719482012/Pr6GGqCOu.png align="left")


Preparing the Data for Machine Learning Problems

I am writing about Python, machine learning, robotics and data science