Introduction of Orange Tools

Dhruvipatel
4 min readSep 14, 2021

--

This blog is all about introduction of Orange Tool . We can do a lot of stuff with the help of the orange tool like visual programming, data visualization, data exploration, data mining and many more . The orange tool is free and open-source and you can install it very easily on any operating system.

Orange is an open-source data visualization, machine learning and data mining toolkit. It features a visual programming front-end for explorative data analysis and interactive data visualization, and can also be used as a Python library . Orange quickly formats the data in a pattern that you can move with widgets.

For download click here .

Let’s start with Orange Tool ,

Home screen of Orange tool

Here you can see the canvas of orange where you will do all your data exploration. On the left-hand side, there is a total of 5 sections and that all 5 sections contain different-different widgets like data , Visualize , Model , Evaluate and Unsupervised which we will use in the future for data exploration and evaluation of different model .

Orange consists of a canvas interface onto which the user places widgets and creates a data analysis workflow. Widgets offer basic functionalities such as reading the data, showing a data table, selecting features, training predictors, comparing learning algorithms, visualizing data elements, etc.

For this tutorial we use California Housing Prices dataset .

Step-1

We have to import our dataset . Orange can load data from any of the major standard spreadsheet file types, like CSV and Excel .

In our case we have to give file location of our dataset .

Load Dataset

For basic information like number of rows and column , number of numeric and categorical column and many information we use function data info .

Data Info
Data Table

Step-2

We have to visualize our dataset .

Scatter Plot
Scatter plot
Distribution Plot

Step-3

Here for cleaning purpose, we will impute missing values.

Imputation is a very important step in understanding and making the best use of our data.

Here, I have selected the default method to be Average for numerical values and Most Frequent for text based values (categorical).

Fill missing values

Step-4

Create a machine learning model using Orange Tool . For our dataset we ae using simple logistic regression model .

Apply Logistic Regression Model

After applying model , we have to evaluate our model . For evaluating model we use test and score method .

step-5

Test & Score

And last we created confusion matrix .

Confusion matrix

Other Widgets Information here.

Github Link

https://github.com/dhruvi410/-Introduction-to-Orange-Tools

Conclusion

I hope now you can work by yourself in the orange tool. I tried to cover as many things as I can. Now you can explore more by yourself.

Do check out more features of the Orange tool here.

--

--