Orange Blog

Categories:
 orange3
 visualization
 analysis
 addons
 workshop
 education
 classification
 data
 examples
 gsoc
By: AJDA, Nov 30, 2016
Data Mining for Political Scientists
Being a political scientist, I did not even hear about data mining before I’ve joined Biolab. And naturally, as with all good things, data mining started to grow on me. Give me some data, connect a bunch of widgets and see the magic happen! But hold on! There are still many social scientists out there who haven’t yet heard about the wonderful world of data mining, text mining and machine learning.
By: AJDA, Nov 25, 2016
Celebrity Lookalike or How to Make Students Love Machine Learning
Recently we’ve been participating at Days of Computer Science, organized by the Museum of Post and Telecommunications and the Faculty of Computer and Information Science, University of Ljubljana, Slovenia. The project brought together pupils and students from around the country and hopefully showed them what computer science is mostly about. Most children would think programming is just typing lines of code. But it’s more than that. It’s a way of thinking, a way to solve problems creatively and efficiently.
By: BLAZ, Nov 2, 2016
Orange at Eurostat's Big Data Workshop
A Eurostat’s Big Data Workshop recently took place in Ljubljana. In a presentation we have showcased Orange as a tool to teach data science. The meeting was organised by Statistical Office of Slovenia and by Eurostat, a Statistical Office of the European Union, and was a primary gathering of representatives from national statistical institutes joined within European Statistical System. The meeting discussed possibilities that big data offers to modern statistics and the role it could play in statistical offices around the world.
By: AJDA, Oct 17, 2016
10 Tips and Tricks for Using Orange
TIP #1: Follow tutorials and example workflows to get started. It’s difficult to start using new software. Where does one start, especially a total novice in data mining? For this exact reason we’ve prepared Getting Started With Orange  YouTube tutorials for complete beginners. Example workflows on the other hand can be accessed via Help  Examples. TIP #2: Make use of Orange documentation. You can access it in three ways:
By: BLAZ, Oct 2, 2016
Intro to Data Mining for Life Scientists
RNA Club Munich has organized Molecular Life of Stem Cells Conference in Ljubljana this past Thursday, Friday and Saturday. They asked us to organize a fourhour workshop on data mining. And here we were: four of us, Ajda, Anze, Marko and myself (Blaz) run a workshop for 25 students with molecular biology and biochemistry background. We have covered some basic data visualization, modeling (classification) and model scoring, hierarchical clustering and data projection, and finished with a touch of deeplearning by diving into image analysis by deep learningbased embedding.
By: PRIMOZGODEC, Aug 25, 2016
Visualizing Gradient Descent
This is a guest blog from the Google Summer of Code project. Gradient Descent was implemented as a part of my Google Summer of Code project and it is available in the Orange3Educational addon. It simulates gradient descent for either Logistic or Linear regression, depending on the type of the input data. Gradient descent is iterative approach to optimize model parameters that minimize the cost function. In machine learning, the cost function corresponds to prediction error when the model is used on the training data set.
By: PRIMOZGODEC, Aug 12, 2016
Interactive kMeans
This is a guest blog from the Google Summer of Code project. As a part of my Google Summer of Code project I started developing educational widgets and assemble them in an Educational AddOn for Orange. Educational widgets can be used by students to understand how some key data mining algorithms work and by teachers to demonstrate the working of these algorithms. Here I describe an educational widget for interactive kmeans clustering, an algorithm that splits the data into clusters by finding cluster centroids such that the distance between data points and their corresponding centroid is minimized.
By: BLAZ, Mar 12, 2016
Overfitting and Regularization
A week ago I used Orange to explain the effects of regularization. This was the second lecture in the Data Mining class, the first one was on linear regression. My introduction to the benefits of regularization used a simple data set with a single input attribute and a continuous class. I drew a data set in Orange, and then used Polynomial Regression widget (from Prototypes addon) to plot the linear fit.
By: AJDA, Dec 2, 2015
Hierarchical Clustering: A Simple Explanation
One of the key techniques of exploratory data mining is clustering – separating instances into distinct groups based on some measure of similarity. We can estimate the similarity between two data instances through euclidean (pythagorean), manhattan (sum of absolute differences between coordinates) and mahalanobis distance (distance from the mean by standard deviation), or, say, through Pearson correlation or Spearman correlation. Our main goal when clustering data is to get groups of data instances where:
By: BLAZ, Oct 9, 2015
Data Mining Course in Houston
We have just completed an Introduction to Data Mining, a graduate course at Baylor College of Medicine in Texas, Houston. The course was given in September and consisted of seven twohour lectures, each one followed with a homework assignment. The course was attended by about 40 students and some faculty and research staff. This was a challenging course. The audience was new to data mining, and we decided to teach them with the newest, third version of Orange.