This sample demonstrates how to use multiclass classifiers and feature hashing in Azure ML Studio to classify news into different categories.
We used the 2004 Reuters news dataset. The training set has about 10,000 news examples, and the test set is 50% of this data. The original dataset has 103 categories that are organized into four hierarchies: ![enter image description here] - Corporate-Industrial (CCAT) - Government and Social (GCAT) - Economics and Economic Indicators (ECAT) - Securities and Commodities Trading and Market (MCAT) For this experiment, we used the names of the hierarchies as the label, or attribute to predict. Thus we were solving a multiclass classification problem with four classes. The original news articles might belong to one or more hierarchies. For those articles, a separate example was created for each combination of label and article, so that the articles had the same features but different label. For instance, if an article belonged to CCAT and GCAT, two examples would exist in the label data set, one for CCAT, and the other one for GCAT. ![enter image description here] <br><br> ---------- > This ML experiment is for [Microsoft Azure Machine Learning Course].<br> For the complete experiment list [Click here].<br> Laploy | email@example.com | 084 007 5544 | [www.laploy.com]<br> ![enter image description here] ---------- : https://notebooks.azure.com/laploy/libraries/loyml/html/00001%20Sessions%20summary.ipynb : https://gallery.cortanaintelligence.com/Home/Author?authorId=81E333F747E3429B55A3445E6714C36F60B397C13B4D0B07F34DEF1421F64D73 : http://laploy.com : https://raw.githubusercontent.com/laploy/mli/master//loy-small.jpg : https://raw.githubusercontent.com/laploy/mli/master//12530-000.jpg : https://raw.githubusercontent.com/laploy/mli/master//12530-001.jpg : https://gallery.cortanaintelligence.com/browse?s=laploy