Diamond dataset
The objective of the overall study was to identify the factors associating with the price of a round shaped diamond and to predict the price
The following is an analysis of information on 35000 round shape diamonds collected under 10 aspects, namely weight, the type of the cut, color, the clarity, the table value, length, width, depth, total depth percentage and finally the price. Along with an analysis of variables, several models have been developed to predict the price with a minimum error and the best of the said models has been recommended.
The following is a brief description of variables.
Continuous Variables
• Price –The price of the diamond in US dollars ($326–$18,823)
• Carat- The weight of the diamond where 1carat=200mg.
• x- Length in mm (0–10.74)
• y- Width in mm (0–58.9)
• z- Depth in mm (0–31.8)
• Depth-Total depth percentage
• Table- Width of top of diamond relative to widest point (43–95)
Categorical Variables
• Cut- The quality of the cut (Fair < Good< Very Good< Premium< Ideal)
• Color- Color of the diamond [D (best), E, F, G, H, I, J (worst)]
• Clarity- Measures how clear the diamond is (I1 (worst), SI1, SI2, VS1, VS2, VVS1, VVS2, IF (best))