December 4, 2019
Ans 1) The top 4 reasons are: Another Position, Unhappy, More Money, and Career Change.
Ans 2) 16
Ans 3) 11
Ans 4) The most expensive recruitment channels are: Career Builder, Pay Per Click, MBTA Ads and On-Campus Recruiting.
Ans 5) The retention rate for the "Employee Referral" category is 82.76%, keeping future joiners out of the calculation.
Ans 6) For the linear regression model, I took pay rate as the dependent variable, because it is the rate of pay an employee receives, and position, manager name, employee source, performance score and department as the independent variables. The model indicates that pay depends mostly on position, department and manager, which is consistent with the general rule for how salaries are set.
Ans 7) For the logistic regression model, the dependent variable is employment status (categorical in nature), which describes the status of employment. The independent variables are pay rate, department, position, manager name and performance score. These variables affect an employee's status to a large degree.
Ans 8) For CART, Reason for Termination is added to the variables used for the logistic regression, in order to predict the results more accurately.
Ans 9) The larger the lift ratio, the stronger the association between the two items in a rule. A high lift means that two items that may seem to have nothing in common occur together more often than they would if they were independent. In the first case, performance score = fully meets and terminated for cause show a strong association. A lift ratio larger than 1 implies that the relationship between marital description = divorced and employee status = voluntarily terminated is stronger than would be expected if the two were independent. The count of 3 shows the three cases in which a divorced person voluntarily left the organisation.
In the next rule, an employee whose performance score is "90 days meets" is voluntarily terminated. A lift ratio larger than 1 implies that the relationship between performance score = 90 days meets and employee status = voluntarily terminated is stronger than would be expected if the two items were independent. The count of 3 shows the three transactions in which a person with this performance score voluntarily left the organisation.
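
To make the lift ratio in Ans 9 concrete, here is a minimal sketch of how it can be computed from raw counts: lift is the confidence of the rule divided by the overall share of the consequent. The function name `lift` and its parameters are hypothetical. Only the joint count of 3 mirrors the answer above; every other count is made up for illustration and is not taken from the dataset.

```python
def lift(n_total, n_antecedent, n_consequent, n_both):
    """Lift ratio of the rule antecedent -> consequent, from raw counts."""
    if min(n_total, n_antecedent, n_consequent) <= 0:
        raise ValueError("n_total, n_antecedent and n_consequent must be positive")
    # Confidence: share of antecedent rows that also have the consequent.
    confidence = n_both / n_antecedent
    # Benchmark: share of all rows that have the consequent.
    benchmark = n_consequent / n_total
    return confidence / benchmark


# Hypothetical counts (assumptions, not dataset values):
# 300 employees, 5 with the antecedent, 90 voluntarily terminated, 3 with both.
example = lift(n_total=300, n_antecedent=5, n_consequent=90, n_both=3)
print(round(example, 2))  # above 1: the items co-occur more than independence predicts
```

A value of exactly 1 would mean the antecedent and the consequent occur together just as often as independence predicts. With a joint count as small as 3, a high lift should be read with caution.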