Seismic Activity Predictor

February 15, 2015
This experiment predicts whether a high energy seismic event is likely to occur. It is based on UCI seismic-bumps data.
This experiment uses the [UCI Seismic bumps dataset](http://archive.ics.uci.edu/ml/datasets/seismic-bumps) to forecast whether a high energy (higher that 10^4 J) is likely to occur on a coal mine. Data ----- These are the columns of our data. 1. *MineHazardAssessmentClass*: result of shift seismic hazard assessment in the mine working obtained by the seismic method (a - lack of hazard, b - low hazard, c - high hazard, d - danger state); 2. *SeismoAccousticAssessmentClass*: result of shift seismic hazard assessment in the mine working obtained by the seismoacoustic method; 3. *ShiftType*: information about type of a shift (W - coal-getting, N -preparation shift); 4. *GeophoneEnergy*: seismic energy recorded within previous shift by the most active geophone (GMax) out of geophones monitoring the longwall; 5. *GPuls*: a number of pulses recorded within previous shift by GMax; 6. *EnergyDeviation*: a deviation of energy recorded within previous shift by GMax from average energy recorded during eight previous shifts; 7. *PulseDeviation*: a deviation of a number of pulses recorded within previous shift by GMax from average number of pulses recorded during eight previous shifts; 8. *SeismoAccousticAssessment*: result of shift seismic hazard assessment in the mine working obtained by the seismoacoustic method based on registration coming form GMax only; 9. *NBumps*: the number of seismic bumps recorded within previous shift; 10. *NBumps2*: the number of seismic bumps (in energy range [10^2,10^3)) registered within previous shift; 11. *NBumps3*: the number of seismic bumps (in energy range [10^3,10^4)) registered within previous shift; 12. *NBumps4*: the number of seismic bumps (in energy range [10^4,10^5)) registered within previous shift; 13. *NBumps5*: the number of seismic bumps (in energy range [10^5,10^6)) registered within the last shift; 14. *NBumps6*: the number of seismic bumps (in energy range [10^6,10^7)) registered within previous shift; 15. *NBumps7*: the number of seismic bumps (in energy range [10^7,10^8)) registered within previous shift; 16. *NBumps89*: the number of seismic bumps (in energy range [10^8,10^10)) registered within previous shift; 17. *SeismicEnergyPrevious*: total energy of seismic bumps registered within previous shift; 18. *MaxEnergy*: the maximum energy of the seismic bumps registered within previous shift; 19. *HazardousState*: (This is the label we will train on and predict) - '1' means that high energy seismic bump occurred in the next shift ('hazardous state'), '0' means that no high energy seismic bumps occurred in the next shift ('non-hazardous state'). Training -------- Given we did not have a lot of data, we used the sweeping parameters module, and did not pass testing data. This means that all data was used for testing by using cross-validation.