J. Alcala-fdez, L. Sanchez, S. Garcia, M. J. Del-jesus, S. Ventura et al., KEEL: a software tool to assess evolutionary algorithms for data mining problems, Soft Computing, vol.13, issue.3, pp.307-318, 2009.

M. Antonelli, D. Bernardo, H. Hagras, and F. Marcelloni, Multiobjective Evolutionary Optimization of Type-2 Fuzzy Rule-Based Systems for Financial Data Classification, IEEE Transactions on Fuzzy Systems, vol.25, issue.2, pp.249-264, 2017.

N. Attoh-okine, Big data challenges in railway engineering, 2014 IEEE International Conference on, pp.7-9, 2014.

S. Barua, M. M. Islam, X. Yao, and K. Murase, MWMOTE--majority weighted minority oversampling technique for imbalanced data set learning, IEEE Transactions on Knowledge and Data Engineering, vol.26, issue.2, pp.405-425, 2014.

G. E. Batista, C. Ronaldo, M. C. Prati, and . Monard, A study of the behavior of several methods for balancing machine learning training data, ACM Sigkdd Explorations Newsletter, vol.6, issue.1, pp.20-29, 2004.

R. Batuwita and V. Palade, FSVM-CIL: fuzzy support vector machines for class imbalance learning, IEEE Transactions on Fuzzy Systems, vol.18, issue.3, pp.558-571, 2010.

C. Bezerra and . Gomes, An evolving approach to unsupervised and Real-Time fault detection in industrial processes, Expert Systems with Applications, vol.63, pp.134-144, 2016.

N. V. Chawla, W. Kevin, L. O. Bowyer, W. P. Hall, and . Kegelmeyer, SMOTE: synthetic minority over-sampling technique, Journal of artificial intelligence research, vol.16, pp.321-357, 2002.

A. Chauhan, D. Chauhan, and C. Rout, Role of gist and phog features in computer-aided diagnosis of tuberculosis without segmentation, PloS one, vol.9, issue.11, p.112980, 2014.

N. V. Chawla, A. Lazarevic, L. O. Hall, and K. W. Bowyer, SMOTEBoost: Improving prediction of the minority class in boosting, European Conference on Principles of Data Mining and Knowledge Discovery, pp.107-119, 2003.

S. Choi and P. Rockett, The training of neural classifiers with condensed datasets, IEEE Transactions on Systems, Man, and Cybernetics, vol.32, issue.2, pp.202-206, 2002.

B. S. Costa and . Jales, Unsupervised classification of data streams based on Typicality and Eccentricity Data Analytics, IEEE International Conference on Fuzzy Systems IEEE, 2016.

C. Elkan, The foundations of cost-sensitive learning, International joint conference on artificial intelligence, vol.17, pp.973-978, 2001.

Q. Fan, Z. Wang, D. Li, D. Gao, and H. Zha, Entropy-based fuzzy support vector machine for imbalanced datasets, Knowledge-Based Systems, vol.115, pp.87-99, 2017.

E. Fehr and S. Gächter, Altruistic punishment in humans, Nature, vol.415, issue.6868, pp.137-140, 2002.

S. Fine and K. Scheinberg, Efficient SVM training using low-rank kernel representations, Journal of Machine Learning Research, vol.2, pp.243-264, 2001.

M. Gao, X. Hong, and C. J. Harris, Construction of neurofuzzy models for imbalanced data classification, IEEE Transactions on Fuzzy Systems, vol.22, issue.6, pp.1472-1488, 2014.

. M-a-n-u-s-c-r-i-p-t,

J. Guzinski, H. Abu-rub, M. Diguet, Z. Krzeminski, and A. Lewicki, Speed and load torque observer application in high-speed train electric drive, IEEE Transactions on Industrial Electronics, vol.57, issue.2, pp.565-574, 2010.

H. Hu, . Tang, X. Bo, W. Gong, H. Wei et al., Intelligent fault diagnosis of the high-speed train with big data based on deep neural networks, IEEE Transactions on Industrial Informatics, 2017.

J. Huang and C. X. Ling, Using AUC and accuracy in evaluating learning algorithms, IEEE Transactions on knowledge and Data Engineering, vol.17, issue.3, pp.299-310, 2005.

. Huang, G. Su-dan, Z. Cao, J. F. He, J. Pan et al., Nonlinear modeling of the inverse force function for the planar switched reluctance motor using sparse least squares support vector machines, IEEE Transactions on Industrial Informatics, vol.11, issue.3, pp.591-600, 2015.

H. V. Jagadish, J. Gehrke, A. Labrinidis, Y. Papakonstantinou, J. M. Patel et al., Big data and its technical challenges, Communications of the ACM, vol.57, issue.7, pp.86-94, 2014.

D. Jiang, L. I. Nian, and . Wei, Fault detection method based on data-driven residual evaluation strategy, Control & Decision, vol.32, pp.1181-1188, 2017.

P. Kecman, . Rob, and . Goverde, Online data-driven adaptive prediction of train event times, IEEE Transactions on Intelligent Transportation Systems, vol.16, issue.1, pp.465-474, 2015.

S. Kim, Forecasting short-term air passenger demand using big data from search engine queries, Automation in Construction, vol.70, pp.98-108, 2016.

A. Lemos, W. Caminhas, and F. Gomide, Adaptive fault detection and diagnosis using an evolving fuzzy classifier, Information Sciences, vol.220, issue.1, pp.64-85, 2013.

H. Li, B. Qian, D. Parikh, and A. Hampapur, Alarm prediction in large-scale sensor networks-A case study in railroad, Big Data, 2013 IEEE International Conference on, pp.7-14, 2013.

X. Li, J. Lv, and Z. Yi, An Efficient Representation-Based Method for Boundary Point and Outlier Detection, IEEE Transactions on Neural Networks and Learning Systems, 2016.

Z. Li and Q. He, Prediction of Railcar Remaining Useful Life by Multiple Data Source Fusion, IEEE Transactions on Intelligent Transportation Systems, vol.16, issue.4, pp.2226-2235, 2015.

C. Lin and W. Sheng-de, Training algorithms for fuzzy support vector machines with noisy data, Pattern recognition letters, vol.25, issue.14, pp.1647-1656, 2004.

J. Liu, Y. Li, and E. Zio, A SVM framework for fault detection of the braking system in a high speed train, Mechanical Systems and Signal Processing, vol.87, pp.401-409, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01408781

X. Liu, Statistical Causal Analysis of Freight-Train Derailments in the United States, Journal of Transportation Engineering, Part A: Systems, p.4016007, 2016.

V. López, An insight into classification with imbalanced data: Empirical results and current trends on using data intrinsic characteristics, Information Sciences, vol.250, issue.11, pp.113-141, 2013.

M. Radovanovi?, A. Nanopoulos, and M. Ivanovi?, Reverse nearest neighbors in unsupervised distance-based outlier detection, IEEE transactions on knowledge and data engineering, vol.27, issue.5, pp.1369-1382, 2015.

M. Tanaka, Prospective study on the potential of big data, Quarterly Report of RTRI, vol.56, issue.1, pp.5-9, 2015.

. M-a-n-u-s-c-r-i-p-t,

A. Thaduri, D. Galar, and U. Kumar, Railway assets: a potential domain for big data analytics, Procedia Computer Science, vol.53, pp.457-467, 2015.

A. Tisan and J. Chin, An End-User Platform for FPGA-Based Design and Rapid Prototyping of Feedforward Artificial Neural Networks With On-Chip Backpropagation Learning, IEEE Transactions on Industrial Informatics, vol.12, issue.3, pp.1124-1133, 2016.

X. Yan, B. Cai, B. Ning, and W. Shangguan, Online distributed cooperative model predictive control of energy-saving trajectory planning for multiple high-speed train movements, Transportation Research Part C: Emerging Technologies, vol.69, pp.60-78, 2016.

D. You, X. Gao, and S. Katayama, Multisensor fusion system for monitoring highpower disk laser welding using support vector machine, IEEE Transactions on Industrial Informatics, vol.10, issue.2, pp.1285-1295, 2014.

A. M. Zarembski, Some examples of big data in railroad engineering, 2014 IEEE International Conference on, pp.96-102, 2014.

X. Zhang, E. Onieva, A. Perallos, E. Osaba, C. S. Victor et al., Hierarchical fuzzy rule-based system optimized with genetic algorithms for short term traffic congestion prediction, Transportation Research Part C: Emerging Technologies, vol.43, pp.127-142, 2014.

A. A. Zilko, D. Kurowicka, and R. Goverde, Modeling railway disruption lengths with Copula Bayesian Networks, Transportation Research Part C: Emerging Technologies, vol.68, pp.350-368, 2016.

E. Zio, Prognostics and health management of industrial equipment, Diagnostics and prognostics of engineering systems: methods and techniques, pp.333-356, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00778377