Work

I am a computer science engineer specializing in data mining, currently working at ANDATA Development Technology GmbH in Hallein, Austria. My focus is traffic and pollution prediction.

I received my Ph.D. in computer science in June 2013 from the University of Málaga, under the supervision of Rafael Morales-Bueno and Manuel Baena-García. My thesis (New trends in data mining theory and applications for natural language text classification) is devoted to the application of sub-linear space streaming algorithms to feature selection, summarization and categorization of high-dimensional multi-label data sources with concept drift, specifically text streams. The developed algorithms evolve over time and adapt to concept drift, allowing the use of a dynamic feature space and label set. You can find a list of my publications here.

During my Ph.D. I collaborated with the University of Aveiro (a 10-day internship under the supervision of Gladys Castillo) and the Karlsruher Institut für Technologie (a 3-month internship in the AIFB group under the supervision of Prof. Dr. Rudi Studer), as well as with top researchers such as João Gama, Albert Bifet, Ricard Gavaldà, Achim Rettinger, José del Campo, Jesse Read, Rafael Morales-Bueno and Manuel Baena-García, among others.

I am one of the developers of the GNUsmail project, an open-source framework that includes a number of ready-to-use language processing and machine learning resources. The framework allows other developers to incorporate new tools in order to produce customized classifiers.

I collaborate with Gladys Castillo on the development of, and experimentation with, the AdPreqFr4SL and ABC-DynF frameworks, which we have used for email categorization.

I have also collaborated with Hospital Costa del Sol on the design and implementation of DB4US, a Business Analytics system for laboratory management. Hospital Costa del Sol invited me to teach a course on data mining software and processes during the 5th Spanish Congress on Clinical Laboratories.
