Title: Tools for mining Massive Data Sets
Place: Celis 116 @10:30am
In this talk we will review some tools for mining massive data sets, which is one of the main task in the Big Data world. First, we will give an introduction to data mining and Big Data. After that, we will explain the use of Hadoop and Mapreduce, two powerful computational tools based on Google’s technology, for dealing with Big data. Application of these two tools to clustering by k-means and k-nn classification will be shown. Finally , we will mention the research that we are currently involved.
Dr. Edgar Acuña is a full professor at the Department of Mathematical Sciences of the University of Puerto Rico at Mayaguez. His research work is in Computational and Statistical learning for Knowledge Discovery in Databases. In particular, he is interested in data pre-processing methodology and data engineering. He is currently working on mining massive datasets and in applications of functional data analysis to data coming from different scientific areas. He has supervised, 23 master students and five doctoral students in Computing and Information Sciences and Engineering. In 2009, Dr Acuna was selected as a Fulbright Scholar. He has delivered lectures in more than twenty countries all over the world. Dr. Acuña has been the PI of four grants, three from ONR and one from DoD, and he has been co-PI in one NSF grant. During his career Dr. Acuna has also received funds from Intel and Hewlett Packard to acquire computer equipment as well to support research assistants.