We propose to implement the ideas on parallelization of the Mapper (see [14]) visualization methodology developed under our SBIR Phase I effort. Specifically, we will use the MapReduce model, within the Hadoop framework. This development will permit the construction of Mapper outputs for very large data sets. Such methods can then be used to obtain understanding of the massive data sets coming out of the study of internet traffic and advertising, financial market time series, monitoring of consumer behavior within the retail area, and from many other settings.
Keywords: Topology, Parallel Computing, Data Analysis, Mapping, Clustering