MapReduce is a software architecture proposed by Google for parallel operations of large data sets. Typically, a Map (mapping) function is used to Map the key-value of a collection to a new collection, key-value, and to compute the new collection generated in the Map using the concurrent running Reduce (inductive) function.
The primary consideration of MapReduce is the distribution of data and the reliability of the nodes, which in fact coincide with the characteristics of the blockchain. Because CyberVein doomed will have massive data, in the process of big data analysis and processing, relational database in terms of scalability became the main bottleneck of restricting its performance, the deficiency of the SQL language is difficult to meet the demand of complicated data analysis.
So CyberVein architecture will graphs architecture scalability, fault tolerance, etc advantages combined with relational database of high performance, design and a relational database performance and graphs CyberVeinDB high scalability of data warehouse query system. Participants role in CyberVeinDB divided into two categories: data storage (DataStore) and using (DataUse), data storage, on the surface of the graphs is the physical and data using the side of the graphs is logic level.
The DataStore design is based on the architecture of the Distributed file system HDFS. Based on the characteristics of the CyberVein network architecture, the database model of the Distributed Block (Distributed Block) is proposed.
The frame structure of the DataStore model is as follows: