Data processing component of the Velox Big Data Framework (VBDF)
VeloxMR is a MapReduce framework implemented in C++ on the top of VeloxDF
Key features of current VeloxMR include:
VeloxMR default launcher is not included in this repository, if you want to use it you can find it [here][eclipsed].
The reason not to include the launcher inside the package is to let the user to choose any launcher or service managers such as:
Once the system is up and running, you can interact with VeloxDFS with the following commands:
$ dfs put|get|cat|ls|rm|format|pget|update|append
Detailed information can be found in the wiki of this repository
$ mkdir -p local_eclipse/{tmp,sandbox} # Create a sandbox directories
$ cd local_eclipse # enter in the directory
$ git clone git@github.com:DICL/VeloxDFS.git # Clone the project from github
$ cd VeloxDFS
$ sh autogen.sh # Generate configure script
$ cd ../tmp # Go to building folder
$ sh ../VeloxDFS/configure --prefix=`pwd`/../sandbox # Check requirements and generate the Makefile
# If you get a boost error go the FAQ section of the README
### This last command will be needed whenever you want to recompile the source
$ make [-j#] install # Compile & install add -j flag to speed up
Now edit in your ~/.bashrc or ~/.profile:
export PATH="/home/*..PATH/To/eclipse/..*/sandbox/bin":$PATH
export LIBRARY_PATH="/home/*..PATH/To/eclipse/..*/sandbox/lib"
export C_INCLUDE_PATH="/home/*..PATH/To/eclipse/..*/sandbox/include"
"log" : {
"type" : "LOG_LOCAL6"
"name" : "ECLIPSE"
"mask" : "DEBUG"
},
"cache" : {
"numbin" : 100,
"size" : 200000,
"concurrency" : 1
},
"filesystem" : {
"block" : 137438953,
"buffer" : 512,
"replica" : 1
}
configure
stops with errors related to boost library.In this example we assume that the boost headers are in
sh ../VeloxDFS/configure --prefix ~/sandbox --with-boost=/usr/local --with-boost-libdir=/usr/local/lib
/usr/local/include
while the library files/usr/local/lib
.