项目作者: flokkr

项目描述 :
Docker image for main Apache Hadoop components (Yarn/Hdfs)
高级语言: Shell
项目地址: git://github.com/flokkr/docker-hadoop.git
创建时间: 2016-10-30T08:17:43Z
项目社区:https://github.com/flokkr/docker-hadoop

开源协议:Apache License 2.0

下载


Apache Hadoop docker images

These images are part of the Bigdata docker image series. All of the images use the same base docker image which contains plugin scripts to launch different project in containerized environments.

For more detailed instruction about the available environment variables see the README in the flokkr/docker-baseimage repository.

Docker images are tested with Kubernetes

Getting started with Kubernetes

The easiest way to start is to do a kubectl apply -f . from the ./exmaples directories (Using ephemeral storage!)

For more specific use case it’s recommended to use flekszible. The resource definitions can be found in this repository (./hadoop,./hdfs,./yarn…)

Getting started with Flekszible

Install Flekszible (download binary and put it to the path)

  1. Create a working dir
  1. cd /tmp
  2. mkdir cluster
  3. cd cluster
  1. Add this repository as a source
  1. flekszible source add github.com/flokkr/docker-hadoop
  1. Choose and add required services:
  1. flekszible app add hdfs
  1. Generate Kubernetes resource files
  1. flekszible generate
  1. Lunch the rockets:
  1. kubectl apply -f .

Additional Flekszible options

You can list available apps (after source import):

  1. flekszible app search
  2. +---------+-------------------------------+
  3. | path | description |
  4. +---------+-------------------------------+
  5. | hdfs | Apache Hadoop HDFS base setup |
  6. | hdfs-ha | Apache Hadoop HDFS, HA setup |
  7. ...

The base setup can be modified with additional transformatios:

  1. flekszible definitions search | grep hdfs
  2. ...
  3. | hdfs/persistence | Add real PVC based persistence |
  4. | hdfs/onenode | remove scheduling rules to make it possible to run multiple datanode on the same k8s node. |
  5. ...

You can apply transformations with modifing the Flekszible descriptor file:

Original version:

  1. source:
  2. - url: github.com/flokkr/docker-hadoop
  3. import:
  4. - path: hdfs

Modified:

  1. source:
  2. - url: github.com/flokkr/docker-hadoop
  3. import:
  4. - path: hdfs
  5. transformations:
  6. - type: hdfs/onenode
  7. - type: image
  8. image: flokkr/hadoop:3.2.0