Friday, May 23, 2014

Hadoop Ecosystem - a growing list

Hadoop Ecosystem:

As we know, there are many other projects built around the core components of Hadoop, often referred to as the "Hadoop Ecosystem". Below is a fairly comprehensive list, which continues to grow...
  • Distributed Filesystem
    • Hadoop Distributed File System (Apache Software Foundation)
    • HDFS is a distributed file system that stores data on commodity machines, providing very high aggregate bandwidth across the cluster. Prior to Hadoop 2.0.0, the NameNode was a single point of failure (SPOF) in an HDFS cluster. The HDFS High Availability feature addresses this by running two redundant NameNodes in the same cluster in an active/passive configuration with a hot standby, using ZooKeeper for failure detection and automatic failover (a sample client-side HA configuration is sketched after this list).
    • Amazon S3 file system
    • Google File System (Google Inc.)
    • Ceph (Inktank, Red Hat)
    • GlusterFS (Red Hat)
    • Lustre (OpenSFS & Lustre)
  • Distributed Programming
    • MapReduce (Apache Software Foundation)
    • Apache Pig
    • JAQL
    • Apache Spark
    • Stratosphere
    • Netflix PigPen
    • AMPLab SIMR
    • Facebook Corona
    • Apache Twill
    • Damballa Parkour
    • Apache Hama
    • Datasalt Pangool
    • Apache Tez
    • Apache DataFu
    • Pydoop
  • NoSQL Databases
    • Column Data Model
      • Apache HBase
      • Apache Cassandra
      • Hypertable
      • Apache Accumulo
    • Document Data Model
      • MongoDB
      • RethinkDB
      • ArangoDB
    • Stream Data Model
      • EventStore
    • Key-value Data Model
      • Redis DataBase
      • LinkedIn Voldemort
      • RocksDB
      • OpenTSDB
    • Graph Data Model
      • ArangoDB
      • Neo4j
  • NewSQL Databases
    • TokuDB
    • HandlerSocket
    • Akiban Server
    • Drizzle
    • Haeinsa
    • SenseiDB
    • Sky
    • BayesDB
    • InfluxDB
  • SQL-on-Hadoop
    • Apache Hive
    • Apache HCatalog
    • AMPLab Shark
    • Apache Drill
    • Cloudera Impala
    • Facebook Presto
    • Datasalt Splout SQL
    • Apache Tajo
    • Apache Phoenix
  • Data Ingestion
    • Apache Flume
    • Apache Sqoop
    • Facebook Scribe
    • Apache Chukwa
    • Apache Storm
    • Apache Kafka
    • Netflix Suro
    • Apache Samza
    • Cloudera Morphline
    • HIHO
  • Service Programming
    • Apache Thrift
    • Apache Zookeeper
    • Apache Avro
    • Apache Curator
    • Apache Karaf
    • Twitter Elephant Bird
    • LinkedIn Norbert
  • Scheduling
    • Apache Oozie
    • LinkedIn Azkaban
    • Apache Falcon
  • Machine Learning
    • Apache Mahout
    • WEKA
    • Cloudera Oryx
    • MADlib
  • Benchmarking
    • Apache Hadoop Benchmarking
    • Yahoo Gridmix3
    • PUMA Benchmarking
    • Berkeley SWIM Benchmark
    • Intel HiBench
  • Security
    • Apache Sentry
    • Apache Knox Gateway
  • System Deployment
    • Apache Ambari
    • Apache Whirr
    • Cloudera HUE
    • Buildoop
    • Apache Bigtop
    • Apache Helix
    • Hortonworks HOYA
    • Brooklyn
    • Marathon
    • Apache Mesos
  • Applications
    • Revolution R
    • Apache Nutch
    • Sphinx Search Server
    • Apache OODT
    • HIPI Library
    • PivotalR
  • Development Frameworks
    • Spring XD
  • Miscellaneous
    • Talend
    • Apache Tika
    • Twitter Finagle
    • Apache Giraph
    • Concurrent Cascading
    • S4 Yahoo
    • Intel GraphBuilder
    • SpagoBI
    • Jedox Palo
    • Twitter Summingbird
    • Apache Kiji
    • Tableau
    • D3.JS
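
Following up on the HDFS High Availability note near the top of the list, below is a minimal sketch of how a Java client might address an HA-enabled HDFS cluster through a logical nameservice rather than a single NameNode host. It is a sketch only: the nameservice ID, host names, and ZooKeeper quorum are placeholder values, and in a real deployment these properties normally live in hdfs-site.xml and core-site.xml rather than being set in code.

    import java.net.URI;

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.FileSystem;

    public class HaHdfsClient {
        public static void main(String[] args) throws Exception {
            Configuration conf = new Configuration();

            // Address the cluster by a logical nameservice, not a single NameNode host
            // (all names below are placeholders for this example).
            conf.set("fs.defaultFS", "hdfs://mycluster");
            conf.set("dfs.nameservices", "mycluster");
            conf.set("dfs.ha.namenodes.mycluster", "nn1,nn2");
            conf.set("dfs.namenode.rpc-address.mycluster.nn1", "namenode1.example.com:8020");
            conf.set("dfs.namenode.rpc-address.mycluster.nn2", "namenode2.example.com:8020");

            // Client-side proxy that fails over between the active and standby NameNodes.
            conf.set("dfs.client.failover.proxy.provider.mycluster",
                     "org.apache.hadoop.hdfs.server.namenode.ha.ConfiguredFailoverProxyProvider");

            // Automatic failover via ZooKeeper (normally configured on the cluster side).
            conf.setBoolean("dfs.ha.automatic-failover.enabled", true);
            conf.set("ha.zookeeper.quorum",
                     "zk1.example.com:2181,zk2.example.com:2181,zk3.example.com:2181");

            FileSystem fs = FileSystem.get(URI.create("hdfs://mycluster"), conf);
            System.out.println("Connected, home directory: " + fs.getHomeDirectory());
            fs.close();
        }
    }

The last two properties (automatic failover and the ZooKeeper quorum) are really cluster-side settings used by the NameNodes and their ZKFC processes; they are included here only so the sketch shows all the pieces of an HA setup in one place.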

Wednesday, May 14, 2014

Hadoop at a glance

Apache Hadoop, at its core, consists of two components: the Hadoop Distributed File System (HDFS) and Hadoop MapReduce. HDFS is the primary storage system used by Hadoop applications. It creates multiple replicas of data blocks and distributes them across compute nodes throughout a cluster to enable reliable, extremely rapid computation. Hadoop MapReduce is a programming model and software framework for writing applications that rapidly process huge amounts of data in parallel on large clusters of compute nodes. Other Hadoop-related projects (which together form the Hadoop ecosystem) include Hive, Pig, HBase, YARN, Mahout, Oozie, Sqoop, Avro, Cascading, ZooKeeper, Flume, Drill, etc.
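
To make the MapReduce programming model a little more concrete, here is the classic word-count example written against the org.apache.hadoop.mapreduce API, roughly as it appears in the Hadoop tutorials. It is a sketch only: the input and output paths are taken from the command line and are assumed to be HDFS directories.

    import java.io.IOException;
    import java.util.StringTokenizer;

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.Path;
    import org.apache.hadoop.io.IntWritable;
    import org.apache.hadoop.io.Text;
    import org.apache.hadoop.mapreduce.Job;
    import org.apache.hadoop.mapreduce.Mapper;
    import org.apache.hadoop.mapreduce.Reducer;
    import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
    import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

    public class WordCount {

        // Map phase: emit (word, 1) for every word in the input split.
        public static class TokenizerMapper extends Mapper<Object, Text, Text, IntWritable> {
            private final static IntWritable ONE = new IntWritable(1);
            private final Text word = new Text();

            public void map(Object key, Text value, Context context)
                    throws IOException, InterruptedException {
                StringTokenizer itr = new StringTokenizer(value.toString());
                while (itr.hasMoreTokens()) {
                    word.set(itr.nextToken());
                    context.write(word, ONE);
                }
            }
        }

        // Reduce phase: sum the counts for each word.
        public static class IntSumReducer extends Reducer<Text, IntWritable, Text, IntWritable> {
            private final IntWritable result = new IntWritable();

            public void reduce(Text key, Iterable<IntWritable> values, Context context)
                    throws IOException, InterruptedException {
                int sum = 0;
                for (IntWritable val : values) {
                    sum += val.get();
                }
                result.set(sum);
                context.write(key, result);
            }
        }

        public static void main(String[] args) throws Exception {
            Configuration conf = new Configuration();
            Job job = Job.getInstance(conf, "word count");
            job.setJarByClass(WordCount.class);
            job.setMapperClass(TokenizerMapper.class);
            job.setCombinerClass(IntSumReducer.class);   // shrink map output locally
            job.setReducerClass(IntSumReducer.class);
            job.setOutputKeyClass(Text.class);
            job.setOutputValueClass(IntWritable.class);
            FileInputFormat.addInputPath(job, new Path(args[0]));    // HDFS input directory
            FileOutputFormat.setOutputPath(job, new Path(args[1]));  // HDFS output directory
            System.exit(job.waitForCompletion(true) ? 0 : 1);
        }
    }

The map function runs in parallel on the nodes holding the input blocks, the framework groups the intermediate (word, 1) pairs by key, and the reduce function sums them; the combiner simply reuses the reducer to reduce the amount of map output crossing the network.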

Technologies that compete with Hadoop include Google Dremel, HPCC Systems, and Apache Storm.

Google Dremel is a distributed system developed at Google for interactively querying large datasets and powers Google's BigQuery service. 

HPCC (High Performance Computing Cluster) is a massively parallel processing computing platform that solves Big Data problems. 

Apache Storm is a free and open source distributed real-time computation system. Storm makes it easy to reliably process unbounded streams of data, doing for real-time processing what Hadoop did for batch processing. Storm is simple and can be used with any programming language.
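
As an illustration of the programming model, here is a rough sketch of a Storm word-count topology written against Storm's Java API from the 0.9.x era (hence the backtype.storm package names). The spout, component names, and parallelism hints are made up for this example; a real topology would typically read its stream from a queue such as Kafka.

    import java.util.HashMap;
    import java.util.Map;

    import backtype.storm.Config;
    import backtype.storm.LocalCluster;
    import backtype.storm.spout.SpoutOutputCollector;
    import backtype.storm.task.TopologyContext;
    import backtype.storm.topology.BasicOutputCollector;
    import backtype.storm.topology.OutputFieldsDeclarer;
    import backtype.storm.topology.TopologyBuilder;
    import backtype.storm.topology.base.BaseBasicBolt;
    import backtype.storm.topology.base.BaseRichSpout;
    import backtype.storm.tuple.Fields;
    import backtype.storm.tuple.Tuple;
    import backtype.storm.tuple.Values;

    public class WordCountTopology {

        // Toy spout that endlessly emits sentences (illustrative only).
        public static class SentenceSpout extends BaseRichSpout {
            private SpoutOutputCollector collector;
            private final String[] sentences = {
                    "the cow jumped over the moon",
                    "an apple a day keeps the doctor away" };
            private int index = 0;

            public void open(Map conf, TopologyContext context, SpoutOutputCollector collector) {
                this.collector = collector;
            }

            public void nextTuple() {
                collector.emit(new Values(sentences[index]));
                index = (index + 1) % sentences.length;
            }

            public void declareOutputFields(OutputFieldsDeclarer declarer) {
                declarer.declare(new Fields("sentence"));
            }
        }

        // Splits each sentence tuple into one tuple per word.
        public static class SplitSentence extends BaseBasicBolt {
            public void execute(Tuple tuple, BasicOutputCollector collector) {
                for (String word : tuple.getString(0).split("\\s+")) {
                    collector.emit(new Values(word));
                }
            }

            public void declareOutputFields(OutputFieldsDeclarer declarer) {
                declarer.declare(new Fields("word"));
            }
        }

        // Keeps a running count per word (in memory, for illustration only).
        public static class WordCount extends BaseBasicBolt {
            private final Map<String, Integer> counts = new HashMap<String, Integer>();

            public void execute(Tuple tuple, BasicOutputCollector collector) {
                String word = tuple.getStringByField("word");
                Integer count = counts.get(word);
                count = (count == null) ? 1 : count + 1;
                counts.put(word, count);
                collector.emit(new Values(word, count));
            }

            public void declareOutputFields(OutputFieldsDeclarer declarer) {
                declarer.declare(new Fields("word", "count"));
            }
        }

        public static void main(String[] args) throws Exception {
            TopologyBuilder builder = new TopologyBuilder();
            builder.setSpout("sentences", new SentenceSpout(), 1);
            builder.setBolt("split", new SplitSentence(), 4).shuffleGrouping("sentences");
            // fieldsGrouping routes every tuple with the same word to the same counter task.
            builder.setBolt("count", new WordCount(), 4).fieldsGrouping("split", new Fields("word"));

            // Run in-process for testing; use StormSubmitter to deploy to a real cluster.
            LocalCluster cluster = new LocalCluster();
            cluster.submitTopology("word-count", new Config(), builder.createTopology());
            Thread.sleep(10000);
            cluster.shutdown();
        }
    }

The fieldsGrouping on "word" is what makes the counts correct even with several counter bolts running in parallel.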


Hadoop distributions are provided by a growing number of companies. These products include Apache Hadoop or a derivative work thereof, commercial support, and/or tools and utilities related to Hadoop. Some major Hadoop distribution vendors are Cloudera, Hortonworks, MapR, Amazon Web Services, Intel, EMC, IBM, etc.

Wednesday, April 23, 2014

Challenges of Big Data; Is Hadoop meeting the Big Data Challenge?

Are we living in the era of "Big Data"? Yes, of course. In today's technology-fuelled world, computing power has increased significantly, electronic devices are more commonplace, access to the Internet has improved, and users can transmit and collect more data than ever before. Organizations are producing data at an astounding rate; it is reported that Facebook alone collects 250 terabytes a day.

According to Thomson Reuters News Analytics, digital data production has grown from almost 1 million petabytes (equal to about 1 billion terabytes) in 2009 to a projected 7.9 zettabytes (a zettabyte is equal to 1 million petabytes) in 2015, with an estimated 35-40 zettabytes by 2020. Other research organizations offer even higher estimates!

As organizations have begun to collect and produce massive amounts of data, they have recognized the advantages of data analysis. But they have also struggled to manage the massive amounts of information that they have. This has led to new challenges.


Businesses realize that tremendous benefits can be gained by analyzing Big Data related to business competition, situational awareness, productivity, science, and innovation. 


Apache Hadoop meets the challenges of Big Data by simplifying the implementation of data-intensive, highly parallel distributed applications. It allows analytical tasks to be divided into fragments of work and distributed over thousands of computers, providing fast turnaround for analysis and distributed storage for massive amounts of data. 

Hadoop provides a cost-effective way to store huge quantities of data. It provides a scalable and reliable mechanism for processing large amounts of data over a cluster of commodity hardware. And it provides new and improved analysis techniques that enable sophisticated analytical processing of multi-structured data.

Hadoop is different from previous distributed approaches in the following ways:
  • Data is distributed in advance.
  • Data is replicated throughout the cluster of computers for reliability and availability.
  • Data processing tries to occur where the data is stored, which eliminates bandwidth bottlenecks.

In addition, Hadoop provides a simple programming approach that abstracts the complexity evident in previous distributed implementations. As a result, Hadoop provides a powerful mechanism for data analytics, which consists of the following:
  • Vast amount of storage — Hadoop enables applications to work with thousands of computers and petabytes of data. Over the past decade, computer professionals have realized that low-cost "commodity" systems can be used together for high-performance computing applications that once could be handled only by supercomputers. Hundreds of "small" computers can be configured in a cluster to obtain aggregate computing power that far exceeds that of a single supercomputer, at a much lower price. Hadoop can leverage clusters of thousands of machines, providing huge storage and processing power at a price that an enterprise can afford.
  • Distributed processing with fast data access — Hadoop clusters provide the capability to efficiently store vast amounts of data while providing fast data access. Prior to Hadoop, parallel computation applications experienced difficulty distributing execution between machines that were available on the cluster. This was because the cluster execution model creates demand for shared data storage with very high I/O performance. Hadoop moves execution toward the data. Moving the applications to the data alleviates many of the high performance challenges. In addition, Hadoop applications are typically organized in a way that they process data sequentially. This avoids random data access (disk seek operations), further decreasing I/O load.
  • Reliability, failover, and scalability — In the past, implementers of parallel applications struggled to deal with the issue of reliability when moving to a cluster of machines. Although the reliability of an individual machine is fairly high, the probability of failure grows as the size of the cluster grows. It is not uncommon to have daily failures in a large cluster of thousands of machines. Because of the way Hadoop was designed and implemented, a failure (or set of failures) will not create inconsistent results. Hadoop detects failures and retries execution (by utilizing different nodes). Moreover, the scalability support built into Hadoop's implementation allows additional (or repaired) servers to be brought into a cluster seamlessly and leveraged for both data storage and execution. For most Hadoop users, the most important feature of Hadoop is the clean separation between business programming and infrastructure support. For users who want to concentrate on business logic, Hadoop hides infrastructure complexity and provides an easy-to-use platform for performing complex, distributed computations on difficult problems.
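
To ground the storage and reliability points above, here is a minimal sketch of a client using the HDFS FileSystem API in Java: it writes a small file, asks HDFS to keep three replicas of its blocks (the common default), and reads the file back. The path and replication factor are example values, and the Configuration object is assumed to pick up the cluster settings from core-site.xml and hdfs-site.xml on the classpath.

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.FSDataInputStream;
    import org.apache.hadoop.fs.FSDataOutputStream;
    import org.apache.hadoop.fs.FileSystem;
    import org.apache.hadoop.fs.Path;

    public class HdfsReplicationDemo {
        public static void main(String[] args) throws Exception {
            // Picks up fs.defaultFS and other settings from the Hadoop config on the classpath.
            Configuration conf = new Configuration();
            FileSystem fs = FileSystem.get(conf);

            Path file = new Path("/tmp/replication-demo.txt");   // example path

            // Write a small file; HDFS splits files into blocks and replicates each block
            // across several DataNodes.
            FSDataOutputStream out = fs.create(file);
            out.writeUTF("Hadoop replicates each block across multiple DataNodes.");
            out.close();

            // Request three replicas per block for this file (the common default).
            fs.setReplication(file, (short) 3);

            // Read it back; the client is served by whichever DataNode holds a healthy replica.
            FSDataInputStream in = fs.open(file);
            System.out.println(in.readUTF());
            in.close();

            fs.close();
        }
    }

Because each block lives on several DataNodes, the loss of a single machine does not make the file unavailable, which is the replication-based reliability described in the bullets above.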