Cloudera
Cloudera Inc. is a Palo Alto-based enterprise software company that provides Apache Hadoop-based software and services. It contributes to Hadoop and related Apache projects and provides a distribution of Hadoop for the enterprise. Cloudera has two products: Cloudera's Distribution including Apache Hadoop (CDH) and Cloudera Enterprise. CDH is a data management platform that incorporates HDFS, Hadoop MapReduce, Hive, Pig, HBase, Sqoop, Flume, Oozie, ZooKeeper and Hue, and is available free under an Apache license. Cloudera Enterprise is a package that includes CDH, production support and tools designed to make it easier to run Hadoop in a production environment. Cloudera also offers support, consulting and training (both public and private).
In March 2009, Cloudera announced the availability of CDH in conjunction with a $5 million capital injection led by Accel Partners. The launch was first reported by the New York Times.
In May 2010, Cloudera was named by Thomson Reuters' Venture Capital Journal as the most promising startup funded in 2009.
The preferred demonym for an employee of Cloudera is "Clouderan."
See also
- Apache Software Foundation
- Big data
- BigTable
- Cloud computing
- Cloud infrastructure
- Database-centric architecture
- Data structure
- Hadoop
- MapReduce
- HBase
- Online database
- Real time database