Cloudera
Cloudera Inc. is a Palo Alto-based enterprise software company that provides Apache Hadoop-based software and services. It contributes to Hadoop and related Apache projects and provides a Hadoop distribution for the enterprise. Cloudera has two products: Cloudera's Distribution including Apache Hadoop (CDH) and Cloudera Enterprise. CDH is a data management platform that incorporates HDFS, Hadoop MapReduce, Hive, Pig, HBase, Sqoop, Flume, Oozie, ZooKeeper and Hue, and is available free under an Apache license. Cloudera Enterprise is a package that includes CDH, production support and tools designed to make it easier to run Hadoop in a production environment. Cloudera offers services including support, consulting and training (both public and private).

In March 2009, Cloudera announced the availability of CDH in conjunction with a $5 million capital injection led by Accel Partners. The launch was first reported by The New York Times.

In May 2010, Cloudera was named by Thomson Reuters Venture Capital Journal as the most promising startup funded in 2009.

The preferred demonym for an employee of Cloudera is "Clouderan."

See also

  • Apache Software Foundation
  • Big data
  • BigTable
  • Cloud computing
  • Cloud infrastructure
  • Database-centric architecture
  • Data structure
  • Hadoop
  • MapReduce
  • HBase
  • Online database
  • Real time database

The source of this article is Wikipedia, the free encyclopedia. The text of this article is licensed under the GFDL.
 