Data virtualization
Data virtualization describes the process of abstracting disparate data sources (databases, applications, file repositories, websites, data services vendors, etc.) through a single data access layer (which may be any of several data access mechanisms).
This abstraction enables data access clients to target a single data access layer, serialization, format, structure, etc., rather than making each client tool handle multiples of any or all of these.
This concept and software are commonly used within data integration, business intelligence, service-oriented architecture data services, cloud computing, enterprise search, master data management, and virtual master data management.
Examples
- Linked Data can use a single hyperlink-based Data Source Name (DSN) to provide a connection to a virtual database layer that is internally connected to a variety of back-end data sources using ODBC, JDBC, OLE DB, ADO.NET, SOA-style services, and/or REST patterns.
- Database virtualization may use a single ODBC-based DSN to provide a connection to a similar virtual database layer.
Functionality
Data virtualization software is an enabling technology that provides some or all of the following capabilities:
- Abstraction – Abstract the technical aspects of stored data, such as location, storage structure, API, access language, and storage technology.
- Virtualized Data Access – Connect to different data sources and make them accessible from one logical place.
- Transformation / Integration – Transform, improve the quality of, and integrate data across multiple sources as needed.
- Data Federation – Combine result sets from multiple source systems.
- Flexible Data Delivery – Publish result sets as views and/or data services that are executed when consuming applications or users request them.
Data virtualization software may include functions for development, operation, and/or management.
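The capabilities listed above can be illustrated with a minimal Python sketch. It is not how any particular product works: the `VirtualLayer` class, the adapter functions, and the sample customer and order data are all hypothetical, and an in-memory SQLite database and a CSV string stand in for two unrelated back-end systems. The sketch shows abstraction (clients ask for a logical name), virtualized access (one entry point), transformation (CSV strings become typed values), and federation (result sets combined on request).

```python
import csv
import io
import sqlite3

# Hypothetical source 1: a relational database (in-memory SQLite here).
crm = sqlite3.connect(":memory:")
crm.execute("CREATE TABLE customers (id INTEGER, name TEXT)")
crm.executemany("INSERT INTO customers VALUES (?, ?)", [(1, "Ada"), (2, "Grace")])

# Hypothetical source 2: a CSV export from some other application.
ORDERS_CSV = "customer_id,amount\n1,19.50\n1,5.25\n2,42.00\n"


def read_customers():
    """Adapter for the relational source; yields plain dicts."""
    for cid, name in crm.execute("SELECT id, name FROM customers ORDER BY id"):
        yield {"customer_id": cid, "name": name}


def read_orders():
    """Adapter for the CSV source; converts text fields to typed values."""
    for row in csv.DictReader(io.StringIO(ORDERS_CSV)):
        yield {"customer_id": int(row["customer_id"]), "amount": float(row["amount"])}


class VirtualLayer:
    """Single logical access point over registered source adapters."""

    def __init__(self):
        self._sources = {}

    def register(self, name, reader):
        self._sources[name] = reader

    def rows(self, name):
        # Clients name a logical data set, not a location, format, or API.
        return list(self._sources[name]())

    def customer_totals(self):
        # Federation: combine result sets from both sources at request time.
        totals = {}
        for order in self.rows("orders"):
            key = order["customer_id"]
            totals[key] = totals.get(key, 0.0) + order["amount"]
        return [
            {"name": c["name"], "total": totals.get(c["customer_id"], 0.0)}
            for c in self.rows("customers")
        ]


layer = VirtualLayer()
layer.register("customers", read_customers)
layer.register("orders", read_orders)
result = layer.customer_totals()
print(result)
```

Nothing is copied into a shared store in this sketch: each call to `customer_totals()` reads both sources again, so a change in either source is visible on the next request.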
History
Enterprise Information Integration (EII) and data federation have been used by some vendors to describe a core element of data virtualization: the capability to create relational JOINs in a federated VIEW. Some forms of legacy data virtualization build on knowledge and concepts developed within EII and data federation.
Newer types of data virtualization do not always require data to be moved in order to construct the view. They may show the results of the relational joins before any data is moved. This capability is a significant point of differentiation between legacy data virtualization vendors (older EII technology) and newer technologies based on persistent metadata servers.
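The idea of relational JOINs in a federated VIEW, evaluated without first copying the data, can be approximated with Python's standard `sqlite3` module. This is only a sketch under stated assumptions: two attached in-memory SQLite databases stand in for separate source systems, and the schema names `crm` and `sales`, the tables, and the sample rows are hypothetical. SQLite's `ATTACH DATABASE` and temporary views are used here as a convenient stand-in, not as the mechanism of any data virtualization product.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Two independent in-memory databases stand in for separate back-end systems.
conn.execute("ATTACH DATABASE ':memory:' AS crm")
conn.execute("ATTACH DATABASE ':memory:' AS sales")

conn.execute("CREATE TABLE crm.customers (id INTEGER PRIMARY KEY, name TEXT)")
conn.execute("CREATE TABLE sales.orders (customer_id INTEGER, amount REAL)")
conn.executemany("INSERT INTO crm.customers VALUES (?, ?)", [(1, "Ada"), (2, "Grace")])
conn.executemany(
    "INSERT INTO sales.orders VALUES (?, ?)", [(1, 19.5), (1, 5.25), (2, 42.0)]
)

# A TEMP view can reference tables in any attached database. It stores only
# the query definition; no rows are copied when the view is created.
conn.execute(
    """
    CREATE TEMP VIEW customer_totals AS
    SELECT c.name AS name, SUM(o.amount) AS total
    FROM crm.customers AS c
    JOIN sales.orders AS o ON o.customer_id = c.id
    GROUP BY c.id, c.name
    """
)

query = "SELECT name, total FROM customer_totals ORDER BY name"
rows_before = conn.execute(query).fetchall()

# A change in one source appears in the view on the next query.
conn.execute("INSERT INTO sales.orders VALUES (2, 8.0)")
rows_after = conn.execute(query).fetchall()

print(rows_before)
print(rows_after)
```

Because the view holds a query rather than data, the join is recomputed against the underlying tables each time it is read, which is the property the paragraph above attributes to views built without data movement.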
Data Virtualization Vendors
- Composite Software
- Denodo Technologies
- IBM
- Informatica
- JBoss - Teiid Data Virtualization
- OpenLink Software
- Queplix
- Radiant Logic
- SAS
- VirtualWorks Group
See also
- Data integration
- Enterprise Information Integration (EII)
- Master data management
- Database virtualization