TREX search engine
Encyclopedia
TREX is a search engine in the SAP NetWeaver
integrated technology platform produced by SAP AG
. The TREX engine is a standalone component that can be used in a range of system environments but is used primarily as an integral part of such SAP products as Enterprise Portal, Knowledge Warehouse, and Business Intelligence (BI, formerly SAP Business Information Warehouse
). In SAP NetWeaver BI, the TREX engine powers the BI Accelerator, which is a plug-in appliance for enhancing the performance of online analytical processing. The name "TREX" stands for Text Retrieval and information EXtraction, but it is not a registered trade mark of SAP and is not used in marketing collateral.
TREX supports text mining and classification using a vector space model
. Groups of documents can be classified using query based classification, example based classification, or a combination of these plus keyword management.
TREX supports structured data search not only for document metadata but also for mass business data and data in SAP business objects. Indexes for structured data are implemented compactly using data compression
and the data can be aggregated in linear time, to enable large volumes of data to be processed entirely in memory.
Recent developments include:
NetWeaver
SAP NetWeaver is SAP's integrated technology computing platform and is the technical foundation for many SAP applications since the SAP Business Suite. SAP NetWeaver is marketed as a service-oriented application and integration platform...
integrated technology platform produced by SAP AG
SAP AG
SAP AG is a German software corporation that makes enterprise software to manage business operations and customer relations. Headquartered in Walldorf, Baden-Württemberg, with regional offices around the world, SAP is the market leader in enterprise application software...
. The TREX engine is a standalone component that can be used in a range of system environments but is used primarily as an integral part of such SAP products as Enterprise Portal, Knowledge Warehouse, and Business Intelligence (BI, formerly SAP Business Information Warehouse
SAP Business Information Warehouse
SAP Netweaver Business Warehouse is the name of the Business Intelligence, analytical, reporting and Data Warehousing solution produced by SAP AG. It was originally named SAP BIW , then abbreviated to SAP BW, but is now known as "SAP BI" at the end user level...
). In SAP NetWeaver BI, the TREX engine powers the BI Accelerator, which is a plug-in appliance for enhancing the performance of online analytical processing. The name "TREX" stands for Text Retrieval and information EXtraction, but it is not a registered trade mark of SAP and is not used in marketing collateral.
Search functions
TREX supports various kinds of text search, including exact search, boolean search, wildcard search, linguistic search (grammatical variants are normalized for the index search) and fuzzy search (input strings that differ by a few letters from an index term are normalized for the index search). Result sets are ranked using term frequency-inverse document frequency (tf-idf) weighting, and results can include snippets with the search terms highlighted.TREX supports text mining and classification using a vector space model
Vector space model
Vector space model is an algebraic model for representing text documents as vectors of identifiers, such as, for example, index terms. It is used in information filtering, information retrieval, indexing and relevancy rankings...
. Groups of documents can be classified using query based classification, example based classification, or a combination of these plus keyword management.
TREX supports structured data search not only for document metadata but also for mass business data and data in SAP business objects. Indexes for structured data are implemented compactly using data compression
Data compression
In computer science and information theory, data compression, source coding or bit-rate reduction is the process of encoding information using fewer bits than the original representation would use....
and the data can be aggregated in linear time, to enable large volumes of data to be processed entirely in memory.
Recent developments include:
- A join engine to join structured data from different fields in business objects
- A fast update capability to write a delta index beside a main index and to merge them offline while a second delta index takes updates
- A data miningData miningData mining , a relatively young and interdisciplinary field of computer science is the process of discovering new patterns from large data sets involving methods at the intersection of artificial intelligence, machine learning, statistics and database systems...
feature pack for advanced mathematical analysis