OpenPipeline
OpenPipeline is open source software for crawling, parsing, analyzing, and routing documents. It is intended to tie together otherwise incomplete solutions for enterprise search and document processing. OpenPipeline provides a common architecture for connectors to data sources, file filters, text analyzers, and modules that distribute documents across a network.
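To make the idea of a common architecture concrete, the following is a minimal sketch of how a connector, a file filter, a text analyzer, and a routing module might be chained together. It is an illustration only and does not reproduce OpenPipeline's actual API: every name in it (Document, list_connector, plain_text_filter, word_count_analyzer, make_router, run_pipeline) is hypothetical, and the routing rule is an arbitrary assumption.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, Iterable, Iterator, List


@dataclass
class Document:
    """Hypothetical unit of work handed from one stage to the next."""
    source: str
    raw: bytes
    text: str = ""
    metadata: dict = field(default_factory=dict)


# A stage receives a document and returns it, possibly modified.
Stage = Callable[[Document], Document]


def list_connector(items: Dict[str, bytes]) -> Iterator[Document]:
    """Stand-in for a connector or crawler: yields documents from a data source."""
    for name, raw in items.items():
        yield Document(source=name, raw=raw)


def plain_text_filter(doc: Document) -> Document:
    """Stand-in for a file filter: extracts text from the raw bytes."""
    doc.text = doc.raw.decode("utf-8", errors="replace")
    return doc


def word_count_analyzer(doc: Document) -> Document:
    """Stand-in for a text analyzer: attaches a simple statistic as metadata."""
    doc.metadata["word_count"] = len(doc.text.split())
    return doc


def make_router(destinations: Dict[str, List[Document]]) -> Stage:
    """Stand-in for a routing module: files each document under a destination."""
    def route(doc: Document) -> Document:
        # Assumed rule for illustration: more than three words counts as "long".
        key = "long" if doc.metadata.get("word_count", 0) > 3 else "short"
        destinations[key].append(doc)
        return doc
    return route


def run_pipeline(docs: Iterable[Document], stages: List[Stage]) -> List[Document]:
    """Pass every document through each stage in order."""
    processed = []
    for doc in docs:
        for stage in stages:
            doc = stage(doc)
        processed.append(doc)
    return processed


if __name__ == "__main__":
    destinations: Dict[str, List[Document]] = {"short": [], "long": []}
    source = {"a.txt": b"hello world", "b.txt": b"one two three four five"}
    stages = [plain_text_filter, word_count_analyzer, make_router(destinations)]
    for doc in run_pipeline(list_connector(source), stages):
        print(doc.source, doc.metadata)
```

Because every stage in this sketch shares one signature, a new filter or analyzer can be added by appending a function to the stage list, which is one plausible way a plugin mechanism could work.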
Features
- Installer
- Job scheduler
- File scanner
- Crawlers
- Doc filters
- Point-and-click admin interface
- Extensible through plugins
Advisory Board
The OpenPipeline Advisory Board includes individuals from enterprise search, text analytics, connector firms, and consultancies. A full list can be found on the OpenPipeline Team page.