Opait Text Filters Specifications
Extract text from formatted and marked-up documents for indexing, aggregation, or data mining.
Many applications that deal with unstructured data require access to the text content of formatted or marked-up documents. Organizations that archive documents often need that text to make the archives searchable and to enable content aggregation, reporting, and mining. Search and retrieval applications also need to extract and tokenize text from various file formats.
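To make the extract-and-tokenize step concrete, here is a minimal sketch in Python using only the standard library. It is not the Opait Text Filters API, which this page does not describe; the names `TextExtractor`, `extract_text`, and `tokenize` are hypothetical, and the sketch assumes HTML input only.

```python
import re
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Hypothetical helper: collects text, skipping <script> and <style> bodies."""

    SKIPPED = {"script", "style"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self._chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep character data only when we are outside skipped elements.
        if self._skip_depth == 0:
            self._chunks.append(data)

    def text(self):
        # Join the chunks and collapse runs of whitespace to single spaces.
        return " ".join(" ".join(self._chunks).split())


def extract_text(markup):
    """Return the visible text of an HTML string (assumption: HTML input only)."""
    parser = TextExtractor()
    parser.feed(markup)
    parser.close()
    return parser.text()


def tokenize(text):
    """Lowercase the text and split it into word tokens for indexing."""
    return re.findall(r"\w+", text.lower())


if __name__ == "__main__":
    doc = "<html><body><h1>Annual Report</h1><p>Revenue &amp; costs</p></body></html>"
    print(tokenize(extract_text(doc)))
```

A dedicated filter library would typically cover many more formats (word processor files, PDF, and so on); the sketch only shows the general shape of the task for one markup format.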
Download (143.85K)