Gartner defines enterprise AI search as platforms that enable retrieval and synthesis of information across enterprise repositories. These platforms are a key technology for developing AI assistants and AI agents that scale to enterprise needs using retrieval-augmented generation (RAG). They integrate with a wide range of advanced natural language processing (NLP), machine learning (ML) and large language model (LLM) technologies that are essential to knowledge management processes. They are designed to be customized and tuned for specific domains, but often come with prepackaged integrations and experiences for some enterprise applications. Enterprise AI search tools are pivotal for humans and machines that need to find and synthesize information to derive insight, so that they can then make decisions and take actions. The platforms connect to a wide variety of data sources, normalize and classify information, index it, and match and rank the most relevant results. Their user experiences are commonly customized, and the platforms themselves are increasingly used as a foundation for building AI assistants for a wide variety of operational use cases. Those building RAG-based systems should consider how to configure enterprise search platforms to deliver AI assistants and, in the future, AI agents.
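The normalize, index, and match-and-rank steps described above can be illustrated with a minimal sketch. It assumes a toy in-memory corpus and a simple TF-IDF score; the document IDs, sample texts and function names are hypothetical. Commercial platforms add connectors, classification, access control and typically semantic (embedding-based) ranking, none of which is shown here.

```python
import math
import re
from collections import Counter, defaultdict


def normalize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(docs):
    """Build an inverted index: term -> {doc_id: term frequency}."""
    index = defaultdict(dict)
    for doc_id, text in docs.items():
        for term, tf in Counter(normalize(text)).items():
            index[term][doc_id] = tf
    return index


def search(index, n_docs, query, top_k=3):
    """Rank documents by a simple TF-IDF sum over the query terms."""
    scores = defaultdict(float)
    for term in normalize(query):
        postings = index.get(term, {})
        if not postings:
            continue
        # Terms that appear in fewer documents get a higher weight.
        idf = math.log(1 + n_docs / len(postings))
        for doc_id, tf in postings.items():
            scores[doc_id] += tf * idf
    # Highest score first; ties broken by document ID for stable output.
    return sorted(scores, key=lambda d: (-scores[d], d))[:top_k]


# Hypothetical documents from three different repositories.
docs = {
    "wiki/onboarding": "Supplier onboarding checklist and approval steps",
    "crm/note-17": "Customer asked about invoice approval delays",
    "drive/policy": "Travel expense policy and approval limits",
}
index = build_index(docs)
results = search(index, len(docs), "supplier onboarding approval")
```

In a RAG-based assistant, the top-ranked documents in `results` would then be passed to an LLM as context for answer synthesis; that step is outside this sketch.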
Gartner defines intelligent document processing (IDP) solutions as specialized data integration tools that enable automated extraction of data from multiple formats and various layouts of document content. IDP solutions ingest data for dependent applications and workflows and can be provided as a software product and/or as a service. Organizations receive and process documents in multiple formats to enable activities such as onboarding new suppliers or receiving loan applications and insurance claims. This results in large volumes of documents whose content is designed for human comprehension rather than machine processing. Extracting data from that content is essential for document processing and the automated activities it supports. IDP solutions fulfill this role, augmented by people and potentially replacing them. Documents are received either in physical form, typically paper, which must be scanned for digitization, or in digital form, such as emails and PDFs. The content of these documents has varying layouts, ranging from structured formats, such as tables, outlines (e.g., a list or hierarchy of headings), invoices or contracts, to unstructured formats (i.e., free-flowing text, such as an email). Layouts that fall between structured and unstructured, or that mix the two, are often referred to as semistructured.
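As a rough illustration of extracting data from semistructured content, the sketch below pulls a few labeled fields out of already-digitized invoice text. It uses plain regular expressions as a stand-in: IDP products generally rely on OCR and trained models rather than hand-written patterns. The field names, patterns and sample text are all hypothetical.

```python
import re

# Hypothetical patterns for labeled fields in digitized invoice text.
FIELD_PATTERNS = {
    "invoice_number": re.compile(
        r"Invoice\s*(?:Number|No\.?|#)\s*:?\s*([A-Z0-9-]+)", re.I
    ),
    "date": re.compile(r"Date\s*:?\s*(\d{4}-\d{2}-\d{2})", re.I),
    "total": re.compile(
        r"Total\s*(?:Due)?\s*:?\s*\$?([0-9][0-9,]*\.[0-9]{2})", re.I
    ),
}


def extract_fields(text):
    """Return one record with each field's first match, or None if absent."""
    record = {}
    for name, pattern in FIELD_PATTERNS.items():
        match = pattern.search(text)
        record[name] = match.group(1) if match else None
    return record


# Semistructured sample: labeled fields mixed with free-flowing text.
sample = """ACME Supplies
Invoice No: INV-2041
Date: 2024-03-15
Thanks for your business. Please pay within 30 days.
Total Due: $1,250.00
"""
record = extract_fields(sample)
```

Fields that cannot be found are returned as `None`, which gives a downstream workflow a simple signal for routing the document to a person for review.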