IBM Deep Search

Deep Search

Deep Search extracts and structures data from documents in four steps: Parse, Interpret, Index, and Integrate. Try out the first steps on our public system.

Parse

Deep Search parses large collections of PDF documents, such as scientific publications and patents. It detects and delimits document objects that have content, such as paragraphs and tables. This content is then extracted from the objects, such as text from paragraphs and cells from tables.

Try it
Parse tables

Interpret

After parsing a document for its content, Deep Search interprets and enriches it. Paragraphs of text are passed through natural language models. These models identify language structures such as sentences and terms, which are then classified into entity types such as a country or a physical property of a material. Likewise, image objects are detected and interpreted by computer vision models.

Try it
Interpret paragraphs

Index

Deep Search parses and interprets large collections of documents quickly and concurrently on scalable cloud infrastructure. These collections are stored and indexed such that you can search and retrieve any document according to their contents, down to the values and physical units in tables.

Besides indexing your personal document collections, Deep Search already indexes millions of documents from public sources, such as arXiv, Pubmed, and patent offices. These documents are updated regulary and include records from curated databases.

Get access
Search and retrieve documents

Integrate

Use Deep Search to integrate your document collections into knowledge graphs. Knowledge graphs interconnect the entity types that have been discovered across documents. You can then query a graph to answer analytical questions that go beyond searching for keywords. For example:

For each material type that is mentioned in my collection of scientific papers, which of its physical properties have been tested and under which conditions?

For each company that is mentioned in my collection of annual reports, what was its total revenue per year?

Get access
Integrate documents

Unlimited access

Contact us if you are considering a broader use of Deep Search. We can give you access to an unlimited Deep Search service, depending on your use case. You will be able to use this service to evaluate Deep Search for larger document collections.