Infoloom Logo

Content Import and Integration

Content in multiple formats can be integrated by creating scripts that extract topics from sources in a variety of formats. 

We will create custom scripts allowing you to map from fields or patterns in your input sources to topics that can be curated within the Networker.

We use Python scripts, and we plan in the future to provide ways for your IT team to directly create the scripts that fit your needs. For the time being, we propose this feature as a service.

Ingest sources

Icon-Funnel-256.png

Extract Topics from Content

The ingest process can be run at any time whenever the content is updated.

If your content is structured, you can define the kind of topics that will be created depending on various elements from your content.

If your content is not structured, we can either create specific processes or integrate third-party tools to extract topics.

Customized Formats

We will extract topics from sources in a variety of formats.

Topics can be acquired from documents in a variety of source formats.

JSON Extract JSON objects into Topics
XML Extract XML elements or attribute values based on XPath patterns.
RDF Extract topics from sources in RDF in multiple notations.
XTM Extract topics and relations from XML Topic Maps format.
Excel Spreadsheet, CSV Use data stored in Excel spreadsheets or CSV files. Customize extraction by assigning a topic type depending on the column.
Databases Extract from fields to topics with field-type dependent topic types.
HTML Extract metadata to create topics. Customize extraction by using headers.
EPUB You can extract topics from ebooks in EPUB format.
Word, OpenOffice Extract topics based on metadata, styles, index terms.
Markdown Extract topics based on headers, or specific patterns
Text Use a list of terms to extract topics from text. Integrate third-party data mining software to get smart extraction of topics from full text.