Content Import and Integration
Content in multiple formats can be integrated by creating scripts that extract topics from sources in a variety of formats.
We will create custom scripts allowing you to map from fields or patterns in your input sources to topics that can be curated within the Networker.
We use Python scripts, and we plan in the future to provide ways for your IT team to directly create the scripts that fit your needs. For the time being, we propose this feature as a service.
Extract Topics from Content
The ingest process can be run at any time whenever the content is updated.
If your content is structured, you can define the kind of topics that will be created depending on various elements from your content.
If your content is not structured, we can either create specific processes or integrate third-party tools to extract topics.
We will extract topics from sources in a variety of formats.
Topics can be acquired from documents in a variety of source formats.
|JSON||Extract JSON objects into Topics|
|XML||Extract XML elements or attribute values based on XPath patterns.|
|RDF||Extract topics from sources in RDF in multiple notations.|
|XTM||Extract topics and relations from XML Topic Maps format.|
|Excel Spreadsheet, CSV||Use data stored in Excel spreadsheets or CSV files. Customize extraction by assigning a topic type depending on the column.|
|Databases||Extract from fields to topics with field-type dependent topic types.|
|HTML||Extract metadata to create topics. Customize extraction by using headers.|
|EPUB||You can extract topics from ebooks in EPUB format.|
|Word, OpenOffice||Extract topics based on metadata, styles, index terms.|
|Markdown||Extract topics based on headers, or specific patterns|
|Text||Use a list of terms to extract topics from text. Integrate third-party data mining software to get smart extraction of topics from full text.|