Semantic toolkit for SharePoint

Access Innovations has announced that its Data Harmony suite of content enrichment and thesaurus management tools can now be fully integrated with Microsoft SharePoint 2010.

Data Harmony fills semantic gaps in SharePoint to help users take full advantage of their metadata through auto classification, enterprise taxonomy management, entity extraction, and search enhancements. The end result is information assets that are more searchable and more accessible.

“While SharePoint 2010 enables basic importing of an external taxonomy file and some ongoing management, it lacks a truly useful taxonomy management tool. By integrating SharePoint with Data Harmony’s MAIstro products, users can easily create and manage a robust taxonomy that offers extensive subject metadata with document contributor access, immediate and accurate term suggestions for efficient tagging, expanded search through semantic associations and collaboration through discovered metadata,” said Margie Hlava, president of Access Innovations.

Hlava added, “By combining SharePoint with Data Harmony, an organisation can organise its information more accurately, making it easier to file and share that information, locate and retrieve that information, and collaborate with colleagues. As the information in SharePoint is tagged by adding controlled subject keywords, the content becomes much more valuable to a company and its users. The system is then truly collaborative by allowing reuse of earlier findings, saving staff time – which is money – and ensuring positive growth for the organisation.”

MAIstro combines taxonomy and thesaurus construction and management with automatic machine aided indexing to produce indexing that can be more than 90 percent accurate and that enables browsing by subject, query auto-completion, broader terms, narrower terms and related terms. Automatic completion of thoughts as staff members type is also supported by the taxonomy tools, Hlava said.

Under the integrated system, an Event Handler sends the document being uploaded to SharePoint to the Data Harmony server first. Documents can be sent to the Data Harmony server in full text, all MS Office formats, HTML, PDF formats or other data feeds.

From there, the Data Harmony server attaches indexing terms and other desired metadata using Machine Aided Indexer (M.A.I.) in combination with a metadata and entity extractor, with Thesaurus Master hosting the client taxonomy. The indexed document is then uploaded to Microsoft SharePoint Server 2010. Search can be done using the MS SharePoint Search, FAST Search or other search software such as Perfect Search.

Integrating Data Harmony with SharePoint 2010 can help users continually add to and revise their taxonomy, reuse and download their taxonomy as needed and implement their taxonomy on the search side of their website.

In addition, the taxonomy created through the integration of SharePoint with Data Harmony follows the ANSI/NISO Z39.19 standard for taxonomy construction and the comparable international standards.

Data Harmony can also integrate with other systems, such as those of OpenText, EMC Documentum, and MarkLogic, as well as SharePoint 2007, to support an enterprise-wide taxonomy strategy.

 

Business Solution: