Last week we posted on a publishing approach for enterprise search. Along with enterprise search, we have seen more and more ECM clients look to publish content out of the ECM repository for a variety of business reasons including performance, business continuity and reducing costs. This post will highlight how Hadoop can be used within a publishing architecture and explain some of the benefits.
After upgrading Alfresco for one of our financial clients, we started looking at how to improve the tools and process they use to manage web content for multiple external sites. Beginning with an informal survey of how our other customers curate and publish web content it became apparent that each one handles it slightly different. This post will summarize seven different clients and how they are adding, reviewing, approving, and publishing information to the web.
Recently one of our large pharmaceutical clients replaced a Documentum interface in a .NET application with access to an HPI Portal cache. The HPI Portal cache utilizes Apache Solr to store and serve approved content outside of Documentum for scalability and business continuity. This change resolved a significant maintenance problem for the application team, removed a roadblock that had halted a server upgrade, and improved performance of the .NET application for document retrieval. Continue reading
TSG has several clients using Documentum as a repository and a custom front end application for consumption of the records or renditions of records. In most cases there is a mechanism in place such as SCS (Site Caching Services) or TSG’s OpenMigrate PUMA (See CIS Case Study for more details). While a typical Documentum application (ex: Webtop) provides a “one stop shop” for authors and approvers, the interface can be challenging when “consumers” are just looking for quick search and retrieval. This solution provides improved performance, business continuity, and ability to add documents from other systems. One potential risk to using a cache of documents and metadata for search and retrieval is the integrity of data. Publishing techniques are designed to accurately cache records; however there are uncontrollable circumstances that may result in a mismatch. Continue reading