System architecture
This chapter describes the architecture and deployment models of IBM Production Imaging Edition with an emphasis on IBM Datacap Taskmaster Capture (Taskmaster).
This chapter includes the following sections:
2.1 Architecture overview of Production Imaging Edition
The Production Imaging Edition offering consists of Taskmaster, FileNet Content Manager (including the business process management (BPM) tools), FileNet Workplace XT, and Daeja ViewONE Pro. Figure 2-1 illustrates these components and their interactions at a high level. From top to bottom, you see the client layer, the application logic and presentation layer, and the data layer. The Taskmaster components are highlighted in green, and the components delivered with the FileNet Content Manager platform are highlighted in yellow.
Figure 2-1 Production Imaging Edition architecture
2.2 Components of the Taskmaster system
The Taskmaster system includes the following components:
Business applications or databases
Datacap Studio
File server
Lightweight Directory Access Protocol (LDAP) or Active Directory
NENU
Rulerunner service
RV2
Taskmaster databases
Taskmaster Server
Taskmaster thick client
Taskmaster Web Client
Taskmaster Web Server
Figure 2-2 shows a detailed view of Taskmaster system architecture.
Figure 2-2 Details of the Taskmaster system architecture
This section highlights each component in the Taskmaster system.
Taskmaster Server
The Taskmaster Server is the heart of the Taskmaster system. It manages and serves batches to workstations and users. It also orchestrates the tasks according to the workflow of the Taskmaster application. It provides user authentication and access control, assigns batch IDs, controls batch queuing, and controls access to the Taskmaster databases.
All communications between the Taskmaster Server and its clients, or the other core Taskmaster Server components, use the Datacap Taskmaster socket protocol. For communicating with the databases, it uses Microsoft Object Linking and Embedding for Database (OLE DB). It also uses the Common Internet File System (CIFS) interface to mount the file share that is required to access batches. Taskmaster Server also uses Active Directory Service Interfaces (ADSI) or LDAP to communicate with the Directory Service for user authentication.
Taskmaster databases
For its operation, a Taskmaster application relies on the following relational databases that are hosted in a production system in Microsoft SQL Server or Oracle:
The Administrator database stores information about users, groups, workstation, auditing, functional security, and application configuration. It also stores workflow configurations.
The Engine database stores information about batches, statistics, and queue states.
The Fingerprints database manages the pointers to the fingerprints that are used in a given application.
Each application has its own set of self-contained databases. Also, in many cases, Taskmaster applications need access to other databases to perform lookups or export data to line of business (LOB) systems and databases.
In Taskmaster sample and add-on applications, Microsoft Access database is also used for portability reasons but must not be used in production.
Taskmaster file server
A file server hosts image files, extracted data and control files, and files that are required for running various applications such as the fingerprint files and document hierarchy definition files. The file server must be shared across all the components that need to process the batches.
Taskmaster thick client
The Taskmaster thick client is used to launch user-attended tasks, user and functional security administration, and application configuration. In addition to communicating with the Taskmaster Server, the client requires access to the file server for processing batches (such as viewing a page). It also needs direct access to the fingerprint database and possibly to the lookup databases.
Datacap Studio
Datacap Studio is used to configure the Taskmaster applications, by defining and assembling the document hierarchies, recognition zones and fields, rules, and actions. It requires access to the file server and the Taskmaster databases.
Taskmaster Web
Taskmaster Web interfaces with the Taskmaster Server to serve web pages and documents to web users and to upload documents that are scanned or imported remotely. It is configured as a virtual directory (or site) in Internet Information Services (IIS) for Windows Server. It handles all communications with the back-end services through the Taskmaster Server.
Taskmaster Web client
The Taskmaster Web client runs inside an Internet browser. The ActiveX components are required for scanning and uploading scanned pages to the server and for data entry or verification.
The Rulerunner service
The Rulerunner service runs all tasks that do not require operator intervention, such as image cleaning, conversion, recognition, classification, and export to FileNet Content Manager and databases. It interfaces with FileNet Content Engine through the Web Services application programming interface (API).
RV2
RV2 is a web application that is used to display Taskmaster reports on system activities such as batch status, station activity, or problem batches.
NENU
NENU is used to automate system health and housekeeping tasks. Such tasks include batch monitoring, status notification, and automatic deletion of completed batches. Tasks are scheduled by using the Microsoft Windows Scheduler.
Connection to a business application or database
Typically, connection to a business application or database is through Open Database Connectivity (ODBC). For example, a customer, vendor, or purchase information can be queried against a database and used for image process verification purposes. In addition, information that is extracted from an image can be exported to business applications and databases.
Connection to an LDAP or Active Directory service
An LDAP or Active Directory service is also often part the configuration for Taskmaster users to authenticate.
2.3 Components of FileNet Content Manager
FileNet Content Manager bundled with Production Imaging Edition consists of the following components:
Content Engine
FileNet Enterprise Manager
Process Engine
Case Analyzer
Process Simulator
Business Process Manager Tools
Daeja ViewOne Pro viewer
Figure 2-3 shows a detailed view of the FileNet Content Manager architecture.
Figure 2-3 FileNet Content Manager architecture
This section highlights each component of FileNet Content Manager.
Content Engine
The Content Engine runs in a Java 2 Platform, Enterprise Edition (J2EE), application server. It interfaces with the object stores in Microsoft SQL Server, Oracle, or IBM DB2® to store the document metadata. It also interfaces with File Storage Areas or Fixed Storage Areas to store document content files. In addition, it interfaces with the LDAP server for the administration of users.
FileNet Enterprise Manager
The FileNet Enterprise Manager is used for the administration and configuration of FileNet Content Manager. It runs as a thick client application that connects to the Content Engine by using its COM API.
Process Engine
The Process Engine runs the workflows that route image documents that are stored in the Content Engine to users. It typically runs on its own machine and connects to a Microsoft SQL Server, Oracle, or DB2 database to store runtime workflow data and queue states.
Case Analyzer
The Case Analyzer interfaces with the Process Engine to accumulate statistical information about workflows in an Online Analytical Processing (OLAP) cube in Microsoft SQL Server.
Process Simulator
The Process Simulator interfaces with the Process Analyzer database to use historical data. Alternatively, it interfaces with the workflow definitions in the Content Engine to run simulations.
Business Process Manager Tools
The BPM Tools consist of the Process Designer, Process Tracker, Process Administrator, and the Simulation Console and Designer. They are all Java-based applications that run on the Application Engine of FileNet Content Manager as a thin client.
Daeja ViewONE Pro
Daeja ViewONE Pro is used by FileNet Content Manager users who run FileNet Workplace XT in Microsoft Internet Explorer to display, annotate, and redact documents. Daeja ViewONE Pro also offers additional servlets in addition to the standard version for performing redaction and streaming. These additional servlets require extra licensing.
Figure 2-4 shows a diagram of the Daeja ViewONE redaction and streaming architecture.
Figure 2-4 Daeja ViewONE redaction and streaming architecture
2.4 Overall system architecture
Figure 2-5 shows the overall system architecture of Production Imaging Edition.
Figure 2-5 Overall architecture of Production Imaging Edition
For more information about FileNet Content Manager, FileNet P8 Platform in general, or FileNet Business Process Manager (BPM), see the following IBM Redbooks publications:
IBM FileNet Content Manager Implementation Best Practices and Recommendations, SG24-7547
IBM FileNet P8 Platform and Architecture, SG24-7667
Introducing IBM FileNet Business Process Manager, SG24-7509
2.5 Deployment of Production Imaging Edition
Production Imaging Edition combines the deployment capabilities of FileNet Content Manager and Taskmaster to provide flexible deployment models. These models include centralized deployment, distributed deployment, and web deployment.
2.5.1 Centralized deployment
Centralized deployment is where all the production imaging operations, including capture and content management, are done in one location. This type of deployment is used typically when operations need to be concentrated in one place, such as in a traditional mailroom scenario. This approach is best suited when incoming image volumes are high and when economies of scale can be derived from pooling resources and specializing operators to specific tasks, similar to an assembly line.
For example, in a mailroom, high-end production scanners can be used to handle the bulk of the scanning volumes of an organization. Paper documents are collected and shipped to the central location where they are scanned, processed, and exported to FileNet Content Manager. After the documents are ingested in FileNet Content Manager, users get their work items and documents over the wide area network (WAN) or over the local area network (LAN) if they are at the central site.
Figure 2-6 shows an example of a centralized deployment.
Figure 2-6 Centralized deployment
2.5.2 Distributed deployment
A distributed deployment configuration is ideal for organizations with geographically dispersed user populations and resources and where key system resources can be located closest to users. A distributed deployment can be thought of as a variant of the central deployment model. In a distributed deployment, a large population of users and sizable capture operations justify installing system resources in regional offices.
For example, in a regional office, you might want to install an Application Engine and a Content Engine with content caching, an instance of Taskmaster, and a departmental scanner.
Figure 2-7 shows an example of distributed deployment.
Figure 2-7 Distributed deployment
After the documents are ingested into FileNet Content Manager, users in the regional office can connect to the local Application Engine. In turn, the Application Engine transparently communicates with the remote Process Engine (at the central site) to get their work items. With the local Content Engine cache, the users can get the images that have been scanned locally and released by Taskmaster to the local Content Engine instance. The images stay in cache for local consumption while they are written to the file storage areas at the central site. This setup provides the best performance for local users because images do not need to be transferred over the WAN.
Also, because the object stores that hold the document metadata remain at the central site, enable “request forwarding” on the central Content Engine. Enabling request forwarding can improve performance by forwarding queries to the central Content Engine for execution against the object stores and by sending back only the result sets.
For more information about the deployment options of the FileNet P8 platform, see IBM FileNet P8 Platform and Architecture, SG24-7667.
2.5.3 Taskmaster Web deployment
Flexibility can be added to the previous two deployment scenarios with Taskmaster Web. Taskmaster Web provides close to the same functionality that is available in its thick client, including scanning, importing, indexing, and verifying documents, in addition to administering the Taskmaster system. Essentially, all user-attended functions of the typical Taskmaster process can be performed through a browser.
For example, by adding Taskmaster Web to the deployment scenarios described earlier, you can perform the following tasks:
Supplement the indexing and verification operations for documents that have been scanned at the central location by using resources in remote locations. This task is ideal in situations where remote users are most familiar with the content being processed or where additional assistance is required to handle peak scan volumes.
Distribute “document-at-a-time” scanning at the source, while having the indexing and verification done centrally. For example, in this scenario, shipping personnel scan documents locally, while the indexing and verification are executed centrally, where more customer information might be available from the LOB systems.
Offload all scanning, indexing, and verifying operations to the local offices. These offices have all the information necessary for these operations and are most likely to use the documents after they are committed to FileNet Content Manager. This task is possible considering that the volumes for each individual are manageable. In this scenario, you do not need many resources in the central location beyond simply monitoring and maintaining the systems.
2.6 Conclusion
This chapter described the system architecture of Production Imaging Edition and its components. It also highlighted the various deployment options.
Chapter 3, “Production imaging functionality” on page 51, focuses on the capabilities, user interfaces, and internal processes of Taskmaster.
..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset