PSIcapture Administrator Guide: Microsoft Azure Storage Migration Configuration

version 7.9.x   Download Pending

 Note

This article includes advanced Administrator areas for PSIcapture as well as Microsoft Azure's proprietary systems.

 Audience

This article is meant for PSIcapture Administrators.

 

Overview

 

The Microsoft Azure Migration allows users to migrate content from PSIcapture to Microsoft Azure's platform, with a wide variety of configuration options. Below is a breakdown of configuration options available in this migration, and how to link those elements to your Microsoft Azure configuration.

 

Microsoft Azure Storage Migration General Options

 

1.png

 

Azure Storage Connection Options

Authentication Type

  • Storage Access Keys: An access key created along with the storage resource, providing authenticated, full access.
  • Shared Access Signature: A temporary access token to the storage resource, usually with a policy defining its expiration, access area, read/write permissions, etc.

Account Name

This is the name specified to the storage resource when it is created. It usually ends up being the first portion of the endpoint subdomain, (e.g. psigentest.blob.core.windows.net)

Access Key / SAS Token

Enter the value found in your Microsoft Azure storage account's Shared Access Signature or Access Keys configuration. 

 

adv111.png

 

Advanced

Endpoint Suffix

If your endpoint URL is non-standard, change its suffix and protocol here.

 

Azure Storage Options

Storage Type

  • Blob Storage: Unstructured data storage (e.g. text or binary data).
  • File Storage: Structured file storage, such as files or images to be stored in a folder hierarchy.

Important Note: The "Directory Design" tab will be disabled for Blob Storage targets as this storage type does not handle structured data.

Blob Container / File Share

  • Existing: Migrate files and data to a currently created storage container within the Storage account.
  • Map to field: Migrate files and data to a storage container using the name associated with an Index Field.
  • Manual: Enter the static name of the storage container.
  • Auto-create missing Container if it does not exist: Given the above options (where applicable), enable Azure to create the target storage container if it does not existing already.

 

Image File Options

Migrate Image Files

Select Migrate Image Files to output the tiff files.

Image Source

Select the Image Source:

  • Main Image – Image you see within the capture viewer
  • Original Image – Only available if option on step 2 of 9 is selected (this image is not modified by Image Processing)
  • Alternate Images – Alternate images can be created at various times during the workflow (IE Image Processing: Crop)

Storage Location

Select Storage Location by choosing a pre-defined location or configuring a new one.

Image Output Type

Select the Image Output Type: Multi-Page Tiff Image, Single Page Tiff Images, Single Page JPEG Images, Single Page BMP Images, Single Page GIF Images, Single Page PNG Images, Multi-Page PDF (Image Only), or Single Page PDFs (Image Only).

JPEG Quality

The default is 75 and is a good blend between quality and size. Raising the number will increase size and may increase the quality of the file. This setting affects Color, Grayscale and JPEG files only.

Tags to Output

When Alternate Images are selected as the Image Source, user can specify the image tag here. Only alternate images with the matching tag will be outputted. NOTE: Separate multiple tags using commas.

Alt. Page Behavior

Select Alt. Page Behavior: One Document with all tags, or One Document with each tag.

 

OCR and Extended File Format File Options

 

Migrate OCR Files

Select Migrate OCR Files to output the OCR files. This is for a files created in the OCR module.

Storage Location

Select Storage Location by choosing a pre-defined location or configuring a new one.

OCR Files to Migrate

Select OCR Files To Migrate: All OCR Files for Document, Single OCR File by workflow position, or OCR Files by file tags.

OCR File Tags to Output

Select OCR File Tags to Output: When OCR Files by file tags are selected, only files with matching image tags will be outputted. NOTE: Separate multiple tags using commas.

 

Advanced Migration Settings

 

2.png

 

Single Page Output Options

Storage Option

Select storage options that best suit your business needs.

Naming Option

Select format by which the files are named.

 

Other Microsoft Azure Options

Duplicate image/OCR file behavior: Choose the naming scheme for duplicate image/OCR files when migrating.

 

External Application Options

The user may wish to execute another windows application as a part of this migration. In order to accomplish this, the user must configure the following:

 

Application to Launch

Select the windows executable to launch.

Application Arguments

Supply any command line arguments, if any. These are supplied by the executable’ developer.

Launch Timing

The user can choose between the following timing options to launch the executable:

  • Launch application after each Batch is migrated
  • Launch application after all selected Batches are migrated

Wait for Application to Complete - PSIcapture will wait for the application to complete its process before continuing the workflow. 

  • Maximum Time (in Minutes) - This specifies how long PSIcapture will wait for the given process above to complete before proceeding with the workflow.

 

Field Settings

 

3.png

 

The Field Settings tab provides a tool to map index values from the capture process to index or metadata fields in Microsoft Azure's Web Portal.

Important Note: Microsoft Azure can host separate index values at the Document and Folder level. Ensure you specify the appropriate index fields when mapping under the "Field setting options - Documents" and "Field setting options - Folders".

The Field Name column represents the index field name defined for this Document Type, while the Linked Field Name column represents the index field name in Microsoft Azure.

The Output Format supports Text Field Masking.

To include a field to be migrated, select the corresponding checkbox in the "Include" column.

NOTE: If there are fields that do not match fields in the dropdown list they will be highlighted yellow.

 

Custom File Naming

 

4.png

 

File naming tabs have a universal interface. The example shown here is the Single Page File Naming tab. It supports the use of system and index data to name the files that are being migrated to the Microsoft Azure Portal. For example, the Client, Platform, Batch_Number, and Load_Date as index fields could be concatenated to create a naming pattern for the migrated files in Microsoft Azure.

These settings apply to custom files migrated.

File Extension is available with the Text migration settings. Users can add a custom file name.

 

Single Page File Naming

 

5.png

 

File naming tabs have a universal interface. The example shown here is the Single Page File Naming tab. It supports the use of system and index data to name the files that are being migrated to the Microsoft Azure Portal. For example, the Client, Platform, Batch_Number, and Load_Date as index fields could be concatenated to create a naming pattern for the migrated files in Microsoft Azure.

These settings apply to single page custom files migrated.

File Extension is available with the Text migration settings. Users can add a custom file name.

 

Directory Design

 

6.png

 

Important Note: The "Directory Design" tab will be disabled for Blob Storage targets as this storage type does not handle structured data.

The directory design tab functions similarly to a folder design tab, and supports the use of system and index data in naming of the directories that are being migrated to the target system.

Directory Design should be constructed initially by the Microsoft Azure Administrator, and then that same design can be customized here to match your Microsoft Azure organization scheme.

NOTE: If a folder of the same value already exists nothing is done.

 

Pre-Processing

 

7.png

 

Bates Stamping Options

Bates Stamping is a legal industry standard for organizing and numbering multi-page legal documents. PSIcapture provides two methodologies for applying Bates Stamping: Capture Bates Stamping and Migration Bates Stamping. Capture based stamping applies Bates Stamps during the initial capture phase of a workflow. Migration based stamping applies Bates Stamps in the Migration phase of a workflow, which is typically the final phase.

See PSIcapture Administrator Guide: Bates Stamping

 

Imaging Processing Options

Select Enable Image Processing and then specify the image processing functions to run.

NOTE: The Enable Image Processing affects ONLY images files and not OCR. This feature is intended to be used when the desired images being migrated need a particular function run on them like Despeckle when the user has scanned in color and wishes a second output stream of Black and White images.

See PSIcapture Administrator Guide: Image Processing

 

Image Tag Output

See PSIcapture Administrator Guide: Image Tag Output

 

Output Options

Remove specified page from Migration Output: Page to remove from Output

A specified page from each document will be omitted from the output. NOTE: The Remove Page from Output affects ONLY images files and not OCR. This feature is intended to be used when the desired page to be removed is NOT a Folder or Document Separator. OCR is handled in the OCR Workflow Configuration section of this manual.

Remove last page from Migration Output

The last page from each document will be omitted from the output.

Include Folder Separators in Output

If data is included on the Folder Separator which is important to the user during Quality Assurance or Index but is NOT desired to be left in the output viewed by the end user; de-selecting this option will remove the Folder Separator sheet before outputting the file.

Include Document Separators in Output

If data is included on the Document Separator which is important to the user during Quality Assurance or Index but is NOT desired to be left in the output viewed by the end user; de-selecting this option will remove the Document Separator sheet before outputting the file.

Do not output items marked with Skip flag

Items marked with the Skip flag will be omitted from the output.

NOTE: The Keep/Remove Separators defined in migrations affects ONLY images files and not OCR. OCR is handled in the OCR Workflow Configuration section of this manual.

 

Document Filtering

 

8.png

 

Document Filtering allows you to only output documents which contain a Regular Expression match within the selected index field(s). This is commonly used when creating multiple migrations within a single workflow where you want an index field to direct documents to the desired migration.
NOTE: This feature requires Advanced Data Extraction (ADE) license.
Was this article helpful?
0 out of 0 found this helpful

Comments

0 comments

Please sign in to leave a comment.