Word Search & Redaction

    The Glasswall Embedded Engine can search the text and metadata of a file for forbidden words, and redact them, in the following file types:

    • Microsoft Binary Office
    • Office Open XML

    The search strings are configured via a policy file, where the type of action to be applied to the matching text is also defined. The supported actions are:

    • Allow - Produce an XML analysis report specifying the number of matching strings within the file and their location
    • Redact - Report matches and regenerate the input file with all matches replaced by the character specified by `replacementChar` in the policy file
    • Disallow - Report all matches and do not regenerate the input file if any are found
    • Require - Report all matches and do not regenerate the input file unless at least one match is found. This action is only available for files consisting exclusively of text.
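
The four actions can be modelled on plain text as a rough sketch. This is not the Glasswall API or policy format: `apply_action`, `Outcome`, and the lowercase action names are hypothetical, and the sketch assumes that Allow leaves the content unchanged and that Redact substitutes one `replacementChar` per matched character.

```python
import re
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Outcome:
    # (start, end) offsets of every match; all four actions report these.
    matches: List[Tuple[int, int]]
    # Regenerated text, or None when the file is not regenerated.
    regenerated: Optional[str]


def apply_action(text: str, pattern: str, action: str,
                 replacement_char: str = "*") -> Outcome:
    """Hypothetical model of the Allow/Redact/Disallow/Require semantics."""
    spans = [m.span() for m in re.finditer(pattern, text)]
    if action == "allow":
        # Report only (assumption: content passes through unchanged).
        return Outcome(spans, text)
    if action == "redact":
        # Assumption: each matched character becomes replacement_char.
        redacted = re.sub(pattern,
                          lambda m: replacement_char * len(m.group()),
                          text)
        return Outcome(spans, redacted)
    if action == "disallow":
        # Any match blocks regeneration.
        return Outcome(spans, None if spans else text)
    if action == "require":
        # Regenerate only when at least one match exists.
        return Outcome(spans, text if spans else None)
    raise ValueError(f"unknown action: {action}")
```

For example, `apply_action("the secret plan", r"secret", "redact")` reports one match at offsets 4 to 10 and regenerates the text with that span masked, while the same input under `"disallow"` yields no regenerated text.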

    Matching by regular expression or by literal text string in the configuration lets you detect text at various scopes, e.g. words, sentences or paragraphs.
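
To illustrate the difference in scope, the sketch below uses Python's standard `re` module rather than the Glasswall engine. The sample text and patterns are made up for illustration, and the engine's own regular expression dialect may differ.

```python
import re

text = ("Project Falcon launches in May. "
        "The budget is confidential. "
        "Version 1.5 ships later.")

# Literal string: re.escape neutralises metacharacters, so the dot in
# "1.5" matches only a real dot and not any character.
literal = re.compile(re.escape("1.5"))

# Word scope: \b anchors restrict the match to the whole word.
word = re.compile(r"\bFalcon\b")

# Sentence scope: everything between sentence terminators that
# contains the target word, including the closing terminator.
sentence = re.compile(r"[^.!?]*\bconfidential\b[^.!?]*[.!?]")

literal_hits = literal.findall(text)
word_hits = word.findall(text)
sentence_hit = sentence.search(text).group().strip()
```

Here the literal pattern finds only `1.5`, the word pattern finds `Falcon`, and the sentence pattern captures the whole sentence "The budget is confidential."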

    The APIs for Word Search support string, character-based and regular-expression matching. A full list of the Word Search API functions can be found in Word Search Library.
