In our tutorials we regularly present application examples, APIs and connectors that can be connected with native features of Synesty, without there being a ready-to-use Synesty add-on yet.
In this article we take a look at the REST API Magellan Risk Guard by OpenText. We show how to build an interface with Synesty to use it to automatically check documents.
Thanks to the presence of ChatGPT, AI services are on everyone's lips right now. The information management provider OpenText uses AI in many of their products.
The product Magellan Risk Guard caught our eye because it helps us with an important topic that we are constantly facing within the e-commerce and IT world: data protection and compliance
Today, the amount of information stored in enterprise systems is increasing exponentially. Unfortunately, this also increases the risks for your company and its reputation due to inappropriate or confidential content and information. A manual classification of this content is time-consuming or even impossible due to the amount of data and is also prone to misjudgments caused by human error.
OpenText Magellan Risk Guard uses machine learning to recognize content of documents. This can be used for text recognition and classification, for example to uncover harmful, sensitive and inappropriate texts, images, video and audio files in corporate content. Available as an AI product and an information risk management API service, Magellan Risk Guard enables organizations to act on detected risky content to increase compliance and improve data governance.
The Magellan Risk Guard REST Service is a stateless API that can extract information from documents using Magellan Text Mining. This can be used to e.g.:
There are various reasons why companies should or must automatically scan their data for certain content, such as:
To prevent sensitive data from being inadvertently transmitted to external services, such as ChatGPT, (e.g. if you use our ChatGPT add-on), you could have your respective requests checked by the Magellan Risk Guard API for personal or sensitive data before transmission, and only those Submit data that is considered non-critical.
The banning of the AI-based chatbot ChatGPT by Italy's data protection authority shows that this is a problem.
In the following section we show step by step how to create a connection with Synesty.
In our example use case we must detect documents containing personal data, that normally should not contain that. An email alert must then be sent to a specific person to draw attention to the document and its contents.
First of all, to use Magellan Risk Guard, an OpenText Developer test account must be created. A new tenant and a new app are created in its backend. Later we use their API keys to send requests to the Magellan Risk Guard REST API.
The Risk Guard API uses the OAuth2 ClientCredentials Authentication, which can be configured in Synesty as an HTTP account. To do this, you create an HTTP account with the following data:
OAuth 2.0
https://na-1-dev.api.opentext.com/mtm-riskguard/api/v1/process
Client Credentials
ot2_client_details_{YOUR APP NAME}_confidential.json
which was downloaded at the end of the app creation)ot2_client_details_{YOUR APP NAME}_confidential.json
which was downloaded at the end of the app creation)Token URL: https://na-1-dev.api.opentext.com/tenants/{MY TENANT ID}/oauth2/token
ot2_client_details_{YOUR APP NAME}_confidential.json
which was downloaded at the end of the app creation.Bearer
A click on "Start Configuration" then fills the access token that is needed for the API calls.
To check a document for risks by the Risk Guard, we create a Flow.
The Flow initially consists of the steps URLDownload, JSONReaderVisual and Mapper.
With the URLDownload (1) an example-document is uploaded via the API. The URL download response is a .JSON file with the found data records.
The JSONReaderVisual (2) then reads the file and extracts the data fields we need.
Categories of records found:
Every single record found:
The result is a spreadsheet of those found in the document:
With the Mapper Step, the spreadsheet can then be edited according to your own wishes.
To improve the overview, the column titles can be shortened/simplified:
With a Mappingset you can make the CartridgeIDs
(entity type of the data record) readable.
ConfidenceScore
= How confident Risk Guard is that it has assigned the data record to the correct entity type.RelevancyScore
= How important the entity or classification is to a file, e.g. B. If a "phone number" is found three times in a document, it is more relevant than a number found only once.Frequency
= How many occurrences of a single record were found?Subterm.value
= Value of extracted data record
nfinderNormalized
= Value of extracted data record
ClientNormalized
= Value of extracted data record
MainTermValue
= Value of extracted data recordSubterm.value_1
= Value of extracted data recordSubterm.value_2
= Value of extracted data recordThe raw data of the API can now be processed a little further. For this we will edit the spreadsheet in such a way (remove columns, group by CartridgeIDs) that we can see how often which records were found in our document:
We are now expanding the flow, to send an alarm e-mail if personal data is found in a document.
If no personal data was found, the flow can be stopped at this point. You can do this with the StopFlowIf step .
Otherwise it should be continued.
Now the relevant data records are categorized with several filter steps:
Filter 1: Filter - Personally Identifiable Information
filter condition:
(For data records which are only considered personal in combination with others, conditions with AND can also be created here.)
Filter 2: Filter - Personally Identifiable Financial information
filter condition:
Filter 3: Filter - Access Data
After filtering, the alarm email can now be generated and sent. To do this, we use the EmailSend step.
We choose a recipient and enter the subject.
In the Message field we write our message and list the data records found with their frequency according to our categories. The email message can be created using Freemarker scripting. It contains the results of the previous filter steps.
This is what it looks like when we click on "Step-Preview":
...and this is how the e-mail looks after the flow execution:
Due to the constantly growing amount of data and content, the risks for companies are increasing. A manual evaluation of content is time-consuming due to the amount of data and prone to misjudgments caused by human error. Suitable software tools can be used to identify and react to these risks.
With our test example, we have shown how the OpenText Magellan Risk Guard API can be connected in a short time in order to automatically analyze your own content.
If you want to test this yourself now, take a look at our flow template.
You can find out more about the possibilities of Magellan Risk Guard in this video:
Feel free to contact us if you have any further questions.