AI Classify
Published 09 December 2025
AI Classify finds sensitive data that rule-based classification misses. It samples and examines actual data from your database, not just column names and metadata. This makes it particularly effective for columns with non-informative names where rules alone can't help.
AI Classify runs a local AI model. No data leaves your environment. No data is shared with third parties, and your data is not used to train the model.
How to use AI Classify
AI Classify can be turned on in the rganonymize CLI by entering --scan-type Ai or --scan-type RulesAndAi when running classify.
The first option (data) performs identification and classification of sensitive information using AI Classify exclusively, while the second option (RulesAndAi) combines AI Classify and the approach based on classification rules and the structure of the database.
The full command-line options for rganonymize classify are documented here. The output is a classification file in the usual format, ready for use in the rganonymize mask command.
GUI
AI Classify can be turned on in the settings page. Once enabled, AI Classify will perform a data scan on all newly created database models.
How does AI Classify work
AI Classify results take precedence over built-in and custom classification rules. Your rules are still applied, but where AI Classify identifies a column differently, its classification wins.
You can override AI Classify's decisions:
- In the GUI: change the dataset from the dropdown for any column. Your choice always takes precedence.
- In the CLI: use individual overrides in your classification JSON to make changes that persist across runs.
System Requirements
When using the Test Data Manager GUI there are no prerequisites.
If using the command lines, there is one pre-requisite: please install the Microsoft Visual C++ 2019 Redistributable from: https://aka.ms/vs/17/release/vc_redist.x64.exe.
Troubleshooting AI Classify
Failed to load ML model
If you see an error message Failed to load ML model, this usually means you need to install the Microsoft Visual C++ redistributable. Please follow the instructions above under system requirements.
DXGI Factory Message
When you use AI Classify with data scanning you may see a message Failed to create DXGI factory
This message can be ignored. AI Classify will still complete successfully without any issues.
Note
- The machine learning model is hosted on your machine so no information from your database will ever leave your environment.
- The machine learning model will not be trained on your data during use.
- There is no guarantee that AI Classify data scanning or structure scanning will find all of the sensitive data within a database. False positives and false negatives are possible.