Overview
Duplicate detection for an external file source can be enabled in M-Files Admin under: Configurations → Advanced Vault Settings → Connection to external sources → File Sources → Duplicate Detection.
However, by design, duplicate detection does not work in two cases:
- The file source uses OCR. The resulting documents always differ at the binary level, because OCR embeds hidden metadata, including the date.
- The file name has changed since the document was imported to M-Files. By default, the duplicate check for a file source is done against the file name.
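The first case can be illustrated with a minimal Python sketch. It is not M-Files code: the byte strings are hypothetical stand-ins for two OCR outputs of the same scan, and SHA-256 is only an assumed way to compare files at the binary level. It shows that identical page content with a different embedded date no longer compares as equal.

```python
import hashlib


def content_digest(data: bytes) -> str:
    """Return a SHA-256 digest of the raw file bytes (assumed comparison method)."""
    return hashlib.sha256(data).hexdigest()


# Hypothetical stand-ins: the same scanned page, processed by OCR on two days.
page = b"...scanned page image bytes..."
ocr_run_1 = page + b"/ModDate (D:20240101120000)"
ocr_run_2 = page + b"/ModDate (D:20240102093000)"

# The visible content is the same, but the embedded date differs,
# so a byte-level comparison treats the two files as different documents.
same_at_binary_level = content_digest(ocr_run_1) == content_digest(ocr_run_2)
print(same_at_binary_level)  # False
```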
Solution
- You can check from the Connection properties whether the vault uses OCR to enable full-text search of scanned documents. For more information, see our user guide: https://userguide.m-files.com/user-guide/latest/eng/Searchable_PDF.html
- You can set Use Original File Path (see screenshot) to "Yes" to detect duplicate files by their original file path instead of the current file name. This way, renaming imported files in M-Files does not prevent duplicate detection.
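The effect of the Use Original File Path setting can be sketched in Python as follows. This is an illustration only, not the M-Files implementation: `ImportedDoc`, `is_duplicate`, and the example paths are hypothetical names chosen for this sketch. It compares an incoming file either against the current file name or against the path recorded at import time.

```python
from dataclasses import dataclass
from pathlib import PurePosixPath


@dataclass
class ImportedDoc:
    """Hypothetical record of a document already imported to the vault."""
    current_name: str    # name in the vault, may have been changed by a user
    original_path: str   # path in the file source at import time


def is_duplicate(incoming_path: str, vault: list[ImportedDoc],
                 use_original_path: bool) -> bool:
    """Check an incoming file against imported documents (illustrative logic)."""
    if use_original_path:
        return any(doc.original_path == incoming_path for doc in vault)
    incoming_name = PurePosixPath(incoming_path).name
    return any(doc.current_name == incoming_name for doc in vault)


# A document that was renamed in the vault after import.
vault = [ImportedDoc(current_name="Invoice 2024 (approved).pdf",
                     original_path="/scans/invoice_001.pdf")]

# Name-based check misses the duplicate because the name changed.
print(is_duplicate("/scans/invoice_001.pdf", vault, use_original_path=False))  # False
# Path-based check still finds it.
print(is_duplicate("/scans/invoice_001.pdf", vault, use_original_path=True))   # True
```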
