Problem: Duplicate "Level of description" terms created during CSV imports
Current Behavior
Steps to reproduce the behavior
- Import a CSV with "Level of description" values that differ in capitalization/spelling from the canonical terms (e.g. "Unidad Documental Simple" instead of "Unidad documental simple").
- After import, check the "Levels of description" taxonomy (id=34 in my DB).
- Duplicate terms are created:
  - Unidad Documental Simple (new, ID 5027)
  - Unidad Documental Compuesta (new, ID 5023)
Some descriptions (143 in my case) were assigned to the duplicate terms instead of the canonical ones (ID 241 = File, ID 242 = Item).
Expected Behavior
During CSV import, AtoM should match the provided values against the existing terms in the taxonomy (case-insensitive) and reuse them, instead of creating new duplicates for minor differences in capitalization or spelling.
Possible Solution
- Normalize term matching during import (ignore case, possibly accent-insensitive).
- Provide a clear warning in logs or in the UI when an incoming value does not match any existing term.
- Alternatively, allow admins to enforce "strict matching only" to avoid silent creation of new terms.
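
To make the suggestions above concrete, here is a minimal sketch of the matching logic in Python. It is not AtoM's actual import code (AtoM is written in PHP), and the names `normalize_term`, `build_index`, and `match_term` are hypothetical. The pairing of the Spanish term names with IDs 241 and 242 is also an assumption made for the example.

```python
import unicodedata


def normalize_term(name: str, strip_accents: bool = True) -> str:
    """Build a comparison key: collapse whitespace, fold case, optionally drop accents."""
    key = " ".join(name.split()).casefold()
    if strip_accents:
        # Decompose accented characters, then drop the combining marks.
        decomposed = unicodedata.normalize("NFKD", key)
        key = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return key


def build_index(existing_terms: dict[int, str]) -> dict[str, int]:
    """Map each normalized term name to its ID (the first ID seen wins)."""
    index: dict[str, int] = {}
    for term_id, name in existing_terms.items():
        index.setdefault(normalize_term(name), term_id)
    return index


def match_term(value: str, index: dict[str, int], strict: bool = False):
    """Return (term_id, warning). With strict=True, an unmatched value raises
    instead of being left for the caller to create as a new term."""
    term_id = index.get(normalize_term(value))
    if term_id is not None:
        return term_id, None
    warning = f'No existing term matches "{value}"'
    if strict:
        raise ValueError(warning)
    return None, warning


# Hypothetical taxonomy contents; the name-to-ID pairing is assumed.
existing = {241: "Unidad documental compuesta", 242: "Unidad documental simple"}
index = build_index(existing)

term_id, warning = match_term("Unidad Documental Simple", index)
unmatched_id, unmatched_warning = match_term("Expediente", index)
```

With this approach, "Unidad Documental Simple" resolves to the existing term rather than a new one. An unmatched value such as "Expediente" produces a warning that the importer could log or show in the UI, or an error when strict matching is enabled.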
Context and Notes
No response
Version used
AtoM 2.9
Operating System and version
Ubuntu 22.04
Default installation culture
es
PHP version
8.3