Test Data: Pattern Matching Bundling Strategy¶
Purpose¶
This demonstrates advanced regex pattern matching to extract location-based identifiers from complex filenames.
Processing Mode Configuration¶
- Batch Entity: root
- Bundle Congruent Patterns: Yes (checked)
- Bundling Strategy: pattern
- Bundling Pattern:
^([A-Za-z]+)_(extracts location prefix before underscore) - Primary Source Extension: .obj
File Extensions to Select¶
- .obj
- .mtl
OBJ File Options¶
- Add MTL Files: Yes (checked)
- Add Texture Files: No (unchecked)
Expected Behavior¶
When scanned: - 2 Zenodo records will be created (Athens and Rome) - The regex pattern captures the location name before the first underscore - Multiple sections from the same location are grouped together
Bundle 1 (Athens):¶
- Athens_Temple_Section_A.obj (source)
- Athens_Temple_Section_B.obj (source)
- Athens_Temple_materials.mtl (primary)
Bundle 2 (Rome):¶
- Rome_Forum_Column_01.obj (source)
- Rome_Forum_Column_02.obj (source)
- Rome_Forum_materials.mtl (primary)
File Count¶
- Total files: 6
- Primary sources: 4 (.obj files)
- Dependencies: 2 (.mtl files)
Pattern Explanation¶
- Pattern:
^([A-Za-z]+)_ ^- Start of filename([A-Za-z]+)- Capture group: one or more letters_- Literal underscore character- Result: Extracts "Athens" from "Athens_Temple_Section_A.obj"
Alternative Patterns to Try¶
model_(\d+)- Extracts numeric IDs (model_001 → "001")([A-Z]+)_- Extracts uppercase prefixes onlysite(\d+)- Extracts site numbers (site042 → "042")