GDPRScanner

Author	SHA1	Message	Date
StyxX65	2c5f5d3283	Add OCR language override setting Operators can now choose Tesseract language pack(s) per profile via a sidebar select (#optOcrLang) and profile editor (#peOptOcrLang). Presets: dan+eng (default), dan, eng, dan+eng+deu, dan+eng+swe, dan+eng+fra. The ocr_lang option flows from the UI through all three scan engines (M365 files/attachments, Google Drive, Gmail) down to document_scanner.scan_pdf and scan_image — including the spawned PDF-OCR subprocess worker. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-28 09:59:40 +02:00
StyxX65	23b9555dcf	Built-in file redaction for local files	2026-05-27 14:49:06 +02:00
StyxX65	8b55e9d933	Extended the M365 checkpoint/resume mechanism to all three scan engines. Each engine writes its own file (`checkpoint_m365.json`, `checkpoint_google.json`, `checkpoint_file_{source_id}.json`) every 25 items.	2026-04-25 20:30:59 +02:00
StyxX65	d42518dc81	Added tests for Video & Audio feat: video/audio metadata scanning, profile rename fix, route tests - Scan .mp4/.mov/.avi/.mkv and .mp3/.flac/.ogg/.m4a/.wma (+ 7 more) for GPS coordinates, artist/author, title, comment — metadata only, no frame or audio analysis. Uses mutagen (added to requirements.txt). GPS-tagged phone recordings now flag with gps_location like photos. - Fix _extract_audio_metadata silently returning empty results: mutagen.File() first positional arg is `filename`, not `fileobj` — was passing BytesIO as the filename. Fixed to keyword args. - Fix profile copy rename not reflected in left column until modal reopen: _pmgmtSaveFullEdit called loadProfiles() but never _renderProfileMgmt(). Added re-render and active-row highlight. - Add TestProfileRoutes (10 tests) covering all profile API endpoints including a rename regression test. Total: 182 tests. - generate_fixtures.py now produces 6 audio/video fixtures (14–19): 2 MP3, 2 FLAC, 2 MP4 — 4 flagged, 2 negative cases.	2026-04-21 21:26:58 +02:00
StyxX65	2a2d79de90	Added testing of Profile	2026-04-21 20:51:37 +02:00
StyxX65	c350014b16	fix: scan button stuck, CPR dedup crash, role scope filter, profile race conditions; add auto-email toggle and route integration tests	2026-04-21 18:43:25 +02:00
StyxX65	7c1afca80b	Bugfixes fix: select mode onclick exports, multi-source progress counter, OCR page-by-page	2026-04-21 13:12:54 +02:00
Henrik Højmark	9c7df76fbd	Initial commit	2026-04-11 04:38:11 +02:00

8 Commits