Use cases and risk areas to investigate¶
1. Areas to explore for privacy risks¶
- Personal/sensitive data storage
- Egress to openclaw orchestrator and models (perplexity, anthropic, others) (Prio 1)
- Export of reports to storage
- Role base access controls (Prio 2)
- Telemetry: evaluation user workflow tracking (Prio 3)
- Structured logging: coding workflow tracking (Prio 3)
- Secrets, tokens, and keys (Prio 4)
- CI
2. General application workflow¶
- User interacts with application
- User logs in
- User uploads document
- Document is persisted in DB
- Document is analyzed by AI
- AI suggests risk templates
- AI suggests document analysis
- AI suggests human review
- Human review process HITL (Right now always the same user, because it's connected to the login. May change in the future.)
- Store reviewed document
- Store reports
- Export reports to recipients
3. General software processes and services¶
- Access controls (RBAC)
- Authentication
- Authorization
- Logging
- Telemetry
- Secrets, keys, and tokens
- CI/CD
- Docker containerization
4. Data storage¶
Database tables for application (ORM: src/doc_quality/models/orm.py)¶
app_usersuser_sessionspassword_recovery_tokensskill_documentshitl_reviewsbridge_human_reviewsaudit_eventsaudit_schedulesskill_findingsquality_observationsstakeholder_employee_assignmentsrisk_templatesdocument_locks
Auth-related audit payloads (src/doc_quality/api/routes/auth.py)¶
- Logins
- password recovery
- cookies
- session tokens
Structured logging¶
- Application logs
- errors
- requests
- status events
- Spans for tracing
- Document skills output persisted
- Access logs
- AI services: document excerpts, errors, events, and logs
5. AI orchestrator and models — Anthropic and Perplexity¶
OpenClaw¶
- Orchestrate messages to AI agents
Perplexity¶
- Requests
- Prompts
- Responses
- Metadata
- Payloads
Anthropic¶
- Requests
- Prompts
- Responses
- Metadata
- Payloads
Agents: - Document analysis agent (doc_check_agent) - Compliance agent (compliance_agent)
Risk mitigation: - in example template ->>> to be removed
6. Exports, remote uploads, records retention¶
Export registry¶
- local downloads
- remote exports
Records retention¶
Who has access to what? How long?
- local
- offsite
Risk mitigation: - RBAC - admin account can define roles and access - storage is forever right now - admin can delete users and records
IDE / MCP / Copilot¶
What is used
- VS Code & GitHub Copilot
7. Access controls (individuals / org)¶
- RBAC
- Authenticated users and HITL reviewers
- Audit logs
Document and Human in the Loop (HITL)¶
- Document access
- HITL review access and actions
Secrets, tokens, and keys (src/doc_quality/core/config.py)¶
- Secret key for application
- DB access
- Anthropic and Perplexity API keys
- Auth logins
- Tokens
- Cookies
8. Logging, configuration, telemetry, CI, Docker¶
Logging¶
Structured logging and information collected.
Telemetry and metrics¶
- Traces and spans stored
- Data collection by Prometheus
- Metrics exposed to endpoints
Docker¶
Container logs and info to stdoout or stderr
CI¶
- action artifacts
- testing artifacts, like Playwright screenshots
9. Documentation and dev data¶
- Snippets of real data
- Mocked data for testing