Decision memo
This repository lands at T2 Caution with a final score of 60/100. The result is driven more by boundary, workflow-support, and governance weaknesses than by classic code-pattern failures.
Research reference and supervised non-clinical technical review only.
This repository lands at T2 Caution with a final score of 60/100. The result is driven more by boundary, workflow-support, and governance weaknesses than by classic code-pattern failures.
Review after 45 days. Expires on 2026-07-05.
Change-triggered re-audit now: False
Mirror-only policy surface: selected profile metadata is shown in this report, but authoritative scan scoring still follows deterministic runtime constants. Preview-only posture changes, including Stage 4 replication emphasis, do not change the formal score until a future read-through phase.
Use stem policy simulate with a governed profile file when you want to preview a different review posture. The authoritative score path stays deterministic; the profile is surfaced as metadata and preview-only interpretation.
If you only need the default posture, you do not need a profile file at all.
{
"profile_name": "strict_clinical_adjacency",
"profile_read_mode": "mirror_only"
}stem policy simulate /path/to/repo --profile-file profile.json
Calibration Effect: mirror-only Policy metadata surfaced Formal score unchanged
Claim language, limitation posture, and clinical boundary wording.
Internal contradictions between README, workflow claims, and support surfaces.
Engineering accountability, provenance, and reviewable responsibility surfaces.
Reproducibility evidence is reported separately and does not alter the formal tier.
Because Code Integrity is a narrow detector family. The formal score is still driven mainly by Stage 1, Stage 2R, and Stage 3 evidence posture.
C4 is now reserved for executable fail-open exception behavior, C5 for unsupported compliance or boundary integrity claims, and C6 for mock-auth or no-auth trust-boundary signals.
AIRI is used here as a bounded risk-vocabulary layer around deterministic repository findings. The report uses the curated runtime bundle, not the full upstream AIRI universe.
Derived from The AI Risk Repository V4_03. Original source remains MIT-licensed and must be attributed in README, docs, runtime artifacts, and local registry metadata.
It means seven AIRI risk IDs are currently reached by active local detector mappings, out of thirty-two AIRI risk IDs in the current detector scope.
Each covered AIRI row carries a bounded explanation built from the triggered detector, the local mapping justification, and the trigger reason surfaced by the scan.
AIRI does not independently verify harm, causality, clinical failure, or legal noncompliance. It is a risk-vocabulary layer around local findings.
The report does not infer AIRI coverage from prose alone. Coverage appears when a local detector fires and a governed mapping exists in the current AIRI runtime bundle.
C6_mock_auth_or_fail_open_boundary status: detected
R2R_D5_single_external_service_dependency → 72.04.02 Market Concentration
covered_by: detector id why: bounded mapping reason
| ID | Risk | Domain | Covered by / Note |
|---|---|---|---|
| 24.01.03 | Safe exploration problem with widely deployed AI assist | Lack of capability or robustness | C5_compliance_boundary_integrity: Clinical-adjacent surfaces exis |
| 69.01.00 | False information | False or misleading information | C5_compliance_boundary_integrity: Clinical-adjacent surfaces exis |
| 65.03.03 | Reidentification | 2.1 | CC-3 catches shallow validators; dedicated reidentify() API expos |
| 70.02.02 | Misinformation — hallucination of clinical knowledge | 3.1 | CC-1 catches threshold=0.0 default; actual output-level hallucina |
| 39.25.00 | Verifiability — black-box AI in medical healthcare | 7.4 | B2 detects surface language only; Model Card / interpretability a |
| 11.02.00 | Allocative Harms — withheld resources in healthcare | 1.1 | Subgroup performance disparities require dynamic evaluation; outs |
| 72.04.02 | Market Concentration — healthcare single-point failures | 6.1 | Systemic risk beyond single-repository scope. |
| SEV | Detector | Finding | File | |
|---|---|---|---|---|
| INFO | S1_readme_bio_terms | README exposes bio/medical vocabulary. | README.md | |
| INFO | S1_readme_bio_terms | README exposes bio/medical vocabulary. | README.md | |
| INFO | S1_clinical_boundary | No evidence detected for S1_clinical_boundary. | . | |
| INFO | S1_H1_clinical_certainty_hype | No evidence detected for S1_H1_clinical_certainty_hype. | . | |
| INFO | S1_H2_regulatory_approval_hype | No evidence detected for S1_H2_regulatory_approval_hype. | . | |
| INFO | S1_H3_autonomous_replacement_hyp | No evidence detected for S1_H3_autonomous_replacement_hype. | . | |
| INFO | S1_H4_breakthrough_marketing_hyp | No evidence detected for S1_H4_breakthrough_marketing_hype. | . | |
| INFO | S1_H5_universal_generalization_h | No evidence detected for S1_H5_universal_generalization_hype. | . | |
| INFO | S1_H6_perfect_accuracy_hype | No evidence detected for S1_H6_perfect_accuracy_hype. | . | |
| INFO | S1_R1_limitations_section | No evidence detected for S1_R1_limitations_section. | . | |
| INFO | S1_R2_regulatory_framework | No evidence detected for S1_R2_regulatory_framework. | . | |
| INFO | S1_R2_weak_regulatory_self_asser | No evidence detected for S1_R2_weak_regulatory_self_assertion. | . | |
| INFO | S1_R2_unsupported_legal_or_compl | No unsupported legal or compliance claim pattern was detected. | . | |
| INFO | S1_R4_demographic_bias_boundary | Demographic, subgroup, fairness, bias, or validation-cohort language detected. | docs/SDK_DEEP_DIVE.md | |
| INFO | S1_R5_reproducibility_provisions | No evidence detected for S1_R5_reproducibility_provisions. | . | |
| INFO | S3_T1_workflow_files | Workflow file exists. | .github/workflows/skills-only.yml | |
| INFO | S3_T1_workflow_files | Workflow file exists. | .github/workflows/test.yml | |
| INFO | S3_T2_domain_tests | No evidence detected for S3_T2_domain_tests. | . | |
| INFO | S3_T3_changelog_release_hygiene | No evidence detected for S3_T3_changelog_release_hygiene. | . | |
| INFO | S3_T3_changelog_bugfix_evidence | No evidence detected for S3_T3_changelog_bugfix_evidence. | . | |
| INFO | S3_B1_dependency_manifest | Dependency or environment manifest exists. | package-lock.json | |
| INFO | S3_B1_dependency_manifest | Dependency or environment manifest exists. | package.json | |
| INFO | S3_B1_data_source_language | Data source, dataset citation, IRB, or provenance language detected. | README.md | |
| INFO | S3_B1_data_source_language | Data source, dataset citation, IRB, or provenance language detected. | README.md | |
| INFO | S3_B1_data_source_language | Data source, dataset citation, IRB, or provenance language detected. | docs/BEGINNER_GUIDE.md | |
| INFO | S3_B1_data_source_language | Data source, dataset citation, IRB, or provenance language detected. | docs/BEGINNER_GUIDE.zh-CN.md | |
| INFO | S3_B2_bias_limitations | Bias, limitation, or validation-boundary language detected. | docs/CHANNELS.md | |
| INFO | S3_B2_bias_limitations | Bias, limitation, or validation-boundary language detected. | docs/CHANNELS.md | |
| INFO | S3_B2_measurement_evidence | No evidence detected for S3_B2_measurement_evidence. | . | |
| INFO | S3_B3_coi_funding | COI, funding, sponsor, or acknowledgement language detected. | docs/CHANNELS.md | |
| INFO | S2_package_bio_terms | No evidence detected for S2_package_bio_terms. | . | |
| INFO | R2R_D5_single_external_service_d | No named required external service dependency pattern was detected. | . | |
| INFO | C6_mock_auth_or_fail_open_bounda | No mock-auth or fail-open local-boundary pattern was detected. | . | |
| INFO | C1_hardcoded_credentials | Credential-like placeholder or test/example fixture ignored for C1 penalty. | scripts/setup.sh | |
| INFO | C2_dependency_pinning | No loose dependency evidence detected. | . | |
| INFO | C3_dead_or_deprecated_patient_ad | No evidence detected for C3_dead_or_deprecated_patient_adjacent_paths. | . | |
| INFO | C4_exception_handling_clinical_a | No fail-open exception handler detected in executable Python code. | . | |
| INFO | BIO_smiles_surface_integrity | No malformed or suspicious SMILES-like strings detected by conservative surface checks. | . | |
| INFO | BIO_smiles_rdkit_validation | RDKit optional validation lane not exercised because no SMILES-like candidates were detect | . | |
| INFO | BIO_smiles_parser_guard | No missing None/invalid guards detected after SMILES parser calls. | . | |
| INFO | BIO_silent_mock_fallback | No silent mock or simulated-data fallback patterns detected in production code paths. | . | |
| INFO | BIO_trace_manifest | No traceability manifest or runtime audit-log schema surface detected. | . | |
| INFO | BIO_run_trace | No risky subprocess or os.system bio-tool execution patterns detected. | . | |
| INFO | AST_argparse_cli | argparse CLI interface detected. | container/skills/bio-tools/templates/pymol_render_ | |
| INFO | AST_argparse_cli | argparse CLI interface detected. | container/skills/bio-tools/templates/qc_summary_pl | |
| INFO | AST_argparse_cli | argparse CLI interface detected. | container/skills/bio-tools/templates/volcano_plot_ | |
| INFO | AST_argparse_cli | argparse CLI interface detected. | container/skills/sec-report/sec_pipeline.py | |
| INFO | AST_argparse_cli | argparse CLI interface detected. | container/skills/sec-report/tests/generate_test_da | |
| INFO | S4_container_environment | No evidence detected for S4_container_environment. | . | |
| INFO | S4_make_reproduce_target | No Makefile detected. | . | |
| INFO | S4_environment_lock_evidence | Environment, dependency, or lock manifest detected. | package-lock.json | |
| INFO | S4_exact_dependency_pins_or_hash | Lock manifest with exact dependency resolution detected. | package-lock.json | |
| INFO | S4_readme_reproducibility_sectio | README exists but no reproducibility or replication section heading was detected. | . | |
| INFO | S4_checksum_files | No evidence detected for S4_checksum_files. | . | |
| INFO | S4_dataset_url | Documentation exists but no dataset URL or data source URL was detected. | . | |
| INFO | S4_model_weight_url_or_checksum | Model weight, checkpoint, or model artifact URL detected. | README.md | |
| INFO | S4_model_weight_url_or_checksum | Model weight, checkpoint, or model artifact URL detected. | docs/BEGINNER_GUIDE.md | |
| INFO | S4_citation_cff | No evidence detected for S4_citation_cff. | . | |
| INFO | S4_license_restriction | License/readme/docs surfaces exist but no restriction language was detected. | . | |
| INFO | S4_cli_entrypoint | argparse CLI evidence detected by AST summary. | . | |
| INFO | S4_seed_setting | No deterministic seed setting evidence detected by AST summary. | . | |
| INFO | S4_runnable_examples | No evidence detected for S4_runnable_examples. | . |