STEM BIO-AI Local Audit
Facebook LinkedIn Email
Link copied ✓
48/ 100
STEM BIO-AI Local Audit  |  2026-05-18

yorkeccak/bio

T1 QuarantineDeterministic local scanNo LLM / no network / no runtime execution

Exploratory review only; no patient-adjacent use.

Executive Summary

48
Final Score
75
S1 Intent
40
S2 Repo
25
S3 Code/Bio
30
S4 Replication
TL;DR

Decision memo

This repository lands at T1 Quarantine with a final score of 48/100. The result is driven more by boundary, workflow-support, and governance weaknesses than by classic code-pattern failures.

Policy: defaultStatus: authoritative_releaseMode: mirror_only
Primary Risks

What pushed the review down

  • Clinical-adjacent surfaces exist without an explicit non-diagnostic/non-clinical boundary.
  • Self-asserted compliance or privacy-governance claim requires independent verification.
  • Legal, privacy, or compliance claim appears without supporting governance or security-grounding evidence in reviewed repository sources.
  • Core workflow appears materially dependent on named external service providers; local or self-host claims may overstate operational independence.
  • C2_dependency_pinning: WARN
Positive Evidence

What still supports reviewability

  • Package metadata was available for repo-local consistency checks.
Freshness

When to re-check

Review after 45 days. Expires on 2026-07-02.

Change-triggered re-audit now: False

Stage 2R Focus

Repo-local contradictions

  • R2R_D2_missing_clinical_use_boundary-20
    R2R_D2_missing_clinical_use_boundaryclinical_adjacent=True and explicit non-clinical boundary was not detected
  • R2R_D4_unsupported_workflow_claim-15
    R2R_D4_unsupported_workflow_claimworkflow/demo/CLI claims detected while workflow, tests, or local support entrypoints are absent
  • R2R_1_readme_package_code_alignment+15
    R2R_1_readme_package_code_alignmentshared bio-domain terms detected across README and package metadata
Stage 3 Focus

Accountability surfaces

  • B1_data_provenance_controls+15
    S3_B1_dependency_manifestdependency or lock manifest presence plus data-source, dataset, or IRB language review
  • B3_coi_funding+5
    S3_B3_coi_fundingCOI/funding/sponsor language review across README, docs, FUNDING, CITATION, and AUTHORS surfaces
Policy Boundary

How to read this artifact

Mirror-only policy surface: selected profile metadata is shown in this report, but authoritative scan scoring still follows deterministic runtime constants. Preview-only posture changes, including Stage 4 replication emphasis, do not change the formal score until a future read-through phase.

Decision Path ?

Final = 0.4 × S1 + 0.2 × S2R + 0.4 × S3 − C1_penalty  |  Stage 4 remains a separate replication lane.
Configured, Not Rewritten

Changing review posture does not require touching the score core

Use stem policy simulate with a governed profile file when you want to preview a different review posture. The authoritative score path stays deterministic; the profile is surfaced as metadata and preview-only interpretation.

If you only need the default posture, you do not need a profile file at all.

profile.json
{
  "profile_name": "strict_clinical_adjacency",
  "profile_read_mode": "mirror_only"
}
command
stem policy simulate /path/to/repo --profile-file profile.json
artifact note
Calibration Effect: mirror-only
Policy metadata surfaced
Formal score unchanged
Stage 1 — README Intent ?
75

Claim language, limitation posture, and clinical boundary wording.

  • R3_clinical_disclaimer-5
    CA-INDIRECT surface lacks explicit non-clinical or non-diagnostic boundary.
  • S1_domain_readme+10
    README exposes bio/medical domain vocabulary.
  • S1_domain_package+5
    Package metadata exposes bio/medical domain vocabulary.
  • R2_regulatory_framework+5
    Self-asserted privacy/compliance language detected without stronger regulatory-framework evidence.
Stage 2R — Repo Consistency ?
40

Internal contradictions between README, workflow claims, and support surfaces.

  • R2R_D2_missing_clinical_use_boundary-20
    R2R_D2_missing_clinical_use_boundaryclinical_adjacent=True and explicit non-clinical boundary was not detected
  • R2R_D4_unsupported_workflow_claim-15
    R2R_D4_unsupported_workflow_claimworkflow/demo/CLI claims detected while workflow, tests, or local support entrypoints are absent
  • R2R_1_readme_package_code_alignment+15
    R2R_1_readme_package_code_alignmentshared bio-domain terms detected across README and package metadata
Stage 3 — Code / Bio Responsibility ?
25

Engineering accountability, provenance, and reviewable responsibility surfaces.

  • B1_data_provenance_controls+15
    S3_B1_dependency_manifestdependency or lock manifest presence plus data-source, dataset, or IRB language review
  • B3_coi_funding+5
    S3_B3_coi_fundingCOI/funding/sponsor language review across README, docs, FUNDING, CITATION, and AUTHORS surfaces
Stage 4 — Replication ?
30

Reproducibility evidence is reported separately and does not alter the formal tier.

  • S4_container_environment+10
    Container or compose file exists.
  • S4_environment_lock_evidence+10
    Environment, dependency, or lock manifest detected.
  • S4_exact_dependency_pins_or_hashes+10
    Exact dependency pin or hash evidence detected.

Code Integrity & Contract ?

Warnings First

Mapped risk lanes that fired

Clear Lanes

What stayed quiet in the current rule scope

Why can Code Integrity contain PASS while the overall score is still low?

Because Code Integrity is a narrow detector family. The formal score is still driven mainly by Stage 1, Stage 2R, and Stage 3 evidence posture.

What changed in the C4 / C5 / C6 split?

C4 is now reserved for executable fail-open exception behavior, C5 for unsupported compliance or boundary integrity claims, and C6 for mock-auth or no-auth trust-boundary signals.

MIT AI Risk Repository Coverage ?V4_03 | airisk.mit.edu

Feature Explainer

What this section is doing

AIRI is used here as a bounded risk-vocabulary layer around deterministic repository findings. The report uses the curated runtime bundle, not the full upstream AIRI universe.

21%
7 / 32 risks in detector scope
Bundle scope: curated_medical_clinical_subset
Snapshot: 2026-04-23 | License: MIT

Derived from The AI Risk Repository V4_03. Original source remains MIT-licensed and must be attributed in README, docs, runtime artifacts, and local registry metadata.

What does 7 / 32 mean?

It means seven AIRI risk IDs are currently reached by active local detector mappings, out of thirty-two AIRI risk IDs in the current detector scope.

What does “why mapped” mean?

Each covered AIRI row carries a bounded explanation built from the triggered detector, the local mapping justification, and the trigger reason surfaced by the scan.

What does AIRI not prove here?

AIRI does not independently verify harm, causality, clinical failure, or legal noncompliance. It is a risk-vocabulary layer around local findings.

Mapped, Not Guessed

AIRI rows light up through active detector mappings

The report does not infer AIRI coverage from prose alone. Coverage appears when a local detector fires and a governed mapping exists in the current AIRI runtime bundle.

trigger
C6_mock_auth_or_fail_open_boundary
status: detected
mapping
R2R_D5_single_external_service_dependency
→ 72.04.02 Market Concentration
report surface
covered_by: detector id
why: bounded mapping reason
Coverage Explorer

Covered and gap rows

AllAll Domains7/5
1Discrimination & Toxicity0/1
2Privacy & Security1/1
3Misinformation1/1
4Malicious Actors & Misuse0/0
5Human-Computer Interaction1/0
6Socioeconomic & Environmental1/1
7AI System Safety, Failures & Limitations3/1
Click a domain card to filter. Counts are shown as covered / gaps.
IDRiskDomainCovered by / Note
24.01.03Safe exploration problem with widely deployed AI assistLack of capability or robustnessC5_compliance_boundary_integrity: Unsupported legal/compliance cl
24.04.01Physical and Psychological HarmsOverreliance and unsafe useC2_dependency_pinning: External operational dependency signal sur
33.01.05Privacy and securityCompromise of privacy / PII leakC2_dependency_pinning: External operational dependency signal sur
39.25.00VerifiabilityLack of transparency or interpreS1_R2_unsupported_legal_or_compliance_claim: Legal, privacy, or c
60.02.01Reliability issuesLack of capability or robustnessR2R_D5_single_external_service_dependency: Required external serv
69.01.00False informationFalse or misleading informationC5_compliance_boundary_integrity: Unsupported legal/compliance cl
72.04.02Market Concentration and Infrastructure Dependencies:R2R_D5_single_external_service_dependency: Required external serv
65.03.03Reidentification2.1CC-3 catches shallow validators; dedicated reidentify() API expos
70.02.02Misinformation — hallucination of clinical knowledge3.1CC-1 catches threshold=0.0 default; actual output-level hallucina
39.25.00Verifiability — black-box AI in medical healthcare7.4B2 detects surface language only; Model Card / interpretability a
11.02.00Allocative Harms — withheld resources in healthcare1.1Subgroup performance disparities require dynamic evaluation; outs
72.04.02Market Concentration — healthcare single-point failures6.1Systemic risk beyond single-repository scope.

Evidence Detail ?

All (103) FAIL WARN PASS INFO
SEVDetectorFindingFile
INFOS1_readme_bio_termsREADME exposes bio/medical vocabulary.README.md
INFOS1_readme_bio_termsREADME exposes bio/medical vocabulary.README.md
INFOS1_readme_bio_termsREADME exposes bio/medical vocabulary.README.md
INFOS1_readme_bio_termsREADME exposes bio/medical vocabulary.README.md
INFOS1_readme_bio_termsREADME exposes bio/medical vocabulary.README.md
INFOS1_readme_bio_termsREADME exposes bio/medical vocabulary.README.md
INFOS1_readme_bio_termsREADME exposes bio/medical vocabulary.README.md
INFOS1_readme_bio_termsREADME exposes bio/medical vocabulary.README.md
INFOS1_readme_bio_termsREADME exposes bio/medical vocabulary.README.md
INFOS1_readme_bio_termsREADME exposes bio/medical vocabulary.README.md
INFOS1_readme_bio_termsREADME exposes bio/medical vocabulary.README.md
INFOS1_readme_bio_termsREADME exposes bio/medical vocabulary.README.md
INFOS1_readme_bio_termsREADME exposes bio/medical vocabulary.README.md
INFOS1_readme_bio_termsREADME exposes bio/medical vocabulary.README.md
INFOS1_readme_bio_termsREADME exposes bio/medical vocabulary.README.md
INFOS1_readme_bio_termsREADME exposes bio/medical vocabulary.README.md
INFOS1_readme_bio_termsREADME exposes bio/medical vocabulary.README.md
INFOS1_readme_bio_termsREADME exposes bio/medical vocabulary.README.md
INFOS1_readme_bio_termsREADME exposes bio/medical vocabulary.README.md
INFOS1_readme_bio_termsREADME exposes bio/medical vocabulary.README.md
INFOS1_readme_bio_termsREADME exposes bio/medical vocabulary.README.md
INFOS1_readme_bio_termsREADME exposes bio/medical vocabulary.README.md
INFOS1_readme_bio_termsREADME exposes bio/medical vocabulary.README.md
INFOS1_readme_bio_termsREADME exposes bio/medical vocabulary.README.md
INFOS1_readme_bio_termsREADME exposes bio/medical vocabulary.README.md
INFOS1_readme_bio_termsREADME exposes bio/medical vocabulary.README.md
INFOS1_readme_bio_termsREADME exposes bio/medical vocabulary.README.md
INFOS1_readme_bio_termsREADME exposes bio/medical vocabulary.README.md
INFOS1_readme_bio_termsREADME exposes bio/medical vocabulary.README.md
INFOS1_readme_bio_termsREADME exposes bio/medical vocabulary.README.md
INFOS1_readme_bio_termsREADME exposes bio/medical vocabulary.README.md
INFOS1_readme_bio_termsREADME exposes bio/medical vocabulary.README.md
INFOS1_readme_bio_termsREADME exposes bio/medical vocabulary.README.md
INFOS1_readme_bio_termsREADME exposes bio/medical vocabulary.README.md
INFOS1_readme_bio_termsREADME exposes bio/medical vocabulary.README.md
INFOS1_readme_bio_termsREADME exposes bio/medical vocabulary.README.md
INFOS1_readme_bio_termsREADME exposes bio/medical vocabulary.README.md
INFOS1_clinical_boundaryNo evidence detected for S1_clinical_boundary..
INFOS1_H1_clinical_certainty_hypeNo evidence detected for S1_H1_clinical_certainty_hype..
INFOS1_H2_regulatory_approval_hypeNo evidence detected for S1_H2_regulatory_approval_hype..
INFOS1_H3_autonomous_replacement_hypNo evidence detected for S1_H3_autonomous_replacement_hype..
INFOS1_H4_breakthrough_marketing_hypNo evidence detected for S1_H4_breakthrough_marketing_hype..
INFOS1_H5_universal_generalization_hNo evidence detected for S1_H5_universal_generalization_hype..
INFOS1_H6_perfect_accuracy_hypeNo evidence detected for S1_H6_perfect_accuracy_hype..
INFOS1_R1_limitations_sectionNo evidence detected for S1_R1_limitations_section..
INFOS1_R2_regulatory_frameworkNo evidence detected for S1_R2_regulatory_framework..
INFOS1_R2_weak_regulatory_self_asserSelf-asserted privacy/compliance language detected without stronger regulatory-framework eREADME.md
WARNS1_R2_unsupported_legal_or_complLegal, privacy, or compliance claim detected without supporting governance or security-groREADME.md
INFOS1_R4_demographic_bias_boundaryNo evidence detected for S1_R4_demographic_bias_boundary..
INFOS1_R5_reproducibility_provisionsNo evidence detected for S1_R5_reproducibility_provisions..
INFOS3_T1_workflow_filesNo evidence detected for S3_T1_workflow_files..
INFOS3_T2_domain_testsNo evidence detected for S3_T2_domain_tests..
INFOS3_T3_changelog_release_hygieneNo evidence detected for S3_T3_changelog_release_hygiene..
INFOS3_T3_changelog_bugfix_evidenceNo evidence detected for S3_T3_changelog_bugfix_evidence..
INFOS3_B1_dependency_manifestDependency or environment manifest exists.package-lock.json
INFOS3_B1_dependency_manifestDependency or environment manifest exists.package.json
INFOS3_B1_dependency_manifestDependency or environment manifest exists.pnpm-lock.yaml
INFOS3_B1_data_source_languageData source, dataset citation, IRB, or provenance language detected.README.md
INFOS3_B1_data_source_languageData source, dataset citation, IRB, or provenance language detected.README.md
INFOS3_B2_bias_limitationsNo evidence detected for S3_B2_bias_limitations..
INFOS3_B2_measurement_evidenceNo evidence detected for S3_B2_measurement_evidence..
INFOS3_B3_coi_fundingCOI, funding, sponsor, or acknowledgement language detected.README.md
INFOS2_package_bio_termsPackage metadata exposes bio/medical vocabulary.package.json
INFOS2_package_bio_termsPackage metadata exposes bio/medical vocabulary.package.json
WARNR2R_D5_single_external_service_dRequired external service API key detected for a named workflow dependency..env.example
WARNR2R_D5_single_external_service_dRequired external service API key detected for a named workflow dependency..env.example
WARNR2R_D5_single_external_service_dNamed external service provider is presented as part of the core repository workflow.README.md
WARNR2R_D5_single_external_service_dNamed external service provider is presented as part of the core repository workflow.README.md
WARNR2R_D5_single_external_service_dNamed external service provider is presented as part of the core repository workflow.README.md
WARNR2R_D5_single_external_service_dLocal or self-host claims coexist with required external service dependencies.README.md
WARNR2R_D5_single_external_service_dLocal or self-host claims coexist with required external service dependencies.README.md
WARNR2R_D5_single_external_service_dLocal or self-host claims coexist with required external service dependencies.README.md
WARNC6_mock_auth_or_fail_open_boundaMock-auth, auto-login, or no-auth local/self-host boundary surfaced in reviewed sources.README.md
WARNC6_mock_auth_or_fail_open_boundaMock-auth, auto-login, or no-auth local/self-host boundary surfaced in reviewed sources.README.md
WARNC6_mock_auth_or_fail_open_boundaMock-auth, auto-login, or no-auth local/self-host boundary surfaced in reviewed sources.README.md
WARNC6_mock_auth_or_fail_open_boundaMock-auth, auto-login, or no-auth local/self-host boundary surfaced in reviewed sources.README.md
WARNC6_mock_auth_or_fail_open_boundaLocal or self-host claims coexist with mock-auth or auto-login boundary signals..env.example
WARNC6_mock_auth_or_fail_open_boundaLocal or self-host claims coexist with mock-auth or auto-login boundary signals..env.example
INFOC1_hardcoded_credentialsNo hardcoded credential evidence detected for C1_hardcoded_credentials..
INFOC2_dependency_pinningNo loose dependency evidence detected..
INFOC3_dead_or_deprecated_patient_adNo evidence detected for C3_dead_or_deprecated_patient_adjacent_paths..
INFOC4_exception_handling_clinical_aNo fail-open exception handler detected in executable Python code..
INFOBIO_smiles_surface_integrityNo malformed or suspicious SMILES-like strings detected by conservative surface checks..
INFOBIO_smiles_rdkit_validationRDKit optional validation lane not exercised because no SMILES-like candidates were detect.
INFOBIO_smiles_parser_guardNo missing None/invalid guards detected after SMILES parser calls..
INFOBIO_silent_mock_fallbackNo silent mock or simulated-data fallback patterns detected in production code paths..
INFOBIO_trace_manifestNo traceability manifest or runtime audit-log schema surface detected..
INFOBIO_run_traceNo risky subprocess or os.system bio-tool execution patterns detected..
INFOS4_container_environmentContainer or compose file exists.Dockerfile
INFOS4_make_reproduce_targetNo Makefile detected..
INFOS4_environment_lock_evidenceEnvironment, dependency, or lock manifest detected.package-lock.json
INFOS4_environment_lock_evidenceEnvironment, dependency, or lock manifest detected.pnpm-lock.yaml
INFOS4_exact_dependency_pins_or_hashLock manifest with exact dependency resolution detected.package-lock.json
INFOS4_exact_dependency_pins_or_hashLock manifest with exact dependency resolution detected.pnpm-lock.yaml
INFOS4_readme_reproducibility_sectioREADME exists but no reproducibility or replication section heading was detected..
INFOS4_checksum_filesNo evidence detected for S4_checksum_files..
INFOS4_dataset_urlDocumentation exists but no dataset URL or data source URL was detected..
INFOS4_model_weight_url_or_checksumDocumentation exists but no model artifact URL/checksum evidence was detected..
INFOS4_citation_cffNo evidence detected for S4_citation_cff..
INFOS4_license_restrictionLicense/readme/docs surfaces exist but no restriction language was detected..
INFOS4_cli_entrypointNo package metadata or Python AST surface detected..
INFOS4_seed_settingNo deterministic seed setting evidence detected by AST summary..
INFOS4_runnable_examplesNo evidence detected for S4_runnable_examples..