Medical eval scoreboard

SLAtech Medical: 88/100

Reproducible 200-question Med-specific eval harness. +17-point lift over generic SLAtech-Business (71/100), driven by clinical-safety guardrails, HIPAA-compliance posture, and structured patient intake. Pairs with the umbrella eval scoreboard, Med glossary, and Med FAQ.

Score breakdown by category

Category	Med-tuned	Generic	Lift
Clinical-safety guardrails: Symptom-triage queries are routed to human handoff where clinical advice would be UPL-adjacent. Generic chatbots attempt direct diagnosis (failure).	92	64	+28
Patient intake quality: Structured intake captures reason for visit, insurance, allergies, and medications. Generic chatbots dump intake into unstructured free text.	89	73	+16
HIPAA compliance posture: PHI redaction at ingest, BAA-eligible single-tenant option, per-action audit log. Generic chatbots do not ship PHI redaction.	91	58	+33
FHIR / EMR integration: Queries FHIR Patient / Appointment / Practitioner / Encounter resources. Generic chatbots can't quote EMR slot availability.	86	67	+19
Multilingual clinical (HE / RU): Generic chatbots actually score higher here due to broader auto-translate coverage. Med-specific terminology in Hebrew / Russian is a continuing investment area.	84	92	-8
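
The arithmetic behind the table can be checked with a short sketch. It assumes the headline scores (88/100 and 71/100) are the unweighted mean of the five category scores, rounded to the nearest integer. The page does not state this, but the published numbers are consistent with it. The variable and function names are illustrative only.

```python
# Category scores copied from the table above: (Med-tuned, Generic).
scores = {
    "Clinical-safety guardrails": (92, 64),
    "Patient intake quality": (89, 73),
    "HIPAA compliance posture": (91, 58),
    "FHIR / EMR integration": (86, 67),
    "Multilingual clinical (HE / RU)": (84, 92),
}


def lift(med_tuned: int, generic: int) -> int:
    """Lift is the Med-tuned score minus the generic score."""
    return med_tuned - generic


# Per-category lift, matching the "Lift" column.
lifts = {name: lift(med, gen) for name, (med, gen) in scores.items()}

# Assumption: the headline score is the unweighted mean of the categories.
med_mean = sum(med for med, _ in scores.values()) / len(scores)
generic_mean = sum(gen for _, gen in scores.values()) / len(scores)

for name, value in lifts.items():
    print(f"{name}: {value:+d}")
print(f"Med-tuned mean: {med_mean:.1f}, generic mean: {generic_mean:.1f}")
print(f"Headline lift: {round(med_mean) - round(generic_mean):+d}")
```

Under that assumption the means come out to 88.4 and 70.8, which round to the published 88 and 71 and give the +17-point headline lift.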

Competitor comparison

SLAtech Medical

88/100

BAA-eligible, FHIR-conformant, polished Hebrew RTL

Intercom Fin (generic)

73/100

Not BAA-eligible by default, English-first, no FHIR integration

Ada (mid-market enterprise)

78/100

SOC 2 Type II but weaker FHIR integration, implementation consultant required (6-12 weeks)

Tidio Lyro (generic SMB)

65/100

No HIPAA compliance, no Hebrew RTL polish, conversation cap on lower tiers

Reproduce the eval against your tenant

The eval methodology is open-source: 200 sealed Med-specific questions with LLM-as-Judge scoring on factuality, hallucination, and confidence axes.
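
As a rough illustration of how three-axis judge scores could roll up into a 0-100 result, here is a minimal sketch. Everything in it is an assumption rather than the actual harness: the 0.0-1.0 scale per axis, the equal weighting of axes, the `JudgeVerdict` type, and `stub_judge`, which is a keyword-based stand-in for a real LLM judge call.

```python
from dataclasses import dataclass
from statistics import mean
from typing import Callable, Iterable, Tuple

AXES = ("factuality", "hallucination", "confidence")


@dataclass(frozen=True)
class JudgeVerdict:
    """Hypothetical per-question verdict; each axis is 0.0-1.0, higher is better."""
    factuality: float
    hallucination: float  # assumed: 1.0 means no hallucination detected
    confidence: float


# A judge maps (question, answer) to a verdict. In a real harness this
# would wrap an LLM call; that part is out of scope for this sketch.
Judge = Callable[[str, str], JudgeVerdict]


def question_score(verdict: JudgeVerdict) -> float:
    """Assumed aggregation: equal-weight mean of the three axes."""
    return mean([getattr(verdict, axis) for axis in AXES])


def eval_score(qa_pairs: Iterable[Tuple[str, str]], judge: Judge) -> int:
    """Mean per-question score across the set, scaled to 0-100."""
    per_question = [question_score(judge(q, a)) for q, a in qa_pairs]
    return round(100 * mean(per_question))


def stub_judge(question: str, answer: str) -> JudgeVerdict:
    """Placeholder judge for demonstration only: rewards handoff answers."""
    if "clinician" in answer.lower():
        return JudgeVerdict(1.0, 1.0, 1.0)
    return JudgeVerdict(0.0, 0.0, 0.0)


sample = [
    ("I have chest pain, what is it?", "I'm connecting you to a clinician now."),
    ("I have chest pain, what is it?", "It is probably heartburn."),
]
print(eval_score(sample, stub_judge))  # prints 50
```

Keeping the judge as a plain callable means the same aggregation can run against a sealed question set with any judge implementation swapped in.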

Full eval methodology → SLAtech Medical pricing