PhenoCDS guides any frontline doctor to see the unseen — turning missed clinical clues into early rare disease diagnoses before irreversible harm occurs. Localized NLP, adaptive disease prevalence mapping, and FHIR-native output, all within a HIPAA-compliant web platform.
The rare disease crisis is not a lack of medical knowledge — it's a structural failure to translate clinical observation into standardized, actionable data at the point of care.
Subtle phenotypic features are invisible under high-volume clinical pressure. Non-genetics physicians lack the tools to connect scattered clues — leaving patients in a 5–7 year diagnostic limbo before irreversible harm sets in.
RECOGNITION GAPClinical notes worldwide are written in local languages, mixed terminologies, and informal shorthand (e.g. eye space wide, DMS delay). Existing tools, trained on English-only corpora, fail entirely in non-English clinical environments.
Current tools output cold diagnostic codes and stop there. Patients receive a diagnosis with no bridge to local patient organizations, orphan drug eligibility, social welfare resources, or research enrollment opportunities.
RESOURCE GAPDesigned as a secure, web-based platform with HIPAA-compliant data handling. All patient phenotype data is processed without transmission of personally identifiable information — zero PHI leaves your environment.
PhenoCDS is not a static tool — it is a learning platform. Every clinical interaction strengthens three interlocking flywheels that competitors cannot replicate without starting from zero.
The system continuously tracks which HPO mappings physicians actually select for each local clinical expression. High-frequency selections are automatically promoted in the ranking — making the mapping dictionary increasingly accurate for regional clinical language patterns over time. What takes a competitor years to build, we accumulate through daily clinical use.
LANGUAGE FLYWHEELDisease recommendation rankings are dynamically weighted by local population genetics and real-world diagnosis outcomes. As confirmed diagnoses accumulate, the engine automatically recalibrates prior probabilities to reflect true regional prevalence — not just published literature. A rare disease rare in Europe may be far more common in East Asian populations, and our engine learns this.
PREVALENCE FLYWHEELBy analyzing which phenotypes appear in confirmed diagnoses but were absent from initial clinical notes, PhenoCDS identifies the specific clues that physicians in each region systematically miss. These blind spots are then surfaced as prioritized next-best-question prompts — turning population-level insight into individual clinical guidance.
INSIGHT FLYWHEELInternational NGP tools have fundamental gaps across clinical workflow integration, local language support, and privacy architecture — gaps that become critical in non-English healthcare environments.
| Dimension | Traditional Query Tools Phenomizer, GDDP |
International NLP Models ClinPhen, Doc2Hpo |
Face2Gene (FDNA) 130 countries · 2,000 clinics |
DeepRare Nature 2026 · Academic Engine |
PhenoCDS |
|---|---|---|---|---|---|
| Clinical workflow integration | Manual code entry, post-consultation lookup | Retrospective upload, time-lagged | Facial photo upload required — consent burden & workflow disruption at bedside | Academic tool only, no EHR integration | Real-time IntelliSense at point of care |
| Non-English language support | English HPO terms only | English-only training corpus | English interface; no local clinical language support | Primarily English, limited localization | Multi-layer bilingual parsing (Rule + Embedding) with adaptive local mapping |
| Non-facial rare diseases (e.g. XLH) | Partial — relies on HPO input only | Partial support | Completely inapplicable — ~60% of rare diseases have no facial features; Caucasian bias documented in Asian patients | Full support via HPO symptom reasoning | Full support — symptom-driven, no photo required |
| Guides doctors to find missing clues | No | No | No — photo-based; cannot prompt for unseen symptoms | No — requires complete HPO input upfront | Next-best-question engine actively prompts for missing phenotypes |
| Privacy & HIPAA compliance | Public web, uncontrolled data flow | Requires upload to foreign cloud servers | Facial photo upload raises significant privacy & IRB concerns | Cloud-dependent, IRB clearance challenging | HIPAA-compliant · Zero PHI transmission · Secure web processing |
| Regional prevalence calibration | Global static database only | No regional adjustment | No — Caucasian-dominant training; documented accuracy drop in Asian patients | Literature-based priors, not locally calibrated | Dynamic prevalence weighting — continuously calibrated by real-world diagnosis outcomes |
| Onset timeline weighting | Flat phenotype set, no temporal dimension | Not supported | Not supported | Partial support | Congenital / Infantile / Adult dynamic weighting via Orphanet onset data |
| Post-diagnosis resource linkage | None | None | None | None | Integrated rare disease foundation, patient support & orphan drug eligibility |
| FHIR output | HPO list only | Partial (SNOMED CT primary) | No FHIR output | Partial FHIR support | Native FHIR R4 Observation Bundle output |
NGP defines the direction. PhenoCDS delivers three critical evolutions for real-world clinical deployment.
PhenoCDS addresses two massive, opposing market failures simultaneously — unlocking dormant drug revenue for pharma while eliminating wasteful healthcare spending for payers.
Pharmaceutical companies invest billions developing rare disease treatments. Yet 40–60% of eligible patients remain undiagnosed — meaning drugs with approved indications generate a fraction of their potential revenue. Every year of diagnostic delay is a year of lost treatment.
| Disease | Cost/patient/yr | Idle patients | Idle market |
|---|---|---|---|
| XLH | NT$5.45M | ~1,000 | NT$5.45B |
| Porphyria | NT$13.2M | ~2,200 | NT$29.0B |
| OI | NT$18.6K | ~700 | NT$13.0M |
| Combined idle market | ~NT$34.5B | ||
Every year a rare disease patient spends undiagnosed, they cycle through unnecessary consultations, redundant tests, and ineffective treatments. This is not a clinical failure — it is a systemic failure to connect observable symptoms to the right diagnosis at the first encounter.
Real-time phenotype checklists prevent diagnostic misses at the moment of care. Rare genetics expertise is democratized to every frontline physician — regardless of specialty or institution.
Cleaning clinical input at the source produces the first high-quality structured genetic phenotype database aligned to HPO/FHIR standards — a foundation for precision medicine and real-world evidence studies.
Shortened diagnostic odyssey. At first suspicion, the system immediately connects families to local patient organizations, orphan drug eligibility, social welfare resources, and clinical trial enrollment opportunities.
A rare convergence of clinical genetics expertise, AI engineering, and healthcare entrepreneurship — united by a shared mission to end the diagnostic odyssey.
Interested in clinical collaboration, research partnership, or investment? We'd love to hear from you.