When a patient downloads their medical records and uploads them to ChatGPT to ask about a diagnosis, something legally significant happens at the moment of upload. The data does not change. The information inside the record — diagnoses, prescriptions, lab results, the names of treating physicians — is identical to what it was inside the hospital’s system. But the legal protection wrapped around it has largely evaporated.
HIPAA’s protections attach to covered entities: healthcare providers, health plans, and their business associates. The moment a patient exercises their federally guaranteed right to access their own records and transfers them to a consumer AI platform, those records step outside HIPAA’s perimeter. What was among the most legally protected categories of personal data in the United States becomes governed by a patchwork of state consumer privacy laws, FTC enforcement authority, and whatever privacy commitments the AI platform has voluntarily made.
This is not a loophole. It is an architectural shift — one that is happening at scale, accelerating, and raising compliance questions that most health-adjacent AI platforms are not yet equipped to answer.
How We Got Here: The Policy Decisions That Made This Possible
The movement of HIPAA-protected records into consumer AI environments did not happen spontaneously. It was enabled by a deliberate federal policy choice: the requirement under the 21st Century Cures Act that healthcare entities facilitate patient access to their electronic health information and maintain standardized APIs that make that data interoperable and moveable. The Information Blocking provisions of the Cures Act define information blocking as any practice that is “likely to interfere with, prevent, or materially discourage the access, exchange, or use of electronic health information.” Healthcare providers that make it difficult for patients to access or transfer their records are subject to enforcement. As of February 2026, the Information Blocking Complaint Portal has received over 1,600 complaints, with enforcement action increasingly anticipated. The intent was entirely reasonable: patients should control their own health information. A person managing a chronic condition should be able to share their records with a new specialist without navigating institutional friction. A patient seeking a second opinion should not have to beg for their own data. The downstream consequence — that this same interoperability makes it trivially easy to upload a complete medical history to a consumer AI platform that has no HIPAA obligations whatsoever — was not the primary design concern. It is now a primary compliance concern for every platform that accepts health data from users.The Architecture That Has Changed
Until recently, health data in digital systems operated in relatively predictable silos. HIPAA-protected data stayed within covered entity environments — patient portals, electronic health record systems, clinical applications — governed by the HIPAA Privacy and Security Rules. Consumer health data, collected by wellness apps, wearables, and symptom checkers, operated outside those environments under a different and generally weaker regulatory framework. The technical architecture underlying both environments was client-server: a user inputs data into a form, the data goes to a central server, the server runs deterministic rules-based logic and returns a response. Predictable, auditable, and bounded in what it could do with the data it received. LLMs change this in ways that matter for privacy governance:- They are not deterministic. The same input does not always produce the same output. The processing logic is not human-authored rules — it is learned patterns encoded in model weights. This makes it fundamentally harder to audit what the system does with the data it receives.
- They are longitudinal and pattern-aware. Where a rules-based system responds to a discrete query, an LLM can build contextual understanding across a conversation and, depending on configuration, across sessions. A medical record uploaded in one session can inform responses in subsequent sessions in ways that are not always transparent to the user.
- They comingle data from different regulatory environments. A single LLM session can contain HIPAA-protected records a user uploaded, consumer health information the user typed in, and inferences the model drew from both. These sit together in a context window and potentially in persistent memory, governed by whatever the platform’s privacy policy says rather than by HIPAA.
- They generate inferences. An LLM processing a medical record does not merely store and retrieve the information in that record. It draws inferences — about health conditions not explicitly stated, about family members mentioned in the record, about the patient’s likely future health trajectory. Those inferences may not exist anywhere in the source data but are created by the processing itself.
The Regulatory Fragmentation Problem
When a patient uploads their medical records to a consumer AI health platform, the legal framework that governs what happens next depends on a combination of factors that vary by user, by state, by the nature of the data, and by the platform’s own legal structure. There is no single answer. A platform handling health data uploaded by consumers may simultaneously face obligations under:- The FTC Act — prohibitions on unfair and deceptive practices apply to any consumer-facing platform, and the FTC’s Health Breach Notification Rule requires notification when consumer health data is breached by non-HIPAA entities
- California’s Confidentiality of Medical Information Act (CMIA) — which extends protections beyond HIPAA to a broader range of entities handling medical information about California residents
- Washington’s My Health My Data Act — one of the broadest state health privacy laws in the country, applying to consumer health data with opt-in consent requirements that exceed federal standards
- Virginia SB 754 — adding health data protections in another major state
- Illinois’ Wellness and Oversight for Psychological Resources Act — imposing specific restrictions on AI use in mental health contexts
- Genetic privacy laws where uploaded records contain genomic information — including Texas HB 2545 and the Texas Genomic Act
- Youth protection requirements where minors share health information, triggering verifiable parental consent obligations
- Comprehensive state privacy laws in the growing number of states that have enacted them, where health data triggers heightened obligations
The Data That Belongs to People Who Never Consented
One of the most underappreciated privacy implications of consumer health AI is the auxiliary data problem — the personal information embedded in medical records that belongs to people other than the user who uploaded them. A complete medical record is rarely about one person only. It routinely contains:- Family member information — genetic test results that reveal heritable conditions shared with biological relatives, family history sections that document diagnoses of parents, siblings, and children, emergency contact details and insurance beneficiary information
- Minor children’s data — pediatric records included in a parent’s health history, vaccination records, developmental assessments, and mental health notes about children who cannot consent on their own behalf
- Healthcare provider information — physician names, provider identifiers, tax numbers, clinical notes authored by specific practitioners, internal protocols that may be proprietary or subject to confidentiality obligations
- Third-party diagnoses and treatment decisions — notes from specialists, referral letters, second opinion consultations — authored by clinicians who have not consented to have their professional judgment processed by an AI system
- Potentially copyrighted clinical content — proprietary clinical methodologies, copyrighted educational materials, and internal protocols embedded in records by the institutions that created them
Clinical Judgment and Algorithmic Interpretation
Beyond the data governance questions, consumer health AI introduces a clinical accuracy problem that has direct implications for liability and consumer protection. Medical practice routinely involves judgment calls that fall outside standard protocols but represent the correct clinical decision for a specific patient. Off-label prescribing is the most common example — using FDA-approved medications for conditions they were not approved to treat is evidence-based, widespread, and often the standard of care in specialized medicine. A general-purpose LLM interpreting a treatment plan that includes off-label prescribing may flag it as incorrect or potentially dangerous, undermining the patient’s confidence in their physician’s decision without any clinical basis for doing so. The problem extends to any clinical decision involving nuance:- Treatment guidelines that are actively evolving and where the most current evidence differs from what is encoded in a model’s training data
- Patient-specific contraindications where the treating physician has made a documented judgment about a particular patient’s risk profile
- Specialist reasoning that is correct within a subspecialty but appears anomalous from a generalist perspective
- Experimental or compassionate use treatments where the clinical rationale is sophisticated and context-dependent
The Emerging Governance Architecture
In the absence of a comprehensive regulatory framework, some AI health platforms are building their own governance architecture — a combination of technical design choices and voluntary policy commitments that attempt to address the risks the regulatory patchwork leaves unmanaged. Several approaches are emerging:- Health data segmentation — maintaining separate storage and processing environments for uploaded medical records versus other user data, with distinct access controls and retention policies for each. Some platforms are publicly committing to “purpose-built isolation, separate memories, and compartmentalized storage” for health records specifically.
- AI training exclusions — explicit policies stating that uploaded health data is not used to train or fine-tune AI models. This addresses one of the primary concerns users have about consumer health AI and aligns with the direction of travel in data minimization regulation, where necessity requirements are increasingly constraining secondary uses of personal data.
- Voluntary HIPAA-equivalent commitments — platforms that are not HIPAA covered entities but that handle health data committing to apply HIPAA-level protections voluntarily. As noted above, these commitments create enforceable standards even where HIPAA itself does not apply.
- AI ethics boards and governance committees — internal oversight structures that review health AI use cases against ethical and privacy standards beyond what law requires.
The Compliance Priorities for Health-Adjacent AI Platforms
For any platform that accepts health data from users — whether purpose-built for health or a general-purpose AI that users are choosing to use for health purposes — the compliance priorities are:- Map the regulatory frameworks that apply to your user base, not just your platform’s general category. The combination of states where your users are located, the categories of data your platform accepts, and the inferences your system generates determines your compliance obligations. A single national compliance posture built around the most permissive applicable standard is not adequate.
- Design for the auxiliary data problem explicitly. Technical measures that detect and handle third-party personal data embedded in uploaded records — family member information, minor children’s data, provider identifiers — need to be part of the platform architecture, not an afterthought. This includes both what the platform retains and what it uses for inference.
- Align your public privacy commitments with your actual technical implementation. Voluntary commitments made in privacy policies and marketing materials are enforced as binding representations. Every commitment to HIPAA-equivalent protection, training exclusion, or data segmentation needs to be technically implemented and verifiable — not aspirational language in a privacy policy.
- Build clinical nuance into your model’s behavior design. AI health tools that cannot distinguish between off-label prescribing and prescribing errors, or between evolving clinical guidelines and outdated standard-of-care assumptions, create both clinical risk and liability exposure. This is a model design and evaluation problem, not just a disclaimer problem.
- Monitor the regulatory development actively. The state health privacy landscape is moving faster than federal policy in this area. Washington’s My Health My Data Act, Illinois’ mental health AI restrictions, and the expanding set of state genetic privacy laws are all actively developing. A compliance program that was accurate at the beginning of 2025 may have significant gaps by the end of 2026.