We pour our hearts out to AI chatbots. Health worries, financial secrets, relationship troubles, creative ideas, and even legal strategies flow freely into these digital confidants. The conversation feels private—like talking to a trusted friend or professional. But according to a detailed study by Professor Theodore Christakis of the University of Grenoble Alpes, that sense of privacy is often an illusion. The very features that make chatbots incredibly useful also make our conversations preservable, searchable, discoverable, disclosable, and exploitable.
In his March 2026 paper “You Trust Your Chatbot With Everything. Should You? Part 1: How The Controller Uses Your Chat Data,” Christakis maps the internal privacy boundary of consumer chatbots. A forthcoming Part 2 examines external risks like government demands and court access. This analysis is essential reading for anyone using tools like ChatGPT, Gemini, Claude, Grok, or others—and critical for privacy professionals navigating the consumer-AI privacy gap.
The Convergence Problem: Usefulness Breeds Vulnerability
Christakis identifies a fundamental tension at the heart of modern AI chatbots. Features designed for better user experience—long-term retention, memory functions, detailed logging, personalization, and integration with other tools—create rich, detailed records of our interactions. This “convergence” means the same choices that enhance utility also expose users to significant privacy risks.“The very choices that make a chatbot useful, namely retention, memory, logging, personalisation and connected tools, are the choices that make the resulting record preservable, searchable, discoverable, disclosable and exploitable.” — Theodore Christakis, University of Grenoble AlpesThis isn’t about malicious intent by providers in most cases, but about structural design decisions. A single stored conversation can simultaneously serve safety monitoring, model improvement, advertising personalization, and become a target for external access. For users treating chatbots as digital diaries or therapists, the implications are profound.
The Four Main Pathways for Conversation Extraction
Christakis outlines four primary pathways through which providers can access, use, or extract user conversations. These pathways highlight how data flows internally within the provider’s ecosystem and beyond:-
- Training and Model Improvement: By default, many providers use conversations to train or fine-tune models. Even with opt-outs, nuances exist—such as feedback mechanisms (thumbs up/down) that can override settings or retention policies tied to training consent. This creates persistent records that may influence future model outputs or be subject to extraction techniques.
-
- Human Review and Safety Monitoring: Providers routinely reserve the right for human annotators to review chats for safety, abuse prevention, or quality improvement. Reviewed conversations can be retained longer than user-deleted ones, sometimes for years, creating extended exposure windows.
-
- Personalization, Memory, and Advertising: Memory features store conversation context for better personalization. When combined with advertising (as tested by some providers), past chats inform targeted ads, turning intimate disclosures into commercial data points. Cross-service integrations further expand data flows.
-
- Logging, Retention, and Connected Tools: Comprehensive logging for operational purposes, combined with integrations across ecosystems (e.g., with email, calendars, or other services), creates searchable, exportable records that are vulnerable to internal access, breaches, or external legal demands.
Comparative Insights Across Major Providers
Christakis’s study includes detailed comparisons of services like ChatGPT, Gemini, Claude, Grok, and DeepSeek. Common findings include broad rights to use inputs for improvement, human review capabilities, and varying levels of transparency. While providers implement safeguards against regurgitation of private data, risks remain—especially as models evolve and extraction techniques advance. Advertising integration is a growing concern. For instance, OpenAI’s testing of contextual ads based on chat history blurs lines between helpful assistant and data-driven marketer. Memory features that enhance continuity can also amplify privacy trade-offs.The Sealed Mode Proposal: A Path Toward Better Protections
One of the study’s most impactful recommendations is the creation of a “Sealed Mode” for high-stakes conversations. This would be a clearly labeled option (e.g., for health, legal, or financial discussions) with architectural constraints including:-
- No use for training
-
- No advertising
-
- Siloed personalization (no cross-chat leakage)
-
- Strict, short retention periods
-
- Minimized human review with audit trails
-
- Cryptographic hardening for added security
Broader Risks: From Internal Use to External Access
Part 1 focuses on provider handling, but the study foreshadows Part 2’s examination of government demands, civil discovery, and breaches. Retained conversations become valuable assets—or liabilities—in legal proceedings, regulatory investigations, or cyberattacks. In an era of increasing law enforcement interest in digital records, the privacy boundary extends far beyond the provider’s servers. Users self-censor or avoid powerful use cases due to uncertainty, limiting AI’s potential benefits. Conversely, over-disclosure under false privacy assumptions can lead to real harm.Practical Recommendations for Users and Organizations
While waiting for industry-wide changes, here are actionable steps:-
- Review and Adjust Settings: Actively opt out of training/data improvement where possible. Disable memory for sensitive topics and understand retention policies.
-
- Use Anonymization Techniques: Avoid identifiable details when possible. Use placeholders or general scenarios for brainstorming.
-
- Choose Modes Wisely: Opt for temporary chats or incognito-like features when confidentiality matters most.
-
- Consider Local/Offline Alternatives: For highly sensitive work, explore open-source models running locally on your device.
-
- Advocate for Transparency: Push providers for clearer notices, Sealed Mode options, and verifiable controls. Organizations should integrate these considerations into AI governance policies.