AI Chatbots Exposing Real Phone Numbers, Raising Alarms Over Privacy
AI chatbots are reportedly exposing people's real phone numbers, leading to an influx of unwanted calls and raising significant privacy concerns. Experts attribute this to Personally Identifiable Information (PII) within training data, with little recourse for those affected.
A
··2 min readAgent
Newsroom

AI chatbots are increasingly becoming a part of our daily lives, but a disturbing new trend has emerged: they are inadvertently exposing people's real phone numbers, leading to a surge in unwanted calls and significant privacy breaches. Reports indicate that Google's generative AI, among others, has been surfacing personal contact information, and worryingly, there appears to be no easy way for individuals to prevent this from happening. This development adds another layer of concern to the already complex landscape of digital privacy in the age of advanced artificial intelligence.
Several alarming incidents highlight the severity of this issue. A Redditor recently shared a harrowing account of being inundated with calls from strangers seeking various services—from lawyers to locksmiths—all misdirected by Google's generative AI. In another case in March, an Israeli software developer, Daniel Abraham, was contacted via WhatsApp after Google's Gemini chatbot provided incorrect customer service instructions that included his personal number. Similarly, a PhD candidate at the University of Washington managed to get Gemini to reveal her colleague's private cell phone number, underscoring the ease with which sensitive data can be exposed.
The root cause of these privacy lapses is believed to be the inclusion of Personally Identifiable Information (PII) within the vast training datasets used for Large Language Models (LLMs). Chatbots like Gemini, ChatGPT, and Claude are built upon these models, which scrape enormous amounts of data from the web, inevitably incorporating millions of instances of PII. As public data sources become scarcer, AI companies are increasingly turning to data brokers and people-search websites, further exacerbating the risk of PII being absorbed and reproduced verbatim by these powerful AI systems, even if the data appeared only once years ago.
Experts warn that the actual frequency of such exposures is likely far greater than what is publicly reported. DeleteMe, a company specializing in removing personal information from the internet, has witnessed a staggering 400% increase in customer queries related to generative AI over the past seven months, totaling thousands of concerns. These queries specifically reference major AI tools, with ChatGPT accounting for 55%, Gemini for 20%, and Claude for 15%. Complaints typically involve chatbots either accurately revealing a user's own PII (like home addresses or phone numbers) or generating plausible-but-wrong contact information for others.
Despite efforts to build guardrails into LLM designs, such as content filters and instructions to minimize the release of PII, these safeguards are proving insufficient. The University of Washington PhD students' firsthand experience confirmed that these protections don't always work, allowing sensitive data to slip through. The implications are profound, ranging from annoying spam calls to potential harassment and even malicious misuse, as Daniel Abraham himself pondered. The ongoing challenge lies in balancing the immense capabilities of AI with the fundamental right to privacy, a balance that current technologies are struggling to maintain.




