Generative AI: Data privacy, backup and compliance

Generative or conversational artificial intelligence (AI) tools have attracted a lot of attention, as well as some controversy, as applications such as OpenAI’s ChatGPT and Google’s Bard create human-like responses to queries or prompts.

These apps draw on large databases of content and raise questions around intellectual property, privacy and security. In this article, we look at how chatbots work, the risks posed to data privacy and compliance, and where generated content stands with regard to backup.

These tools – more accurately termed “generative AI” – draw on large language models to create human-like responses. OpenAI’s large language model is the Generative Pre-trained Transformer (GPT); Google Bard uses the Language Model for Dialogue Applications (LaMDA).

However, the rapid growth of these services has caused concern among IT professionals. According to Mathieu Gorge, founder of VigiTrust, all 15 chief information security officers he interviewed for a recent research project cited generative AI as a worry.

“The most serious concerns are IP leakage and confidentiality when using generative AI,” says Gorge, adding that the ease of use of web- or app-based AI tools risks creating another form of shadow IT.

As online services, generative AI apps transmit and process data over the internet. The main services do not detail where they physically store data.

“Every one of these services has different terms and conditions, and you need to read these very carefully,” says Tony Lock at Freeform Dynamics. “Are they using your inputs, so next time you log on they know who you are and how you like to phrase your queries? They are probably saving some of that information. A lot depends on the systems, because some use old data [to answer queries] and others go out and look at everything they can find.”

Chatbots and data privacy

These services do have data privacy policies, however. ChatGPT, for example, allows users to delete conversations one at a time (within a 30-day limit), delete all their data, or delete their entire account.

And the service monitors queries to prevent abuse. ChatGPT retains user data to improve its services, but users can opt out. Google, meanwhile, states that Bard collects “conversations, your location, your feedback and usage information” to improve the service and Google’s machine learning services more broadly. It does not, despite online rumour, access personal information in Gmail or other Google services.

Despite these safeguards, chatbot services pose a number of challenges for enterprises. They use public data for their models, but unlike enterprise-based machine learning and AI, firms have no control over or visibility into the training data. Nor is there any automated way to stop an employee sharing intellectual property or personally identifiable data, such as health or financial records, with Bard or ChatGPT.
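There are no off-the-shelf controls for this yet, but a simple outbound filter illustrates the kind of guardrail a firm could build for itself. The Python sketch below is illustrative only: the patterns, the blocked-terms list and the check_prompt function are hypothetical examples, not part of any chatbot service, and a real deployment would rely on a proper data loss prevention (DLP) tool.

```python
import re

# Hypothetical patterns for a few common kinds of personal data.
PII_PATTERNS = {
    "email address": re.compile(r"\b[\w.+-]+@[\w-]+\.[\w.]+\b"),
    "card number": re.compile(r"\b(?:\d[ -]?){13,16}\b"),
    "UK NI number": re.compile(r"\b[A-Z]{2}\d{6}[A-D]\b"),
}

# Hypothetical internal code names that must never leave the firm.
BLOCKED_TERMS = ["project-falcon", "q3-forecast"]

def check_prompt(prompt: str) -> list[str]:
    """Return reasons to block the prompt; an empty list means it may be sent."""
    reasons = [f"possible {name}" for name, pattern in PII_PATTERNS.items()
               if pattern.search(prompt)]
    reasons += [f"blocked term '{term}'" for term in BLOCKED_TERMS
                if term in prompt.lower()]
    return reasons

if __name__ == "__main__":
    findings = check_prompt("Summarise project-falcon for jane@example.com")
    print("Blocked:" if findings else "Allowed", ", ".join(findings))
```

Such a filter only works if it sits between the employee and the service, for example in a proxy or an approved internal front end, which reinforces Gorge’s point below about policy and shadow IT.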

“You need to have a policy and rules for where and when you use it,” says Gorge. Using a generative AI tool to create marketing materials is acceptable, he suggests, but such tools should not be used for sensitive or critical documents such as contracts.

Also, you need to define where data will be held and what will be used in the model, says Richard Watson-Bruhn, data security expert at PA Consulting.

“You may be using chat-like content in the model or you might be holding it separately for records,” he says. “ChatGPT, for example, records previous chats and typically uses them to improve model outcomes. There might, however, also be important compliance reasons to hold chats on a temporary basis even if they aren’t incorporated into the model.”
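One way to meet that kind of requirement is to keep chat records in a separate store with a fixed retention window. The sketch below is a minimal illustration, assuming a local JSON Lines file and a 30-day window; the file name, record fields and retention period are assumptions to adapt to your own compliance policy.

```python
import json
import time
from pathlib import Path

RETENTION_DAYS = 30  # assumed window; set to match your compliance policy
LOG_FILE = Path("chat_records.jsonl")  # hypothetical local record store

def record_chat(prompt: str, response: str) -> None:
    """Append one exchange, with a timestamp, to the record file."""
    entry = {"ts": time.time(), "prompt": prompt, "response": response}
    with LOG_FILE.open("a", encoding="utf-8") as f:
        f.write(json.dumps(entry) + "\n")

def purge_expired() -> None:
    """Drop any records older than the retention window."""
    if not LOG_FILE.exists():
        return
    cutoff = time.time() - RETENTION_DAYS * 86400
    kept = [line for line in LOG_FILE.read_text(encoding="utf-8").splitlines()
            if json.loads(line)["ts"] >= cutoff]
    LOG_FILE.write_text("\n".join(kept) + ("\n" if kept else ""), encoding="utf-8")

if __name__ == "__main__":
    record_chat("Draft a press release", "(model response here)")
    purge_expired()  # run on a schedule, e.g. a daily cron job
```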

Chatbots and compliance

The use of public chatbot services also raises a number of compliance questions. If firms want to use customer data with generative AI, they will need to ensure data processing complies with the General Data Protection Regulation (GDPR). For internally operated systems, it is possible to obtain the necessary consents.

For public chatbots, this is almost certainly impossible, which is why experts advise against sharing personal data with them, and why some regulators have gone as far as temporary bans.

Examples include the Italian data protection authority’s temporary ban on ChatGPT for GDPR non-compliance (since lifted), and incidents such as the leaks of sensitive internal data Samsung suffered when employees used ChatGPT. Heads of security and privacy are increasingly being drawn into questions about the business use, risks and compliance requirements of AI.

There is a further compliance issue if enterprises use generative AI to make decisions that affect customers. Regulators are looking more closely at decisions made by AI or machine learning systems, and they will want to see these are made on reasonable grounds and free of bias and discrimination.

For in-house technology, keeping records of decisions should be straightforward, and firms should also record details of the data used to train models. None of this is possible with public chatbots. Moreover, a generative AI system may make different decisions in response to seemingly similar queries – the language models can interpret words or phrases differently from how a human analyst would – and if the training data or the large language model changes, results will change too.

This makes it hard for firms to explain decisions made by generative AI systems and to justify them.

“One of the issues is repeatability, or lack of repeatability,” says Patrick Smith, field chief technology officer for Europe at Pure Storage. “If you put the same queries into one of these AI tools, will you get the same response? I suspect you won’t if they are constantly updating their training data. If you look at the tools you can put into your own systems, then you can clearly lock down the training data at any given point.”
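A quick way to see this for yourself is to send the same prompt twice and compare the answers. This sketch assumes the OpenAI Python client (v1.x) with an API key in the OPENAI_API_KEY environment variable; the model name is illustrative. Even with temperature set to 0, identical answers are not guaranteed if the provider changes the underlying model or training data.

```python
from openai import OpenAI  # assumes the openai v1.x package is installed

client = OpenAI()  # reads OPENAI_API_KEY from the environment

def ask(prompt: str) -> str:
    """Send one prompt and return the model's text reply."""
    resp = client.chat.completions.create(
        model="gpt-3.5-turbo",  # illustrative model name
        messages=[{"role": "user", "content": prompt}],
        temperature=0,  # lowest randomness, but no hard repeatability guarantee
    )
    return resp.choices[0].message.content

question = "In one sentence, what is a large language model?"
first, second = ask(question), ask(question)
print("Identical responses:", first == second)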

Chatbots and backup

This then raises the question of how organisations back up chatbot data, or whether that is possible at all. Services such as ChatGPT save queries for 30 days, and it is possible to export queries and responses. Once again, though, it is down to the individual using the service to do this – there are as yet no enterprise-level automated backup and compliance tools for what are largely experimental services – and there is no way to capture a snapshot of the training data behind any one query.
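Until such tools exist, the practical option is to capture exchanges at the point of use. This is a minimal sketch, assuming any chat service wrapped as a Python callable; the send_fn parameter, file name and record fields are illustrative, not a real backup API. Recording the model name alongside each exchange also helps with the audit and explainability questions raised above.

```python
import datetime
import json
from pathlib import Path
from typing import Callable

BACKUP_FILE = Path("chat_backup.jsonl")  # hypothetical local backup store

def backed_up_chat(send_fn: Callable[[str], str], prompt: str,
                   model: str = "unknown") -> str:
    """Send a prompt via send_fn, then append the exchange to the backup file."""
    response = send_fn(prompt)
    entry = {
        "timestamp": datetime.datetime.now(datetime.timezone.utc).isoformat(),
        "model": model,  # record which model answered, for later audit
        "prompt": prompt,
        "response": response,
    }
    with BACKUP_FILE.open("a", encoding="utf-8") as f:
        f.write(json.dumps(entry) + "\n")
    return response

if __name__ == "__main__":
    # Stub client for demonstration; swap in a real API call in practice.
    stub = lambda p: f"(stub reply to: {p})"
    print(backed_up_chat(stub, "Draft a product description", model="stub-1"))
```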

This suggests that, while CIOs and chief data officers will want to experiment with generative AI, the technology still has some way to go before it is mature enough for mainstream enterprise use.
