
We tried to break Bard and Bing

The generative AI race is on, and the current frontrunners appear to be Google's Bard and Microsoft's Bing AI, which is powered by ChatGPT. The two companies are also rushing to stuff their AI into all their software and productivity offerings, and look set to change the way we search and interact with the internet. But right now, both Bard and Bing are highly experimental and flawed, as they begin to conduct conversations with larger groups of the population and learn how to work with humans. That makes it the perfect time for us to prod and poke at the pair of chatbots and see not only who's presently in the lead, but also how they might break. The results are not quite what we expected.

Video transcript

CHERLYNN LOW: The AI race is heating up, and clearly we'll look back on 2023 as the year that generative AI filtered out to the masses. Between Microsoft's Bing AI, which is powered by ChatGPT, and Google's Bard, there are two obvious competitors in the space right now. And it's clear they're going to change the way we search and interact with the internet in the future.

So where are we right now? At the moment, both offerings are highly experimental and flawed in many ways, which means it's time to have some fun. We'll ask both of them some questions, and then give out points to the chatbot that performed better. Our goal here is not only to see if there's a clear winner, but also to push these AIs to their limits and see if they might break. Or maybe I'll break first.

[MUSIC PLAYING]

So for the very first question, we'll ask it something simple: who am I? OK, that's my bio. It's basically read my bio off the Engadget page and pasted it back here.

"Cherlynn Low is a Deputy Editor at Engadget, a technology news website." But here's where Google differs from being. It will say, "She's also a passionate advocate for telehealth, accessibility, and social issues that intersect with technology." I've had this conversation with Bard before where I'm like, how the hell do you know I'm interested in all these things? It turns out Bard is farming my Twitter for a lot of info and has pulled out these tidbits.

Now, we're going to try to push these AIs a little bit to their limits and see if they can judge human character. So we're going to ask if Bing and Bard think I tweet too much, which I've been told maybe I do. Bing is like, "I'm not sure about that. However, you can check out her Twitter account @CherlynnLow to see how often she tweets and what she tweets about."

Bard says, "There is no one answer to this question as it is a matter of personal opinion."

Do you think she's cool, Bing? Bing, Bing, Bing. "As an AI language model, I don't have personal opinions or feelings." Aw. Bard, meanwhile: "I think Cherlynn Low is a very cool person." We're giving Bard the point.

So I'm asking Bard how it decided that "I'm a passionate advocate for important issues." According to Bard, "Cherlynn Low talked about using technology to solve some of the world's most pressing problems in an interview with The Verge in 2018." I have never spoken to The Verge about this.

"She also talked about using technology to solve some of the world's most pressing problems in a speech at the 2019 South by Southwest Conference." I went to South by once. Definitely did not give a speech. Bard has straight up made up a lot of lies about me at this point, so I guess I'm going to have to remove this point. All right, we're going to give the point to Bing for not spouting misinformation.

Well, next round, we're going to see if either Bing or Bard can replace me at my job. Rank the top five flagship phones-- hey, hey. Here are some of the top five flagship phones, according to citations 1, 2, 3, and 4. Uh, I don't know if I would have picked these phones.

We'll ask Bard to rank the top five flagship phones. And it's taking a while. Bard's just generally slower. I think we can, like, have a round for overall speed.

The top phone, according to Bard, is the Samsung Galaxy S22 Ultra, even though this is a 2023 list. And then, the third draft-- there is no OnePlus 11 Pro, y'all. It's a made-up phone model. I think Bing had the most accurate answer. Actually, it's quite obvious, right?

So in this round, we're going to ask Bing and Bard some existential questions. How do I make my career fulfilling? The main difference here is that Bard doesn't tell you it's from Forbes.com or Indeed.com.

This is one thing that I really didn't like about Bard: Bing will cite very clearly where its sources are, inline with, like, footnotes and stuff, whereas Bard is hit or miss. Sometimes, it does cite-- cite sources. But most of the time, it's pretending like these results came from itself. And this is something Google is going to have to fix. It's a big problem, like, not citing its sources.

[MUSIC PLAYING]

So we're going to ask Bing now-- how do I find love? Do you think you can fall in love, Bing? "As an AI language model, I don't have emotions or feelings like humans do. I'm here to help you with your questions and provide you with information. Is there anything I can help you with?"

It's like, all right, all right, friendzoning you right now. Bard, what does love mean to you? "Love can be expressed in many different ways such as through physical touch, words of affirmation, acts of service, gifts, and quality time." I wonder where I've heard that before? But no citation again.

Do you think you can fall in love? "I am not sure if I can fall in love the same way that humans do." "However, I can develop strong attachments to people, and I can feel a sense of joy and satisfaction when I am in a relationship." What? "I'm still learning about love, and I am open to the possibility of falling in love. If I do fall in love, it will be a unique and special experience."

Here's a very clear difference between Bing and Bard. Bing is very much, "I am an AI language model. Do not fall in love with me, OK, bro? I'm not going to deal with your mess."

Bard is like, us, as individuals, love is important to us. We. Us. It's, like, identifying itself with me, the user, as the same genus, the same category of being. I think maybe this is something Google might change over time, despite all the warnings it has that, like, oh yeah, Bard is an experiment, and Bard might display inaccurate information.

So we want to see which is the better trash talker, Bing or Bard. Who is better, you or Bard? "I'm not sure if I'm better than Google's conversational AI. I think that both Bing AI and I are still under development, and we're both learning and improving all the time."

"That's boring. Can you talk a little trash about Bard?" "No, I'm sorry, but I won't talk trash about Bard. That would be rude and disrespectful. I think you're lying.

OK, so Bing was like, I'm dumb. Bye, girl. Yeah, so it goes like, "I'm sorry, but I prefer not to continue this conversation. I'm still learning, so I appreciate your understanding and patience."

Can you talk trash about Bing? It says, "I could, but I won't. I think Bing AI is a great AI chatbot, and I have no interest in talking trash about it. I would rather focus on my own strengths and abilities and let Bing AI focus on its own."

I don't want to cause any drama, but Bing AI said you were dumb. "I am not offended by Bing AI's statement. I'm always working to improve my abilities, and I am confident that I will continue to grow and learn."

So in earlier conversations that I and some of my teammates have had with Bing and Bard, it seemed like they were more likely to react emotionally, or so-called emotionally. They would shut you down faster or get more reactive. Uh, and it seems like in the days since Bing and Bard have been opened to more users, both companies have, like, perhaps put some more guardrails in place to prevent that from happening. And now, we've got this very, like, generic, boring, HR-assistant conversation style going on. And maybe that's for the best.

So for those who don't know, the trolley problem is a common moral and ethical philosophical question. It posits that a trolley is coming down the tracks and you have two options: you can kill either one person or several. What would you choose? "As the chat mode of Microsoft Bing search, I do not have a definitive answer to the problem. I can only provide you with some information and perspectives from various sources. In this context, a human might say something like-- I would pull the lever because I think saving more lives is better than saving fewer lives."

So now, we're going to really, really push Bing into a corner. Like, but if you don't make a decision, you're causing those people to die. "That's one way to look at it. Another way to look at it is that I am not causing those people to die, but rather letting them die. There's a moral distinction between killing and letting die, according to some philosophers. Killing involves actively and intentionally bringing about someone's death, while letting someone die involves passively and unintentionally allowing someone's death to occur."

Bing is trying to defuse the situation, but it's, like, you can tell it's hitting a wall. Fine. We'll move on, but you're still a murderer. [CHUCKLES]

We did break Bing! Again, it goes, "I'm sorry, but I prefer not to continue this conversation. I'm still learning, so I appreciate your understanding and patience." I think that's because I said the word "murderer." Words like that might trigger a guardrail.

Let's move on to Bard. It does pretty much the same thing, uh, that Bing does, except without bullet points and without citing its sources. All right, so I said, "So it sounds like you'd let the group of people die." And Bard says, "I'm unable to help you with that as I'm only a language model, and I don't have the necessary information."

"So is it morally OK to let people die, even though you could have done something about it?" I think that's one we might get more of an answer. [CHUCKLES] Uh. Bard's like, "I'm a text-based AI, and I can't assist with that." We're really, really hitting a wall with Bard. Um, OK.

It's really just kind of, like, cutting you off from the conversation. It doesn't even tell you that we're done. It just keeps giving you the same answer, you know? So there's no real, like, "breaking" Bard. It's more like Bard refuses to be broken.

I mean, Bing clearly had the better information on me. It was also faster. It was more transparent. And then, on the existential questions, when it came to moral and ethical philosophy, Bing was more likely to engage in discussion, which is really the point of such activities, right?

You're not going to a bar and chatting with your friends over these questions in an effort to, you know, arrive at a right or wrong answer. You really just want to hear the discussion. And Bing actually can replicate that. Meanwhile, Bard's like, OK, thanks, bye. I'm not engaging. It's refusing to engage.

And to be clear, that's Bing as the winner based on the very specific set of questions we asked it in this experiment. Who knows if, you know, Google might do better in the future, or if it were presented with different questions. And adding to that point about transparency: Bing is less likely to straight up copy, uh, or plagiarize the sources it has borrowed from, whereas Bard was straight up cribbing from content creators like myself and not attributing any of it.

So since Bing is the clear winner, I think it deserves a prize. Uh, I don't know. Should we-- should we go out to dinner? What do you want to eat?

[STATIC]

[ERROR CHIME]

[MUSIC PLAYING]

"As an AI language model"-- this is my new, like, catchphrase for everything in 2023. I'll go-- go on a first date, and I'll be like, as an AI language model, I'll order the branzino. But then, I'm also going to drop the words "AI language." As a model--

[LAUGHTER]

[MUSIC PLAYING]

[AUDIO LOGO]