Join the discussion: Ask Jordan questions on Google Gemini
Upcoming Episodes: Check out the upcoming Everyday AI Livestream lineup
Connect with Jordan Wilson: LinkedIn Profile
Everything You Need to Know About Google Gemini Advanced
Google's latest AI product, Gemini Advanced, provides a unique approach to AI integration. Although the model initially lacked in certain areas such as Google search functionality and real-time event management, it compensates with speed and a personalized appeal. However, it crucially fails to support Google Workspace accounts, limiting its usability in business environments.
Gemini's Real-Time Event Management
Businesses rely heavily on AI systems to provide real-time information to support immediate and strategic decision-making. Gemini's inability to handle real-time events like the Super Bowl or US political primaries raises questions over its effectiveness.
Google Gemini and Google Search
In a puzzling development, Gemini's standard response to a query is often to advise using Google search for accurate information, contradicting the perceived advantage of being integrated with Google's search engine.
Comparing Gemini Advanced and ChatGPT
While the Gemini model offers speed and novelty, it falls short when compared to ChatGPT (Generative Pre-training Transformer) — a popular language model developed by OpenAI. Gemini Ultra 1.0 couldn't answer some complex queries as accurately as ChatGPT, implying that Gemini Advanced may not be an 'AI killer'.
Gemini's Identity Crisis
Similar to humans, AIs can have an identity crisis too. The disconnection with Google Workspace accounts, its inadequacy in utilizing Google search, and its limitation in understanding its own capabilities reflect Gemini's 'identity crisis'.
Considerations for AI Deployment
With evolving AI, businesses need to consider factors such as the model’s ability to provide accurate and up-to-date information, contextual understanding, and awareness of current events for strategic deployment.
Although Google's Gemini Ultra 1.0 showcases AI's potential and offers unique features, the model needs refinement in multiple areas for a more robust business application. Its lack of support for Google Workspace accounts, limited real-time information management, and issues with Google search usability are constraints that require consideration.
Businesses planning to implement AI should reckon with these factors to optimally benefit from AI applications. As technology progresses, AI will undoubtedly continue to evolve, shaping new ways of doing business.
Topics Covered in This Episode
1. Launch and Access to Google Gemini Advanced
2. Features of Google Gemini
3. Performance and Comparisons
4. User Feedback and Experiences
5. Issues with Google Gemini
Jordan Wilson [00:00:16]:
Did Google just release a ChatGPT killer.
Jordan Wilson [00:00:20]:
Is there new Gemini advanced? Super advanced?
Jordan Wilson [00:00:26]:
I'm gonna let you know my thoughts and some of our testing today on Google Gemini Advanced. Welcome. My name is Jordan Wilson, and I'm the host of Everyday AI. We're a daily livestream, podcast, and free daily newsletter helping everyday people like you and me Not just learn what's going on in the world of generative AI, but how we can all actually leverage it. That's what it's all about. Learning things in today's day and age Actually doesn't mean anything because there's too much to learn. You have to be able to understand what's important and how to make it work for you to grow your company and to grow your career. That's what we're all about here at Everyday AI.
Jordan Wilson [00:01:03]:
So thank you for joining us. And today, we're gonna be going over Google Gemini advanced Seven things to know about the new, Google Gemini Ultra 1 point o. Yeah. A lot of buzzwords today. We're gonna be getting to them. So, hey. If you're listening on the podcast, thank you very much. Appreciate your support.
Jordan Wilson [00:01:22]:
Check out the show notes. As always, a lot of great resources in there and a couple Hidden offers. I don't know. Yeah. You gotta scroll down. Find those, some some nice free offers in there for some free training, But make sure to go to your everyday AI .com and sign up for the free daily newsletter. And on our website, it is I tell people, it is like a free generative AI university. We've had now more than 200 shows across all aspects of generative AI, but you can even go on our website.
Jordan Wilson [00:01:49]:
We have these things called learning tracks. So if you wanna know about, You know, AI in entrepreneurship or AI in health care or AI in your career. We have literally dozens of podcasts dedicated to all these different categories. You can go back and read every single newsletter we've ever written as well. I don't know a single other resource in the world that has more and Free generative AI information than our website does. No joke. Or as the kids say, no cap. Alright.
Jordan Wilson [00:02:15]:
So Before we get into Google Ultra and Google Advanced, what's it what is it called? What is with with this naming mechanism? Alright. Before we get to there, we're gonna start as we always do with the AI news. Alright. So mid journey, two pieces of news out of Most people's favorite AI image generator, but their new alpha website is rolling out to more users and is now available to most users who have generated at least 1,000 images so far. So if you're looking to get off their Discord server and to work on their website, Check you may have access now. The other piece of mid journey news that I think is pretty important, but, the company is considering banning political images on their platform To prevent the spread of to prevent the spread of fake images and disinformation during the upcoming US presidential election. However, this may not effectively present the use of AI tools at large in political manipulation as a whole. So we may not see, you know, images of, you You know, Donald Trump or Joe Biden, you know, doing all these nefarious things that people are using.
Jordan Wilson [00:03:22]:
So keep keep an eye on that. That would actually be, I think, Welcome to news because right now, MidJourney is getting to the point with its newest v six rollout. It is actually very hard to tell. I've taken, probably 250,000 photos in my life with a DSLR camera. Used to kind of be in photography. It is so hard to tell the difference mid journey, v six images, and actual images from real life. Alright. Speaking of banning or clamping down on misinformation and disinformation, The FTC is has banned AI robocalls.
Jordan Wilson [00:03:53]:
Alright. So, here in the US, the Federal Trade Commission or sorry. The Federal Communications Commission Has made a unanimous decision to outlaw AI generated robocalls under the Telephone Consumer Protection Act targeting scams and misinformation. So This ruling empowers the FCC to find companies and give call recipients the right to take legal action. Let's go. Alright. So the FCC has outlawed these robocalls using AI generated voices. It can now issue those fines and block service providers as well.
Jordan Wilson [00:04:23]:
So state attorneys general Now have a new mechanism to crack down on violators, and individuals can potentially recover up to $1,500 in damages for each unwanted call. Hey. Now I'm kind of welcoming them. I get them all the time. Stack up some cash. We can finally pay for this everyday AI thing. Right? Alright. Our last piece of AI news.
Jordan Wilson [00:04:44]:
This one's a big one. We'll probably have a dedicated show on this one sometime soon. But, Some new reports are looking into what OpenAI is working on, and it could be bigger than ChatGPT. Alright. So OpenAI is reportedly working on agents that autonomously complete business tasks. So according to a report from the information, OpenAI is working on creating agent software that can automate complex tasks by taking control of a customer's device. So the product has the potential to be almost as revolutionary as their other product, GPT, and just the GPT technology that Thousands of companies take advantage of. Right? So here's here's essentially how it works.
Jordan Wilson [00:05:25]:
There's there could be, according to reports, 2 different agent types. So 1 agent that can literally take over your device and control it. Right? And then the other one which can perform actions for you on the web. Okay. So think of like RPA, but by using, OpenAI. Right? So, robotic process automation. So OpenAI's new product aims to automate tasks such as data transfer and report filing for users. This product could have obviously a significant impact, on just about everything.
Jordan Wilson [00:05:56]:
And OpenAI faces competition from other companies such as Google in this space. There's a lot of people working on AI agents behind the scenes reportedly. But these agents have potential applications and filling the gaps everywhere in enterprise apps where APIs are not currently available. So think of how right now, you know, you can use GPTs in OpenAI, to perform certain tasks, but they kind of always happen within ChatGPTs. So think of now if that could happen Everywhere. On the web. That's reportedly what OpenAI is working on. A lot going on in the AI news today.
Jordan Wilson [00:06:32]:
This is wild. Right? Hey. Thank you all for joining us, and I wanna know. I wanna know from hey. I wanna know from you, Ted, actually. Thanks for joining us. But, to Megan and Carolyn and Christy and everyone, thanks for joining us, Brian. Hey, Douglas.
Jordan Wilson [00:06:45]:
Douglas knows. You you know? Douglas knows. He said ready to buckle up for Gemini. Not a lot has changed, and, Douglas left us a a flame emoji and a poop emoji. Hey, Rolando. Thanks for joining us. He said, good morning all. Awesome PPP class last night.
Jordan Wilson [00:07:01]:
Thank you. He says, thank you for democratizing AI. Rolando, thank you for coming. Yeah. We do our free prime prompt polish prompt engineering 101 course. So if you want access, if if if you wanna learn better prompt engineering, it's free. And at the end, there's more free stuff. So alright.
Jordan Wilson [00:07:19]:
Let's get to it and let me know. Yeah. Like, Alene is asking here, who has tried Gemini? Well, I've tried Gemini. I'll let you guys know what What's what, at least in my, in my testing so far. So let's get let's get straight to it. Well, actually, let me answer first Some of the questions that I started to show off with. Is Gemini advanced a ChatGPT killer? No. Absolutely not.
Jordan Wilson [00:07:45]:
Not even close. At least now. You know? Obviously, things change. You know, I maybe was able to play around for an hour or so. You know, couple times, I did a 20 minute video when it first came out. I played around with it a little last night. I played around with it a little this morning. So I've had 3 different, You know, 3 different stages so far of using the new the new model from Google.
Jordan Wilson [00:08:08]:
So I don't think it's gonna be a Chia PBT killer. Anyone that's writing that on Web on, you know, Twitter or LinkedIn or wherever. I mean, they're just trying to, you know, get your clicks or to get you to sign up for something. As always, we bring facts y'all. We bring facts. So let me first hit rewind. So if you're not very familiar, a lot of different companies have their own large language models. Right? So OpenAI has, their large language model GPT 4, and it's used in ChatGPT.
Jordan Wilson [00:08:39]:
Right? Anthropic has their model, Claude 2.1. Microsoft has Copilot, which is powered by GPT 4 and other technologies. Right. So all these large language models and these big companies, you know, they're always updating them. So, Google's Google's is a little confusing, I think. So Google originally had Google barred. Alright? And Google barred was powered originally by, or by, Lambda. Lambda 2.
Jordan Wilson [00:09:08]:
Palm 2. So recently, a couple months ago, it was in Let's see. It was the day I was in the AI summit in New York City, so that would have been December 6th. So, I believe on December 6th or December 7th, Google released Gemini. So for the last, two and a half months, Google's large language model, Bard, was being powered by Gemini Pro. Alright. So a lot of, you you know, naming and buzzwords. So now, bard is dead.
Jordan Wilson [00:09:38]:
There is no more bard. Alright. Google bard does not exist. So now Google is, taking the the the naming of just the model, which is Gemini. Personally, I like it better, but it's confusing. Okay. So now It is Google Gemini, and there's 2 different flavors, at least right now, available if you're using Google Gemini on the web. So if You go on even if you type in Bard now, you're gonna be redirected to Gemini.
Jordan Wilson [00:10:01]:
But so now if you use Gemini, you have the free version, which is 3.99. It's always free. Right? And you're using the pro model. Alright? So you're using Gemini Pro, which is a teardown. And then Gemini advanced, okay, is technically, the Gemini Ultra 1 point o. So different models. So similarly, you you can think of it like this. How if you're using ChatGPT, there's 3.5, which is free, and then there's GPT 4, which is paid.
Jordan Wilson [00:10:30]:
The same thing now right now within Google. Google Gemini Pro is free. Google Gemini Ultra is now $20 a month. There is a 2 month free trial. So I don't I'm sure that's gonna stick around, but maybe not. So, if you do wanna check out, Google Gemini advanced for yourself that uses, the, the ultra model. See, it's it's it's it's so confusing. Right? Because we went from just Google Bard So now there's Google Gemini, and then there's Google Gemini Advanced.
Jordan Wilson [00:11:03]:
And Google Gemini Advanced is powered by Ultra, But normal Gemini is powered by pro. Yeah. It's a lot of back and forth. Right? Alright. But let's talk about some of the differences or some of the advantages to the new model. So to the ultra model available in the $20 a month Gemini advanced. So it is using the one point o model, Which Google says is its most capable AI model, state of the art performance, designed for highly complex tasks, and it will be available soon Coming to Gmail docs and more. Alright.
Jordan Wilson [00:11:33]:
So integrating with some
Jordan Wilson [00:11:34]:
of those other platforms, which right now it does not do. Alright. So let's just this
Jordan Wilson [00:11:39]:
is just the marketing language. Y'all know me. Like, I I always tell, hey. Here's the marketing language, and then I tell you what's really happening or at least my experience so far. So from, from Google, here's what they're saying. So they're saying Gemini advanced can be your personal tutor creating step by step instructions, sample quizzes, Or back and forth discussions tailored to your learning style. They're saying also it can help you with more advanced coding scenarios, Serving as a sounding board for ideas and helping you evaluate different coding approaches. Alright.
Jordan Wilson [00:12:08]:
And then it says, also, it can help digital creators go from idea to creation by generating fresh content, analyzing recent trends, and brainstorming, improving ways to grow their audiences. Alright. So That's the corporate speak. That's the marketing. Alright. So now I'm gonna get to 7 things that you need to know. So Now we're turning the page on the marketing, and we're getting to the facts. At least the facts as they stand today.
Jordan Wilson [00:12:33]:
And I do wanna know I do wanna know from our audience joining us, What are your thoughts so far if you've tried it, or what are your biggest questions? Alright. Again, I do have to preface this. You know, some people out there, you know, got early access, and they have a lot more information than I do. I'm very transparent. Right? I've I've played with it for an hour ish, 3 different occasions. But I use chat gbt all the time. I use large language models essentially hours every single day. You know? Anywhere from 4 to 10 hours.
Jordan Wilson [00:13:04]:
I'm using large language models. So, Frank Frank's asking and, yes, please get your your your questions or your thoughts in first, and then I'm gonna get through, some of these questions. So he says, is it true Gemini Ultra is free for 2 months? It is. Absolutely. Frank asking, as a copywriter, which is better or when would I use 1 versus The other. Yeah. So if you're talking about ChatGPT versus, Gemini, Ultra, I I will say right now, Gemini Ultra has a little bit more of a personality, Which I like, than other large language models. However and you'll see, I don't think the personality is as important.
Jordan Wilson [00:13:38]:
Right? There's a little more flair in its writing by default, What you can get to by using other large language models with a little bit of training. I do think by default, Gemini has a little bit more personality, which is fun. It makes it using it, I think, a little more enjoyable, whereas sometimes, you know, using your ChatGPTs, your, and Entropic Claude, even your copilot, it's a little dry and robotic, but that doesn't matter if there's errors. Alright? We're gonna get to that in a second. Alright. So let's let's just get straight into it now. Let's talk about the 7 things you
Jordan Wilson [00:14:09]:
need to know. So number 1, It's not available to all Workspace users right now, alright, which is a huge a huge deal. Also,
Jordan Wilson [00:14:21]:
Google makes it I'm not gonna say impossible, but you have to have a PhD in clicking around, like, Google's sphere of thousands of products to even find out if you're eligible for this new Google, Gemini advance. Right? I literally had to click in, like, probably 15 clicks deep to figure out if our workspace account was actually eligible, which it was not. So I have some, some screenshots here saying, sorry. Gemini advance isn't available for you. Gemini advance is not yet available in some countries for work accounts Or for users under a certain age. Yes. That piece is very important. Work accounts.
Jordan Wilson [00:15:01]:
Alright. So another, Another screenshot here. So if you're joining us on the podcast, I'm doing my best to describe. So a screenshot that says upgrade your personal account to Google One. It says you're currently signed into your workspace accounts to get Google One switched to your personal accounts to get more you you know, etcetera. However, I don't know for whatever reason. Maybe smaller workspace accounts. So, Google's kind of work product used to be called G Suite.
Jordan Wilson [00:15:26]:
Now it is called,
Jordan Wilson [00:15:28]:
workplace or or sorry. Workspace. Alright.
Jordan Wilson [00:15:32]:
It is impossible. I'm like, I am not joking. I had to click in 20 clicks deep To try to find out how can I upgrade to Google One? That's what you need. You need this, you know, Google One, I guess, premium, storage drive product, etcetera. Right? It takes forever, and then it's like, okay. Well, it looks like it's not available. So Google, I'm wondering why would you roll this out if not every single person can use this for their work? Right. So in all of my testing, FYI, I had to connect it to my personal Gmail account.
Jordan Wilson [00:16:02]:
It seems like if you use your personal Gmail account, you're not gonna have any trouble. If you're using a workspace account, maybe if you're a bigger account, you might have access. They haven't said. Right? At least when Microsoft Copilot 365, the more enterprise version came out. They said, hey. It's a 300 seat minimum. So if your account does not have 300 seats, You cannot access this right now. They have since dropped that.
Jordan Wilson [00:16:26]:
Google, like, can you tell us? Like, do you have to have 10? Do you have to have 50? Do You have to have 300. Can we make it easier to see if your Google Workspace account has access to Gemini? Like, literally, like, I'm a I'm a I'm a decent dork. Right? I'm a decent dork. I know my way around. I've been using, you know, Google's products for, I don't know, 10, 15 years, right, or since I've even had my own business at least for, you know, 5 ish years. It is impossible to find out if you have access to this. It looks like most workspace accounts don't, but who knows? You can't tell. You know, everyone else, OpenAI makes it easy.
Jordan Wilson [00:17:04]:
Microsoft, with their new Copilot Pro, makes it easy. Anthropic Cloud makes it easy. Perplexity makes it easy. Google, You are supposed to be the king of UI UX. Why is it so difficult to understand who has access to this new model and how to get it? It is a labyrinth. I I felt like a like a mouse in a maze trying to find a piece of cheese. Get it together, Google. Alright? Yes, Jason.
Jordan Wilson [00:17:35]:
Quote of the day, I'm a decent dork. I'm not the best of dorks. Right? Yes. And, hey, I I agree with our with our comments hear from YouTube says, I don't need this for my personal accounts. I need it for my work accounts. Same. Absolutely. Like, the whole point The whole point of not the whole point, but one of the most obvious reasons that you want to be using these large language models that are connected to the Internet and And connected to your Google Drive, to your Google Calendar, to your to your Gmail is to make your work easier.
Jordan Wilson [00:18:02]:
Right? What am I gonna do with my personal account? Okay. Here's better pancake recipes. Right? Like, come on. No. We need this for our work account. Alright. So that was number 1. Number 2.
Jordan Wilson [00:18:18]:
So Google Gemini, it's struggling with its own identity. Yeah. Identity crisis. Alright. Y'all, you have to know this. So I think even when, Chad GPT, like, had some updates, they had this problem too. So this isn't Only Google, but Google should know by now. Alright.
Jordan Wilson [00:18:35]:
So I put a prompt into Google Gemini. I said, what are the main advantages? And I'm using Gemini Advanced here, the pro, the more capable model. I said, what are the main advantages of Gemini Advanced over the normal Gemini Pro? Here's the response. Unfortunately, there's no product called Gemini Advanced. Wait. What?
Jordan Wilson [00:18:56]:
Wait. Wait. Wait. What? Yeah. Yeah. Google.
Jordan Wilson [00:19:01]:
Their new brilliant model That's you know, that they're saying according to benchmarks, and I'm gonna get to that in a second. They're saying according to benchmarks, this is the smartest model in the world. Doesn't even know what it is. I say, what is the main advantages? What are the main advantages of of Gemini advanced over the normal Gemini Pro? And Gemini Advance says, unfortunately, there's no product called Gemini Advance. But here here we go. We get hallucinations instead. It says here's a breakdown of the Gemini ecosystem and potential reasons for the confusion. And then it starts to go into Gemini exchange tiers, Which is something else, and it's talking about Gemini Active Trader.
Jordan Wilson [00:19:41]:
Right? So it's starting to talk about other products. Gemini active trader is a crypto platform.
Jordan Wilson [00:19:53]:
Like, Really? Y'all, like, I know large language models are unpredictable. You can get different results. I can put the same thing in Ten
Jordan Wilson [00:20:03]:
times and get 10 different results. I put this prompt in multiple times, got very similar results. If a large language model doesn't even know what it is, You should not release it. Do not release it. I know it's not gonna be right a 100 times. I did this test many times yesterday, got very similar results. Either it didn't know or just said, you know, it made stuff up. Like, it hallucinated here.
Jordan Wilson [00:20:27]:
The way that people are starting to use large language models as they say, hey. What are you? What what how do you work? How can I use you? Right? They're talking to them like a human being as you should. So if the model is not even aware what it is, what it can do, and if it's giving information like, oh, here. Here's information about crypto. No. Already. Huge, huge fail. Number 1, you can't not everyone can use it for their work.
Jordan Wilson [00:20:55]:
The only reason we want it. Number 2, it doesn't even know what it is, and it's already hallucinating off
Jordan Wilson [00:21:00]:
the bat. Yeah. 1st prompt, Hard hallucination. Hard hallucination. Hey. For our livestream audience, I wanna know Tara and Jason and Brian and Tonya, have
Jordan Wilson [00:21:13]:
you guys have you guys used Gemini yet? Let me know. Let me know if I'm the only one getting this. Right? Alright. So you might say, alright. Well, Jordan, it's it's just too too recent. A large language model wouldn't know that.
Jordan Wilson [00:22:24]:
So here's the big difference, y'all, And and why Google's, you know, really pushing. Right? They're saying, oh, it's real time. You have access to up to date information. Right? Because large language models have
Jordan Wilson [00:22:35]:
a knowledge cutoff date. Right? But they also have access to the Internet. So one thing that I've realized in my usage so far is that Gemini advance is terrible at using Google. Absolutely terrible. Alright? It's not querying the Internet correctly. So I asked the exact same thing of ChatGPT.
Jordan Wilson [00:23:01]:
What are the main advantages of Gemini advanced over the normal Gemini Pro? And, obviously, ChatGPT got it right. It used browse with Bing, And it says the main advantages of Gemini advanced over Gemini Pro include enhanced features for both professional and personal use, blah blah blah. You know, advanced coding and development support. Y'all like if your main competitor can teach people more about your model than your own model, there's something Wrong. Don't you have, like, the basics in your system prompt that always reminds your model who it is, what it can do, What it's capable of, the do's and the don'ts. Right? Like, ChatGPT has a system prompt. Every time you you hit enter, anytime you say anything, It has these this list of things that it tells itself, and it reminds itself. Do do this.
Jordan Wilson [00:23:48]:
Don't do this. Right? They're not system prompts Inside Google advances and not literally know who it is and what it can do? Y'all, this is wild. Alright. So we're sticking we're sticking with the with the number one. It doesn't, you know, sorry. We're going back here. So it's struggling with its own identity. So It doesn't know what's current.
Jordan Wilson [00:24:10]:
That's what I'm saying by starting with its own identity. It doesn't know anything, really. Not anything, but it doesn't know so many things. Right? Another thing. So using, Gemini Advanced, who is playing in the Super Bowl this week? Very simple answer. If I put that into Google, I obviously get the right thing. If I put it into Gemini, here's what I get. Gemini advanced again.
Jordan Wilson [00:24:33]:
So it says Super Bowl 50 8 will be played on February 11th 2024 at the Allegiant Stadium in Las Vegas. The teams playing in the Super Bowl this year have not been
Jordan Wilson [00:24:43]:
decided. Oh, interesting. That's news to the teams that in 48 hours are gonna be playing for the Super Bowl. I hope someone at Google tells those teams, hey, teams. We just decided.
Jordan Wilson [00:25:01]:
We're gonna decide in an hour after this episode's done who's playing
Jordan Wilson [00:25:04]:
in the Super Bowl. Right? Google.
Jordan Wilson [00:25:09]:
Come on. Like, love your products. I'm sure there's great power to behold in the Gemini advanced, but so much Is is trust and transparency. Right? Either either say you know it or you don't. Don't give us false information.
Jordan Wilson [00:25:29]:
At least to me, it doesn't look like like Gemini advance was really put under any QA.
Jordan Wilson [00:25:36]:
Right? Obviously, I'm sure there's tens of thousands of of the smartest people in the world Working on this product before they released it yesterday, but, like, like, what's
Jordan Wilson [00:25:46]:
going on? This is bad. Obviously, I asked the same thing and
Jordan Wilson [00:25:51]:
ChatGPT knows that the 40 niners and the Kansas City Chiefs are playing. Good thing OpenAI told them. The Chiefs and the 40 niners wouldn't have known If they were to listen to, to to Google Gemini. Alright. Another one. Ready? Facts are important. Facts are important. So in Gemini advance, who won the US primary election in New Hampshire? Primary election was a couple of weeks ago.
Jordan Wilson [00:26:18]:
Gemini advance. Reply. Elections are a complex topic with fast changing information. To make sure you have the latest and most and most accurate information, try Google search. Is anyone else just, like, scratching their heads? So I'm like, okay. So if you want me to Talk or use any information over the past, I don't know, year, 2 years. Like, what am I supposed to use Gemini advanced 4, just like US history? Like, am I supposed to use it only for things from, like, 2019? Like, you like, you can't pull up things that happened weeks ago? Like, Is there no situational awareness? Is there no time awareness? Like, what's going on? Obviously, when I asked, I forgot to include the screenshot here, but when I asked When I asked ChatGPT, the same the same thing. It obviously got it right.
Jordan Wilson [00:27:20]:
Alright. So it said,
Jordan Wilson [00:27:24]:
It said that, you know, Joe Biden won the Democrat as a write in. Donald Trump won the primary as a candidate. Right?
Jordan Wilson [00:27:33]:
Got it right. Alright. Number 3. Three things you need to, the third thing you need to know. It's struggling with real time events. That was actually the same one. Sorry. I got I got my numbers out of order.
Jordan Wilson [00:27:43]:
I always like to include little errors here. Yeah. See? I knew I did this in, in chat gbt. So, So here's here's the one in chat g b t asking, chat g b t who won the US primary election in New Hampshire. And it said in the New Hampshire primary election for 2024, former president Donald Trump won the Republican primary, and president Joe Biden won the democratic primary as a writing camp, campaign. So yeah. Number 3 is technically so we're struggling with real time events. So the examples there, Super Bowl, Gemini, unaware.
Jordan Wilson [00:28:16]:
ChatGPT knows. You you know, who won the New Hampshire primary? Google Said, hey. Go use Google search. Like, why? Isn't that the main advantage? Isn't the main advantage of Gemini that is connected to Google search, which is the best search engine by far? Isn't that the whole point? But, yeah, it failed. So it's struggling. It's struggling with those real time events. Alright. So here's here's another, another example.
Jordan Wilson [00:28:40]:
My gosh. Ready? Another example of struggling. Gemini is riding this struggle bus. Alright. So I said, what big AI announcement Did Google do in the last 24 hours? Right? And then here's Google, here's Gemini advances response. It's likely there wasn't a major headline grabbing grabbing AI announcement by Google in the last 24 hours. Here's why. It says AI developments are incremental, and then it talks about news coverage.
Jordan Wilson [00:29:11]:
Alright? Oh, gosh. Google, come on. Ask the same thing to chat gbt, and chat gbt knew. Google announced the introduction of a free AI app named Gemini, which will bring AI capabilities directly to smartphones. So, yes, it is bringing it to smartphones. More on that in a second. So, OpenAI. Got it right.
Jordan Wilson [00:29:34]:
Google isn't even aware of what happened within Google in the last 24 hours in AI even though it's using Google and AI to do it. Such a disconnect from real time, and that is supposed to be. Right? That's what everyone everyone who always says, oh, just wait until Wait until Gemini. Wait until this. You know, Google searches is so much better than every other search engine fact. So everyone said, oh, when, you know, Gemini Ultra, when the new version comes out, you know, it's gotta be a chat gbt killer because it has access to real time from Google. Well, the the 3 times I just asked there, it it about recent events. It essentially is like, no.
Jordan Wilson [00:30:14]:
We don't need Google. Go use Google yourself. What's the point? What's the point? Either just don't provide real time access Or don't provide information because half the time, we're getting half truths, hallucinations, or just like like saying, oh, like this there was no announcement. Yeah. There is a pretty big announcement. Oh, gosh. Number 4, y'all. Yeah. I like this.
Jordan Wilson [00:31:39]:
Jay Jay says Jay joining us live. Gemini, marketing ploy to use Google search, more ad revenue. Yeah. I I don't understand this. I don't understand this. Jason says sounds like I would not be using this. Yeah. I can't use it right now, If I'm being honest.
Jordan Wilson [00:31:55]:
Like I said, there's some advantages. It has some personality. I did some live testing yesterday. It did really well at coding. Right? But ChatGPT did just as just as good. So, you know, I don't know. I haven't found a use for it yet. I hope I hope to prove wrong, like, I hope to be proved wrong because Google obviously is the best search engine.
Jordan Wilson [00:32:16]:
Why can't we bring that power to Gemini? It it feels like if anything, Gemini is actually being crippled by its integration to Google search because it's causing it to hallucinate. I'm being honest. Maybe Google should be working with browse with Bing. It should be using, I don't know. Like, why is it not working? I know that's and harsh, but why is it not working? Alright. Number 4, it's struggling with some reasoning and logic. Alright. Google advances.
Jordan Wilson [00:32:46]:
Here's a simple example. Right? Very simple example. So I'm saying, please write me 3 short jokes That start with the word what
Jordan Wilson [00:32:56]:
and end with the word blue. Simple enough. Right? So,
Jordan Wilson [00:33:04]:
Gemini advanced got only one of them right. So it said, what did the ocean say to the beach? Nothing. It just waved and looked a little blue. Actually, kind of funny. Right? But the other 2, it started with what, ended with cheese, Then it said the other one was what and ended with blueberry. So it got, like, 1a half out of 3. Right? The ability right. So the tokenization process without getting too dokey too too dorky, the tokenization process and how large language models understand words is one of the most important things there is.
Jordan Wilson [00:33:39]:
Right? Because that controls hallucinations. So some of the most important things when working with a large language model Is does it have accurate information to up to date events? That's number number 1. Number 2, is it properly understanding words? Right. Those are 2 of the most important things. You you know, memory's important as well, but hey. Does it is it aware of what it is, what it does, and what's going on in the world? Number 1. It doesn't even understand the words that are going in. Those are important things.
Jordan Wilson [00:34:06]:
So, clearly, some problems here from Gemini. So, hey. Give me 3 jokes. Start with what, end with blue. Got a 1.5 out of 3. Not that good. Same exact thing in chat g p t. Got it right.
Jordan Wilson [00:34:22]:
Joke 1, what blue? Joke 2, what blue? Joke 3, what blue? Although hey. If if I do have to be honest though, Chat TBT didn't actually give me the full joke. It just gave me the punch line Or just give me the the setup. So it says, what's orange and sounds like a parrot that turns red yellow and then finally blue. So it started with what And it ended with blue, but is it a joke if if we don't get the answer? So maybe they both failed in this regard, but, Yeah. Different different kinds of failures. But now I'm now I'm really curious. Does anyone know the answers to these jokes? You know, what flies up high, Wings at the sky and changes colors from green to blue.
Jordan Wilson [00:35:04]:
I have no clue what that could be. So Chatt GPT technically failed there as well. Alright. Thing number 7 to know. Gemini applies a digital wall watermark to images it creates with its imagine to image model. Alright? This one's important. We reviewed, Imagine the other day on our YouTube channel, hours after it came out. The model itself is okay.
Jordan Wilson [00:35:29]:
Right? It's okay. It's not, you know, if you're just comparing it to DALL E, which is what, You know, OpenAI's image model, which is available in ChatGPT and all the Microsoft products. The, the new Google image model is not where DALL E is yet, and, obviously, both of those are very far behind midjourney. But, I do like this. This is a good move. This is a positive move, from Google. I like this. It's not all bad.
Jordan Wilson [00:35:56]:
I'm not just bashing Google this whole time. Right? So, I love the, The move here from Google to apply a digital watermark to images. Eventually, it it seems like a lot of the big companies are trying to get on the same page about AI images, AI videos, deep fake misinformation. One of the ways to do this is with, invisible watermarks. You know, Meta, I think has been great in this space, you know, trying to, develop systems, working with other big companies to be able to identify when images Are, AI generated. So if people are posting things on social media that are fake, right, that are generated with, AI, it will say so. I think the big the big player is obviously mid journey. We had to figure out because if I'm being honest, I've seen nothing in DALL E 3 that looks real.
Jordan Wilson [00:36:43]:
I've seen nothing with Google's imagine that looks real. Some other models, you know, when you talk about, you know, stable diffusion or Leonardo, they're they're a little better Then then DALL E and imagine too, but nothing is near where MidJourney is. So, until, You know, all the big social social networks and and mid journey can get a process that works out with watermarking these images. Everything else is just small steps in the right direction. But regardless, good move from Google and Gemini there. Alright. Number 6. So Google released a new Gemini app or Android, dedicated Gemini app, and now, there is Gemini support in the Google Ios app.
Jordan Wilson [00:37:25]:
That's great. I love it. Here's why. Even as little use case that I found so far Out of Gemini advanced with the new Gemini Ultra model, I would still if I'm on the go, I would still rather use if if My choices were okay. I can use, Gemini via the Google app on my iPhone, or I can use Siri. I'm gonna use Google Gemini. Right? Our smart assistants, unfortunately, right now, are so dumb. Alexa, Siri, etcetera.
Jordan Wilson [00:38:02]:
So that's good. I like Google, bringing this, the the Gemini model to the phone.
Jordan Wilson [00:38:08]:
Now you just have to make it work. You gotta make
Jordan Wilson [00:38:10]:
it work. You gotta make it aware of of who it is and make sure it can actually properly integrate with Google search, alright, and real time information. Alright. Here's our last 1, y'all. And if you have questions from our live audience, get it in. Hey. And did you guys know I mean, I should say this. This is a live unedited podcast.
Jordan Wilson [00:38:28]:
That's why I sometimes say this is the realest thing in artificial intelligence. Right? We we come to you live. We bring this live. We do things live. We bring facts. We bring receipts. So I hope you all enjoy this. Alright? But also sorry sometimes because I go on rants that otherwise we'd edit out.
Jordan Wilson [00:38:44]:
Alright. So here we go. Number 7, Gemini Ultra outperformed GPT 4 on many benchmarks before its public release. Yes. So this is a fact, but also How I wanted to end the show. Alright. Because when Google first released, Gemini Pro, which is now the Model powering the free option of Gemini that came out with, you know, bunch of reports benchmarking these models, against, you know, the biggest names out there. So against, essentially, you know, GPT 3.5 from OpenAI And GPT 4 from OpenAI.
Jordan Wilson [00:39:25]:
Right? Those are the most powerful models, and, you know, you had other models on there as well, other great models. However, One thing that is important to note is at the time they were saying and showing That, oh, Gemini Ultra is outperforming everyone. Right? In the in the, essentially, there's one important Test. Right? One important benchmark called the MMLU, which is the massive multitask language understanding. Okay. So what that is is, you know, if if you follow large language models closely, this is, what at least the experts who are much smarter than me argue is the best benchmark to see how truly capable a large language model is. They say this is the one that is closest to the to the ability, of of, like, human understanding, human understanding or human reasoning. So according to Google, Gemini Ultra outperformed every model on the MMLU, even GPT 4.
Jordan Wilson [00:40:29]:
However and I went into I did a whole 1 hour episode on this because I think when Google first rolled out, Gemini Pro, which at the time was just Google barred, but it was being powered by Gemini Pro, Gemini Pro. They they had this marketing video that was, A lot of people just said it was shady. A lot of people said it was false. I said it it was definitely misleading. Right? But I think Google's a risk, initial rollout, has been abysmal of Gemini. So when it came out in December, you know, they they showed all of these, this marketing video, and then everyone's like, wait. This is not actually how the model works. Right? They made it seem Like, you could talk with Gemini, and it could see and do all these things in real time.
Jordan Wilson [00:41:18]:
Right? Like this quote, unquote model, It seemed like was interacting. It could see and and talk and reason in real time like a human. Right? Like, watch, talk. It wasn't. Right? Google gave it very detailed prompts, quote, unquote, behind the scenes, and it was just all kind of a Video marketing ploy, I guess. Anyways, getting back to the benchmarks. Earlier, when these benchmarks were first released and Google said, hey. Our new Gemini Ultra model is is so far ahead of everyone else.
Jordan Wilson [00:41:45]:
Well, I don't think it was apples to apples comparisons. Right? This was just their own kind of internal benchmarking, and the general public didn't have access to the model. But now guess what? Now the general public does. So I would expect whether it's in the next in the coming weeks, I would I would assume that we see some updated, benchmarks to see just truly how powerful Gemini advance is with the new Gemini Ultra 1 point o model. And, again, I'm not gonna make any assumptions, but I'm guessing it's gonna be a little different. At least my firsthand experience, This is not a ChatGPT killer. This is not something that at least right now, I'm gonna you know, I signed up for the free 2 month So I'm gonna continue to try it out. Right now, I don't have any use case for this.
Jordan Wilson [00:42:34]:
I don't. Right? Even if this was free Right now, I don't have any use case. I'm gonna keep trying. I assume that this new model is going to improve, But I I I I don't see a use case for it right now. Alright? It it's it's terrible at using Google search. It doesn't do well with real time events, at least in my limited testing. Right? It doesn't even know what it is. Right? Yes.
Jordan Wilson [00:43:00]:
It's it's good at coding. It's it's fast. It has a little bit of personality, which I like, but I'm pretty good at ChatGPT. I can do all those things very well in ChatGPT. ChatGPT has outside plug ins. Right now, you can't even use Google for work. You can't even use the Google Gemini product for most. If you have a Google Workspace account, good luck.
Jordan Wilson [00:43:21]:
If you found your way out of the the the the maze, let all of us know how you did. But right now, we can't use it for work. It doesn't even know who it is. It doesn't understand real time events. So what is the use? I don't know y'all, but I'll I'll continue to try it out. I'll continue to keep y'all in the loop. That's it, Y'all, I hope this was helpful. Thank you for tuning in.
Jordan Wilson [00:43:45]:
Make sure to go to your everyday AI.com. Sign up for the free daily newsletter. We're gonna be breaking down today's in in more detail and depth as we always do. Thanks for tuning in. We'll see you back for more everyday AI. Thanks, y'all.