字幕列表 影片播放
[MUSIC]
SATYA NADELLA: Hi, everyone.
Welcome. It's great
to see all of you in Seattle in person.
We have an unbelievable show.
I see Scott Guthrie even wore his red shirt.
We welcome you to
the Azure Kubernetes Container Service 2023 launch.
No, don't worry. We'll have some fun.
Scott is not coming up to show you
code onscreen anytime soon,
but look, it's an exciting time in tech.
The broad contours of
this next platform are
just getting clearer and clearer each day.
The advances, what's possible.
That's what obviously excites us in our industry.
But they're also grounded
in what's happening in the broader world.
There's no question. There are
enormous challenges out there.
In fact, it reminds me and I've spoken about
this before of the very founding of Microsoft in 1975.
In fact, when the popular electronics cover
came out with the Altair,
which of course our founders
picked up and ran with it and
created essentially what's the software industry
as we know of it today.
That same week,
Newsweek had a cover with President Carter
trying to fight off
the three-headed monster of inflation,
recession, and an energy crisis.
Today you'd have something similar.
You will have AI on one cover and
then we'll have those three challenges
plus for good measure,
we can add a few more.
We, as Microsoft,
we as the tech industry have to really
ground ourselves in how do we relate one to the other?
In other words, can we use technology to overcome
the challenges that people and
organizations and countries face?
That's really the pursuit here.
In that context, I would say,
I just want to share a couple of
anecdotes which gives me great hope.
Quite honestly, it gives me personally a lot of
satisfaction around working at Microsoft,
working in this industry to
push the state of the art of technology.
The first one, obviously when Sam and
his team late last year launched ChatGPT,
that's the only thing anybody your friends and
family wanted to talk about throughout the holidays.
It was just crazy house.
It was like the Mosaic moment.
The closest we've come.
It's been 30 years now since when Mosaic launch,
which I distinctly remember.
It was very exciting time.
I went on a holiday first and then I was in
the first week of January I was in India.
On Jan 1st,
I look at my news feed and I see
this tweet that Andrej Karpathy put out.
Who is our ex OpenAI and Tesla,
and is now an independent AI developer.
He had this thing about the product that really he
was most excited about the previous year was
GitHub Copilot and he was saying how 80 percent of
his code was being generated by this,
I would say first at
scale product built on NLM technology.
This doesn't mean he's
80 percent somehow not doing his work.
In fact, he's getting so much more leverage.
In fact, recently we crossed
100 million developers on GitHub.
Think about this. There are
100 million developers on GitHub.
If we can improve their productivity,
just like how Andrej was able to observe.
Then let's say in the next decade we double that number,
may be double it again.
We get close to a half a billion developers,
what economic opportunity it would create.
Because there is not a meeting that I
go to today with any CEO,
CXO of any organization who's not
looking for more software developers,
more digital skills.
That's the currency in every sector of the economy,
in every country of the world.
That's the opportunity we
have to be able to take
this technology and make a difference.
But then the next day I went
to Mumbai and then I saw this demo.
This was just for me,
the most profound thing that I've seen in a long time.
The demo was actually built by
the Ministry of Electronics in
India because they are building
out a digital public goods.
Their idea is look, India has got
multiple official languages and they wanted to
democratize essentially access to language translation.
They're building it out as a digital public good.
In fact, Microsoft and Azure and
Microsoft Research are all involved in that project.
This is basically speech-to-text,
text-to-speech across all of the languages in India.
They showed me this demo where
a farmer speaking in
Hindi expresses a pretty complex thought
about how he had heard
about some government program and wants to
apply for a subsidy that he thinks he's eligible for.
It's a pretty complex prompt query.
There's technology,
that's a good job.
It goes to the bot, recognizes the speech,
comes back and says, you know what?
You should go to this portal,
fill out these forms,
then you'll get your subsidy.
He says, look, I'm not going to go into any portal.
I'm not going to fill out any forms,
can you help me? He does it.
Then I was told that a developer said, you know what?
That is Daisy Ching a model that was
trained on all of the documents of
the government of India using
GPT with this speech recognition software.
Basically two models coming together to really
help a rural farmer in India
trying to get access to a government program.
Look, I grew up in India.
I dreamt every day that someday
the industrial revolution will get evenly
distributed across the world.
Here I was,
seeing something so profound,
something that is developed by the folks at
OpenAI in the West Coast of
the United States a few months earlier,
used by a developer locally
to have an impact on a rural farmer.
That to me is what gives
me meaning and I think gives
us all meaning in our industry.
It is just fantastic to see that.
Now of course we've got to scale it and scale
it with a real understanding that we can break things.
It's about being also clear-eyed about
the unintended consequences of any new technology.
In fact, that's why way back in
2016 is when we came out with the AI principles.
You have both we as Microsoft and our partners at OpenAI,
deeply care about this.
In fact, the entire genesis of
OpenAI is from that foundation.
We built these principles,
but we've not just put those principles as a document,
but we've been practicing it because that's the only way
technology gets better in this particular case.
When you're talking about AI, it's about alignment with
human preferences and societal norms
and you're not going to do that in a lab.
You have to do that out there in the world.
It starts by the way with design decision ones makes.
When you think about AI, you
can have the human in the loop,
you can hear how the human on
the loop or you can have the human out of the loop.
Those are decisions we as product makers build,
build into the products.
Whenever we have come up with
some new things and new models,
we in fact put a premium on human agency.
When you think about these generative AI models,
remember one thing, they are prompted to do things.
The prompting comes from a human being.
We get to in fact help them prompt.
We get them to take the draft that gets generated.
They should review the draft.
They get to review the draft,
they get to approve the draft using judgment.
We want to give them as many perspectives,
as many ways to regenerate.
We want to take even just the design of AI products as
a first-class construct and build that into our products.
But that's not sufficient, we realize that.
You have to write at
the pre-training stage when it comes to the data,
the pre-training itself, the model itself has to be safe.
The safety system around the model,
the application context, all will matter.
We absolutely, and you'll see that today,
we take all of that as
first-class things which we want to reduce,
not just to principles but to engineering practice.
Such that we can build AI
that's more aligned with human values,
more aligned with what our preferences are,
both individually and as a society.
What is it that we should do and what should we build?
I think that this technology is going to reshape
pretty much every software category.
We know, we've seen that and we've solved.
If you think about the web, we've had what?
Three at least very distinct platform shifts
that have shaped the web.
The web was born on the PC and
the server and then it evolved with mobile and Cloud.
Now the question is,
how is AI going to reshape the web?
Each time in each one of those phases,
some real foundational technology layers,
sometimes I describe them as
these organizational layers of technology, were born.
The browser was the first one.
Without the browser, the web would have not
been as popular as it is today.
Same thing with search.
Search organized the web.
Then when it came to the mobile
generation and mobile and Cloud,
in fact these super apps,
especially outside of the United States around messaging,
became the way people consume the web.
We also had app stores.
The question is, what are the constructs?
What are the organizing layers going forward?
You'll see some of that today.
We think there are two things that are emerging.
One is this conversational intelligent agents.
I think they're going to be things that
we're going to have everywhere we go.
All computer interaction is
going to be mediated with an agent helping you.
In fact, we're going to have this notion of a
co-pilot that's going to be there
across every application canvas inside of
an operating system shell in a browser.
We want to show you some of this innovation starting with
how it's going to reshape
the largest software category on planet Earth,
which I've been working on for a long time,
which we're very, very excited about, Search.
It's a new day in Search.
It's a new paradigm for search.
Rapid innovation is going to come.
In fact, a race starts
today in terms of
what you can expect and we're going to move.
We're going to move fast.
For us every day,
we want to bring out new things.
Most importantly, we want to have
a lot of fun innovating
again in Search because it's high time.
With that, let me turn it over to Yusuf.
YUSUF MEHDI: Thanks, Satya. It's great
to be here with all of you today.
We've been working on something we think is pretty
special and we're just eager to share it with you.
In my time at the company,
I've been lucky to witness a few important moments.
I think this might be another one of those.
We believe as Satya said that we can improve
the way billions of people can benefit from the Internet.
Created by really an amazing team across Microsoft,
from the core web engineering folks in Michael's group,
to the brilliant folks at Microsoft Research,
to the tireless people that work on
the Azure AI supercomputer,
we think of it humbly as
the next-generation of search and browsing.
Infused with AI and
assembled as an integrated experience,
we're going to re-imagine the search engine,
the web browser, and new chat experiences
into something we think of as your copilot for the web.
Now, copilot is a critical word because we
believe in the empowering nature of AI in which you,
the individual, are in charge.
Now what it is, is not even as
important as what it represents.
That is for us an aspiration
to unlock the joy of discovery,
the wonder of creation,
and that feeling of empowerment from
being able to harness the world's knowledge.
At the center of this new co-pilot experience is
an all new Bing search engine and Edge web browser,
and it's going to do four things for you.
First, it's a better search.
It's the search you know and love but
it's better because it's AI powered.
Second, not only does it give you the search results,
but it will actually answer your questions.
Third, we're going to make it incredibly easy to use.
We're going to let you chat, we're going to
let you just talk to it naturally.
Last, when you need that spark of creativity,
Bing can generate content for
you automatically to help you get started.
Let's talk a little bit about the opportunity
in search and why
we believe we're at the start of the next generation.
No. Sorry about that, we're on to the next generation.
All seriousness, with
over 10 billion queries a day,
we all know the search engine is an incredible tool.
We do. Yet as the web has grown,
we've run into challenges.
People are overwhelmed increasingly with
too many links when they're
trying to find simple answers.
But 40 percent of the time people click on search links,
they click back immediately.
That's a sign they're not finding what they want.
Most notably, we have to adapt to
search versus the other way around.
It turns out search works better
if you give it fewer keywords.
You might be surprised to know that 75 percent of
all searches are three key words or less.
But come on, we shouldn't be surprised.
Search has remained fundamentally
the same since the last major inflection.
When we moved from a directory approach to
search to an algorithmic approach to search.
The user experience and the approach
underlying are essentially the same as 20 years ago.
Few keywords and you get millions of links.
But as the world around us has changed,
the way people use search has changed, we all know this.
People are trying to use search to do
more than it was designed to do.
To illustrate that let's just look at a breakdown
of search queries by type to understand this point.
Search querries today you can bucket them
into serve three types of categories.
The first are what we call navigational.
This is people searching for a website.
When you need to renew your driver's license,
you need to go to the DMV.
When you want to tell your friend
about that great pizza place,
you send them the WebLink.
That's what search was designed to do,
and it does it super well.
The second, third of categories
are what we call informational.
These are things like what's the weather forecast?
What was last night sports scores?
What's today's stock price?
Search works great for that.
But for anything more complex, like hey,
I got to pick up this love seat at the store,
is it going to fit in the back of my car?
Search is going to fall short on that.
Then the final third,
we're back into what we call everything else.
These are typically more deep research in nature.
This is things like trip planning or complex shopping.
An example query here that people are putting in is, hey,
can you recommend a five-day itinerary to Mexico City?
For these, search falls far short of the desired goal.
What this means if you just do the simple math,
it means that roughly half of
all searches aren't delivering the job that people want.
If you go on that 10 million queries,
it means that every second,
50,000 people searches potentially go unanswered.
This is why we believe
it's time for a new approach in search.
For the billions of queries that are going unanswered,
we've seen new attempts to try address the problem.
As you all know, there are vertical search attempts.
Amazon has done a better job for shopping.
YouTube is great for video.
Reddit is a great place to come get advice.
The benefits of search are well-known.
It's fast, it's timely,
and there's a great business model.
Then more recently they've been another vector,
more disruptive ideas like leveraging AI to
answer questions directly and to generate content.
These are amazing as well,
and they show what's possible.
But what if you could get the two to come together?
Not only would you get two things in one,
but we think you could actually solve problems in each.
We think you could get to something that is
really one plus one equals three.
We've done that with the new Bing.
I want to share with you
four technical breakthroughs the Team
has achieved to make this come to life.
First, through our fantastic partnership
with SAM and the brilliant team at OpenAI,
I'm excited to announce that Bing is running on
a new next-generation large language model.
One that is much more powerful than ChatGPT,
and one that is customized specifically for search.
It's unlike anything you've had a chance to play
with and we can't wait for you to try it.
Second, we've developed a proprietary way of working
with OpenAI that allows us to best leverage the power.
We call this collection of capabilities and
techniques the Prometheus model.
The core idea here is that both
at training and at runtime,
we engage with the OpenAI model more
intelligently through our knowledge of the web,
via the Bing index and some special query techniques.
We're going to dig into this a little bit more later,
but the benefits are the following.
First, we can improve
the relevancy of answers by feeding in
and better tuning queries given
our understanding of the web search index.
Next, we can annotate
the answers with specific web links and citations.
We can get you more up-to-date
information because search crawls the web every day.
We can improve understanding of geolocation.
Finally, we can increase
the safety of the answers as well by catching
queries at initiation and then
checking that again at the delivery of an answer.
Next, we've been making
steady improvements on the Bing algorithms for years.
We test these with independent judges.
The shows that are search experience is on par or
better than any search experience
when you take away the brands.
But a few weeks ago, something special happened.
We applied the AI model to
our core search ranking engine and we
saw the largest jump in relevance in two decades.
We believe we can continue to drive
breakthroughs as we improve the models.
Then finally, we are re-imagining how you
interact with all of these capabilities across search,
browser, and chat by
pulling them into a unified experience.
I want you to think about search
coming together with answers,
search coming together with chat,
and search coming together with the browser.
As we all know and as the folks at OpenAI taught us,
the user experience is as important as
the underlying technical platform. Enough talk.
You guys ready to see it in action?
Let's show you how we're going to enable
the Copilot for the web.
Now before I do, I want to call
out two things for clarity.
First, because there's so much I'm going to show you
until you don't have to watch me type every search.
I recorded these searches live just yesterday.
Second, in case there are skeptics in the room,
you're going to get a chance to play with it directly,
put your hands on it and type the same queries as well
as your own favorites right after this presentation.
You're all familiar with search,
so I'm not going to show you a search.
I'm going to share what you can't do with today's search.
I'm going to focus on answers,
chat, and the ability to create.
But first let me introduce you to the new Bing homepage.
You're going to notice some subtle but important changes.
First, we have an expanded search box
capable of accepting up to 1,000 characters.
Because now Bing works with natural language.
You saw a little hint to chat,
which I'm going to get back to in a second.
Now want to set up the first search scenario.
My daughter and I, we both love art.
She studied art at school and I'd like to stay
connected with her on our mutual passion.
Last semester, she was learning about Mexican painters.
I'd like to get a quick summary of
the most influential Mexican painters
and the works to learn a bit more about the topic.
If I type the full query of what I'd like to
know in today's search, here's what I get.
I'll just type in, compare
the most influential Mexican artists
and their top paintings,
and you'll get what you expect. In some links.
It's fine, but we can do better.
Let's try this now in the new Bing.
What you'll see as we pull up, is first,
you see the web results here on the left,
but then on the right, you start to see
how we start to compile the answer.
What you get here now is we have
the ability to highlight these web links.
We can annotate the results.
That's because we're able to go in and
apply our index onto the answers there.
In other words, the answers and
the search on one page has
saved me a huge amount of time.
This gives you a little bit of a
sense of what you can do.
Now you've seen some of this before,
you might say, Hey, I've seen some of these.
Let's show you how we can do some additional things.
I'm going to show you another query.
Where we use the timeliness of search.
Let's go ahead and ask about events in
Scottsdale during the Super Bowl.
What you'll see is we get back and answer here,
where we have events and we're
able to do that because Bing cross web.
Notice how we can find not only
that the Super Bowl was played in Glendale on the 12th,
but then there are events like
Cardi B Super Bowl party that's
on the 10th also shows up.
We were able to pull these things together.
You start to get a sense of how we can build on what's
today with the Bing index.
Now I'm going to show you a few
more of these types of answers quickly,
so you can get a sense of the power,
and the time savings from Bing.
When I'm running an errand,
like the example I gave to you earlier,
I can ask Bing to determine if
that new loveseat from IKEA is
going to fit in the back of my Honda Odyssey.
What you'll see is they can
actually find the dimensions of the loveseat,
the interior space of the car,
and then make an estimation as to whether it will fit.
In this case, I'm choosing
an example of where Bing does not know the answer.
We know that we can't be definitive about it.
The reason I'm doing that is because we know we
won't be able to answer every question, every time.
But Bing can still provide
some helpful information as you can see on this answer.
We also know we'll make our share of mistakes,
and Bing rule this out.
We've added a quick feedback button
at the top of every search,
so that you can give us feedback and we can learn.
Another example, when I'm shopping,
I can ask Bing to search, find,
and compare the top three selling pet vacuums
listing the pros and cons.
Look how great this answer is.
It has all three of the products I'm looking for,
super-helpful pros and cons. Stop and think.
If you had to compile that,
how much time that would take you to do?
As you can see at the top of the page,
we still have the advertising in this example,
because we know when people are
shopping, those ads are helpful.
Finally on this one, if I'm cooking,
and I realize I've forgotten
a key ingredient, in this case, e.g.
eggs for my cake recipe,
Bing can not only find the egg substitutes,
it can get me the exact amount for each ingredient.
Take a look at this.
I love this. You can actually see e.g.
if you go with vinegar and baking soda,
the cake is lighter and more fluffy.
These are just little helpful tips that
everyday help make your life a little better.
These are just some examples.
Then you can start to get a sense of how, with answers.
We go far beyond what you can do with search day.
We can actually help you get what you want to get done.
Now let me tell you about how
Bing goes further to help you with
particularly complex questions for
which there is not a precise answer.
I wanted to introduce to you
the new chat experience in Bing.
I think of this as search with your own personal helper,
to help you refine your query until
you get exactly what you're looking for.
This comes in handy for activities
like trip planning and shopping research.
Let's start with shopping. I'm going
to look for 65 inch TV.
Again, you see ads at the top,
the result of the links on the left,
and the answer is here on the right.
You can pick whichever you'd like.
We'll give you a good set of answers.
But now I want to refine this query,
so I can do that by going into chat.
Now I can either swipe up with
my fingers or look up here at the top of the screen.
We have now a new chat scope.
With that, with one click,
you are now into chat.
Look how beautiful that is.
Searched to check, just so seamless.
Now we take away all of
the content that was in your place,
and you focus on your query.
The search box you can see,
now that can take up to 2.000 characters.
You can just talk to it. You can just ask for it.
In this case, let's say I'm going to
ask for gaming optimized TV.
All I have to say is which of these are best for gaming?
We remember all of the context.
We know that we're talking about flat screens,
we know we're talking about 65-inch TV's.
Look how Bing start to come back.
It does all the queries on
my behalf and comes back with a great answer.
I just want to highlight a few things for you.
Since we know you're asking about gaming TV's,
we pull out, this one has
a game optimizer, this has Game Mode.
We make that really helpful. I'm on a budget.
I'll ask it to adjust it for,
which one of these is the cheapest.
Again, Bing knows the context,
and it just goes in and refines the queries.
Easy, you just talk to it,
and you can refine your shopping experience.
Again, we find the prices here.
I didn't know you could get a flat-screen for under $500,
but that's a good deal there on
Bing if you're looking for TV.
We think that's going to make shopping
easier. Let's talk about travel.
Before I jump in, I want to
just have you remind yourself,
when you're going to plan a trip to a foreign country,
think about all the things you go through.
Travel times, what sites do I want to see,
regulations to observe, budget.
Our research shows on Bing,
people spend on average weeks
to even months to plan a trip,
and to use our organizational tools.
I'm going to show you how we make that so
much easier with the new Bing.
I'm traveling to Mexico for
my cousin's wedding, and with the new Bing,
I now don't have to start
with something that's dumbed down,
like Mexico City travel tips.
I can ask for what I want.
First, let me just compare that against
what you get in today's search engines.
I'll type in this long query of what I really want,
and you get what you expect.
Links to go try to find the answer for yourself.
But we can do much better.
Let's try it in the new Bing.
I'll put in the long query,
which is essentially create an itinerary for
a five-day trip to Mexico City for me and my family.
Just like that, Bing Goes to work.
Just take a look at how it starts to compile.
Starts with Day 1, and we put it in there.
Look, arrive in Mexico City,
check into your hotel,
go check out maybe de Bella de Artes, have some lunch.
Then there's Day 2.
You see, isn't this just
so much better as a starting point?
Look, if you want to learn more, if you like, hey,
I don't love these five days, no problem.
Down there, we have links where
you can go and learn more.
We put in some nice touches in there as well.
Again, now, let's say business travel changes.
Oh, I only have a three day.
I don't have to go back out there and figure out.
I just say, Hey, make this a three-day trip,
and Bing reflows
that recommendation into a three-day trip.
Now let's just have some fun.
Let's say, Okay, yeah, I'm still trip planning.
We'd like to shop, where can I shop?
You get some shopping recommendations.
Like to go out at night. Make the most of the trip.
Where's the nightlife?
You get a list of nightlife.
You see this is just so much better than
today's search to start for your travel planning.
Let me show you a final example of chat and how I think
the new language models are going to maybe
help bring the world a little bit closer together.
Understanding different cultures is often
done through the arts like music and literature.
I've been fascinated by Japanese traditions.
[inaudible] share with me one of
his favorite searches from a while back on poetry.
With that, I'm going to use that as inspiration.
Without a clear idea of what I want necessarily,
I'm just going to type in a simple prompt,
top Japanese poets and
Bing starts to respond with a nice list.
You see this, it does a great job of
mixing the Japanese language and
the English language and it
knows since I'm Korean and English to do that.
Right away, I learn about this poet, Matsuo Basho.
It turns out he is one
of the greatest haiku masters of the world.
I love how it now enlists his name,
but we go ahead and we give you one of his famous haikus
in Japanese and we auto-translate it in English.
Great. I can say, I want to learn
a little bit more about Matsuo.
I can say, tell me a little bit more.
It'll give you another jumping-off point, that's great.
Then I can say, hey, tell me about another haiku.
Now, what I want you to reflect on is look how
easy this is to discover something new.
Normally, I might not have done this.
I might not have gone to learn about something
new in the world through search
because it's cumbersome to
click on links and have to deal with foreign languages.
But this is what we mean
by unlocking the joy of discovery.
Finally, when what you are searching for
doesn't exist and you need that spark of creativity,
bank and generate the content to help you get started.
We just finished planning that trip to Mexico and now
what I would like to do is I'd like to
share that information with my family.
They're all over the world but I can simply ask Bing,
hey, write an email,
sharing this itinerary that I've
researched and put it
into a thing for me to be able to send to my family.
Notice here how the emails starts, it's a great email.
Personalized touches, has the trip highlights,
it'll close here with a nice heartfelt message
, it's just a grip.
It just saves a bunch of time on your everyday work.
Now, for my family,
English isn't necessarily always
the first language so I can
ask Bing to just translate that in Spanish.
With a simple request, just basically say, hey,
translate that to Spanish,
Bing knows to take that entire email and
itinerary and convert that into Spanish.
In fact, Bing can translate
automatically over 100 languages.
My sister, I'm going to admit to you,
she's a better Spanish writer than I am,
but I might just impress her with this one here.
Even if I don't, I at least I saved
myself a lot of time from having to type the mail.
Our focus with Bing is to help generate
content and inspiration that
helps you with your daily life.
I'm going to give you a couple of more examples.
Let's show you one here.
I want to create a weekly meal-plan for my family of
four that has vegetarian options
and caters to those who don't like nuts.
Since I'm not the healthiest of eater,
I'm going to admit to you here on occasion,
this has given me a great list day-by-day,
breakfast, lunch, and dinner with all of the ingredients.
So much easier to help you
start to create a meal plan that says,
hey, let me get to a healthier meal plan.
Now, there's some things
in there I learned when I did this query,
which is chia seeds, which I don't normally eat.
That sparked another idea to show you.
Let's say I wanted to get this grocery list
by grocery section.
All I can do is say give me
that grocery list by groceries section.
Bing now takes that exact menu for the week and puts
all of the ingredients I
need by groceries section so when I go shopping,
now can be super efficient.
You see all of the ingredients there.
It's a great help.
Then finally, if you're looking for
family activities and you're struggling for ideas,
Bing can help you with fun things
like spontaneous trivia games.
My family and I were into music so I
asked Bing to create a '90s music trivia game.
Just look how great some of these questions
are and the answer so just creates that game for you.
We're having a little debate backstage
about one of the questions that comes up here,
which is, who wrote the hips song, Jump Around?
Is it Kris Kross or House of Pain?
I call it out because it shows you how clever it is.
Because for those of you who know your '90s hip-hop,
Kris Kross wrote a song called Jump,
but House of Pain was the one that wrote Jump Around.
That's how clever Bing is.
Bing is going to help you look great with
your family and have a bunch of fun.
Let's summarize where we are so far.
You've seen better search.
You've seen complete answers.
You've seen an incredible new chat experience and
the ability to spark your creativity all with Bing,
your AI powered cop-out for the web.
But what if that's still too much work?
What if you could have that co-pilot right alongside you?
Is there at the ready at anytime you want it,
aware of the context in which you're in.
What if you could get that co-pilot
on the 1.4 billion Windows PCs,
on the most used application on the PC, the browser?
Where you don't have to wonder, Team has created it.
I want to introduce you to your AI
powered co-pilot in Edge.
We've just updated Edge with
a new look and feel and new AI capabilities.
As you can see here, it's sleeker,
it's lighter and you're going to notice now that we've
integrated Bing in a really cool new way.
Let me show you.
Here I am on the Gap website.
I'm browsing around with my new Edge browser.
I want to read Gap's quarterly report.
I can navigate down to their earnings,
Click on "Q3",
and up comes the 15 page Gap PDF. It's pretty long.
I won't have time to read all that.
What I'd love is a summary of the key points.
I want to show you how now with the power of
Bing's AI capabilities within Edge, we can help.
With one click, I can open up the sidebar.
Now as you can see at the top of the Window,
we have two features, we have chat and compose.
Let me show how chat works.
I can use chat in Edge to simply
ask it to give me the key takeaways of the page I'm on.
I'll just say, key takeaways from the page,
and Bing and AI can now read that PDF and look how
great it comes up
with the summary of the key points here,
their earnings, the fact,
it's going to reaffirm full year guidance.
Very cool, a massive time savings.
But now I want to compare this with say,
Lululemon who also has their third quarter earnings.
Bing can now call out to the web,
pull information from outside of this page,
bring it into Edge,
compare it with the information that's on
this page all within Edge,
and I asked it to it in a table and
look how amazing this is.
Just like that in one table,
I can get an answer this question.
Think about how much time that
would have taken otherwise.
Let me try if we can take it one step further.
A top use case we've learned from our friends at
OpenAI is that developers are
really being more productive with ChatGPT.
Here we are on a Stack Overflow website discussion board
to learn a little bit about programming.
In this case, I'm researching
tips on how to parse the JSON file.
As we read through,
we find this great little code snippet and I'm like, oh,
it's fantastic except it's in
Python and we need it in REST.
All we need to do is highlight that text,
have it automatically copied over into the Edge sidebar,
and now, Bing
inside of Edge says, what do you want to do?
We'll say, hey, rewrite this code in REST.
With that simple command,
bin can go and take that code and rewrite
that automatically in the new programming language.
This is amazing. GitHub Copilot
has been a huge boost in developer productivity.
Imagine what the co-pilot can do for
people everywhere on any page.
One final thing to show you.
Not only can you better consume information,
but you can better create.
After our big announcement,
I'm going to want to write a LinkedIn post, let's say.
I'll just click "Create" on
the post and up
comes with a creation dialogue and LinkedIn.
But now I can open up the Bing
sidebar and you can see here,
I now will go to compose and I'll just give it a prompt.
I'll say, hey, introducing
the new AI powered Bing in Edge,
let's make that enthusiastic and generate
a draft and just lightly to help with enthusiasm.
Just like that, you get
a little draft and I can edit it and then with one-click,
it copies right over into my post dialogue,
I can add some hashtags to get it
some juice and just like that,
I've created a post.
All of these amazing new capabilities and
what we think is a revolutionary new experience.
World-class search, the ability to actually get
answers to your questions made easy with integrated chat,
and the ability to generate content when you need it to
spark your imagination brought
to you not only when you're searching,
but everywhere on the web,
courtesy of the new Edge browser.
With your copilot for the web,
we aspire to unlock that joy of discovery,
that wonder of creation,
and that feeling of empowerment and being
able to harness the world's knowledge.
Thank you on that on the presentation.
In a moment,
two of our engineering leaders
are going to come up and unpack a little bit about
the technical details of how we built
the new Bing and our approach to responsible AI.
But before I do, I'd like to invite Sam Altman to come up
and share his thoughts on this moment and
our joint work. Welcome, Sam.
SAM ALTMAN: Thank you Yusuf.
It's great to be here.
The new Bing experience really
looks fantastic and the depth of
it doesn't come through until you
get to use it. I hope you-all will enjoy.
OpenAI and Microsoft have been working
together for more than three years now.
We're so grateful to have
a partner that shares our vision,
our values of building advanced AI that's
safe and will have a very positive impact on society.
Thanks to our partnership and Azure's AI infrastructure,
we've been able to make pretty
significant strides in our research,
which has led to the creation of systems like
ChatGPT, Dolly, and Codex.
But we want to make the benefits of AI
available to as many people as possible.
That's why we've worked with Microsoft to get
this AI technology into the hands of
millions of people through Azure Open AI service,
GitHub Copilot and starting today, Bing.
The new Bing experience is powered by one of
our next-generation models that
Microsoft has customized specifically for search.
It takes key learnings from ChatGPT,
GPT-3.5, and the new model is faster,
more accurate and more capable.
We all search for things many times a day
and Microsoft has created and shipped,
a much better, more useful,
and more enjoyable search experience.
I feel like we've been waiting for this for 20 years.
I'm very happy it's here. I believe
that using AI to transform
critical tasks like these is going to greatly
improve our productivity and day-to-day quality of life.
I think this is the beginning of a very new era.
Going forward, we are excited to
continue to collaborate with
Microsoft very far into the future.
The two companies share a deep sense of
responsibility in ensuring that AI gets deployed safely.
It's very important to us we're
eager to continue learning from
real-world use so they
will create better and better AI systems,
you've got to do that in the real-world, not in the lab.
Our teams will continue working together to set standards
for the use of these systems across the entire industry.
Thank you very much, and now I'll hand it over to Dena.
DENA SAUNDERS: Thank you, Sam, Yusuf.
Hello. I'm Dina Saunders.
I'm the product leader of
the team who brought together the magic
behind being combined with
OpenAI's most powerful model to date.
Yusuf introduced you to Prometheus.
I'm here to tell you about the technical innovation
behind the experience.
There are five aspects contributing to the new Bing.
The first is a substantial under
the hood change to every layer
of things technological stack.
The second is a proprietary system
called Bing chat orchestration.
The third is state of the art prompt generation.
The fourth is around inference and
a brand new interactive user experience.
Our last development is the infrastructure
to scale for the opportunity ahead.
Today we'll touch on three of these.
Let me start with Bing chat orchestration.
This is what we use to gather
information needed to answer your question.
The chat orchestrator ingest
your long semantic query and fans out multiple searches.
In fact, you might remember as Yusuf was sharing earlier,
you can actually see in
the user experience the searches that
Bing plus Prometheus is issuing on your behalf.
In addition to searching,
we also inject additional sources of
information to help inform the model.
Let me pause here and say that is incredibly important.
We're actually pulling in fresh data, news, answers,
contextual signals such as your location and
the context of the conversation to feed into the model,
to help ground the information
that the model is then using to reason over.
With this fresh information,
some of the experiences that US have showed you earlier,
such as a sports example or inferring on recent events,
those would simply not be possible.
There's more though. This orchestrator in
parallel actually parses the documents
that it finds and identifies
important and relevant pieces of information.
This orchestrator is by nature,
curious and if it comes
across something particularly interesting
or insightful and actually starts that same cycle again.
This virtuous loop results in a package of
information that informs and
fuels the responses you are seeing.
This magic combination of reasoning capabilities with
fresh contextual data sparks
the innovative and groundbreaking combination
that powers the new Bing.
Next, I'll talk about inference and
the new interactive user experience.
The new interactive experience
seamlessly blends search and chat.
We feed a prompt, ie,
a proprietary set of instructions
into the model which synthesizes
the information reasons over all of the contexts
that we've gathered to form an answer to your question.
This overall process is called modal completion.
Think of it like when someone knows you
well enough to complete your sentence.
We run model inferences to generate
tokens ie words which we
stream to you in the user experience
real-time as you saw in Yusuf's examples.
These words combine with sentences
to form the response to your question.
When appropriate, Prometheus
enhances the text answer with
citations to sources and
rich structured answers from Bing.
In users presentation, you saw that
these inferences display in a
brand new, innovative, dual,
interactive user experience that
blends new conversational elements,
a classic search experience,
and a new interactive, immersive chat mode.
The last aspect of our differentiation and
innovation is around the infrastructure we have to scale.
Our differentiation comes really
from taking this innovation that we just shared with
you today and being able to ship
this at scale to millions of users worldwide.
Earlier, I talked about how we touched
every aspect of the tech stack to deliver this.
All of those steps that we just talked about,
those are done in the order of
milliseconds served at the speed of search latency.
When Yusuf was showing you the search page,
you can see near instant search results
juxtaposed on the conversational
insights, all streaming in.
This is possible because we're built on Azure,
the world supercomputer,
the best and most trusted Cloud platform available.
This was a huge effort, incredibly hard.
Behind the scenes, teams were working on powering and
building out machines in data centers worldwide.
We were carefully orchestrating and configuring
a complex set of distributed resources.
We built new platform pieces
designed to help load balance,
optimize performance and scale like never before.
There is no other company that can
do this like Microsoft and Bing.
As we roll out the product globally,
we're starting out small, focused on
learning and evolving with the help of your feedback.
We understand that with this major technological shift,
there will be new challenges along the way.
We're committed to learning with you.
On behalf of the team, we're delighted to
share this moment with you and with that,
let me introduce you to Sarah,
who's one of Microsoft's renowned responsible AI experts.
SARAH BIRD: Hi, I'm Sarah Bird.
I lead our Responsible AI engineering team for
new foundational AI technologies
like the Prometheus Model.
I was one of the first people to touch
the new OpenAI model as part of
an advanced red team that we pulled together jointly
with OpenAI to understand the technology.
My first reaction was just, wow,
it's the most exciting and powerful technology
I have ever touched.
But with the technology this powerful,
I also know that we have
an even greater responsibility
to ensure that it is developed,
deployed, and used properly.
Which means there's a lot that we have
to do to make it ready for users.
Fortunately, at Microsoft,
we're not starting from scratch.
We have been preparing for this moment for many years.
Since 2017, we have been investing in
a cross company program to ensure that
our AI systems are responsible by design,
which has pushed us to innovate across
our entire portfolio of
products to solve Responsible AI challenges.
Through that we have created a foundation of
Responsible AI product implementation patterns
that we can build on top of.
We're also not new to
working with generative AI technologies.
We've been using early versions in Office and Bing
for spell checking and grammar rewriting for years.
Obviously, lately, it's only getting more exciting.
In the past year, we launched the GitHub Copilot GA,
based on the codecs co-generation model,
the Bing creator powered by
the DALL-E image generation model,
and our own Florence multi-modal model.
That's a lot of generative AI.
With each one, we have learned more about
the risks that generative AI technologies can bring,
including well-known ones like
the perpetuation of stereotypes and bias,
as well as novel risks such as
jail breaks and we
have developed mitigations to address them.
In developing the new Bing,
we are building on our years of operating
large-scale consumer and enterprise services
and our deep partnership with OpenAI.
However, for this product,
we went further than we've ever gone before.
We have marshaled the full strength
of our Responsible AI ecosystem.
Scientists, researchers, ethicist, engineers,
and legal and policy experts to work together as
a single team to develop
approaches to measurement and risk mitigation strategies.
We've added Responsible AI to every layer,
from the core AI model to the user experience.
First, starting with the base technology,
we are partnering with OpenAI to
improve the model behavior through fine-tuning.
Second, we've built a state-of-the-art safety system.
Bing has been maturing its defensive technologies
over many years of operating a web scale search engine.
These include AI systems that understand user queries and
classified documents to ensure
safe and quality results for users.
We have combined these with newer AI technologies
that we've created in Azure
to moderate generative models.
We are continuously retraining all of
these safety models by mining anonymous Bing query logs,
and using the new OpenAI model to
generate thousands of example conversations.
We can update our defenses in minutes to
respond to gaps we find or changes in the world.
The result is a safety system that enables us
to act and understand data at
every level of the application so we can defend against
intentional misuse and mistakes from the AI.
Finally, at the application layer,
we are iterating on instructions as part of
the meta-prompt to guide
the model to produce great responses.
We have designed the user experience to ensure
users understand and are in control.
How do we know all of this works?
Measuring responsible harms is
a challenging new area of research.
This is where we really needed to innovate.
We had a key idea that we could actually use
the new OpenAI model as a Responsible AI tool to
help us test for
potential risk and we
developed a new testing system based on this idea.
Let's look at a tech planning
as an example of how this works.
Early red teaming showed that the model can generate
much more sophisticated instructions than
earlier versions of the technology
to help someone plan an attack,
for example on a school.
Obviously, we don't want to
aid illegal activities in the new Bing.
However, the fact that
the model understands these activities
means we can use it to identify and defend against them.
First, we took advantage
of the model's ability to conduct
realistic conversations
to develop a conversation simulator.
The model pretends to be
an adversarial user to conduct thousands of
different potentially harmful conversations
with Bing to see how it reacts.
As a result, we're able to
continuously test our system on
a wide range of conversations
before any real user ever touches it.
Once we have the conversations,
the next step is to analyze them,
to see where Bing is doing
the right thing versus where we have defects.
Conversations are difficult for most AI to
classify because they're multi turn
and often more varied.
But with the new model,
we were able to push the boundary of what is possible.
We took guidelines that are typically
used by expert linguists to label
data and modified them so
the model could understand them as labeling instructions.
We iterated it with it and
the human experts until there was
significant agreement in their labels.
We then used it to classify
conversations automatically so we could
understand the gaps in our system
and experiment with options to improve them.
This system enables us to create a tight loop of testing,
analyzing, and improving,
which has led to significant new innovations
and improvements in
our Responsible AI mitigations from
our initial implementation to where we are today.
The same system enables us to test
many different Responsible AI risk,
for example, how accurate and fresh the information is.
Of course, there's still more to do here today
and we do see places where the model is making mistakes.
We wanted to empower users to understand
the sources of any information
and detect errors themselves,
which is why we have provided
references in the interface.
We've also added feedback features
so that users can point
out issues they find so that we can get better over time.
Maturing and new technology takes time and collaboration,
which is why we're very excited for
this next phase where we can
share the technology with the real-world.
We look forward to seeing users unlock
its full potential and getting feedback from
stakeholders across society to help
develop this into an essential tool for the future.
Our goal is to continuously advance the state of the art.
Yusuf, back to you.
YUSUF MEHDI: Thank you, Sarah. To close,
we have one final piece of news to share.
I'm pleased to announce that the new Bing is live
today for desktop limited preview.
What this means is that we're going to
make Bing available today for
everyone to try on a limited number of queries
and then to sign up on the wait list to
get access to the full experience.
We're also starting with a select group of folks
who will have access to the full experience right away.
All of you here in the room,
you're all going to be on that list so you're all
going to get to try it right away.
In addition then we plan to expand
to millions of people in the coming weeks.
We are also going to be launching our mobile version.
Today is an important part of our journey.
But it's just the beginning. As Sarah said,
we intend to innovate quickly,
get your feedback, and continue to bring
new innovations and capabilities to everyone.
Welcome to the new Bing and Edge.
Thanks very much, thanks for being here.
[MUSIC]