3.3. 簽署 (3. Signatures) - VoiceTube 看影片學英語

字幕列表影片播放

The following content is provided under a Creative
Commons license.
Your support will help MIT OpenCourseWare
continue to offer high-quality educational resources for free.
To make a donation or to view additional materials
from hundreds of MIT courses, visit MIT OpenCourseWare
at ocw.mit.edu.
TADGE DRYJA: Today, I'm going to talk
about signatures and all sorts of different signature
kind of things.
In the problem set, you're working with signatures,
but you're working with hash-based signatures, which
are not actually used in bitcoin at all.
But we'll talk about those.
OK.
So if you've gotten through the homework,
there's lamport signatures, right?
These are hash-based signatures.
And they use hash functions.
So it's fairly straightforward.
You can understand them.
There's nothing super crazy going on.
The code is fairly compact.
So that's cool.
What are some disadvantages of these lamport signatures?
Does anyone-- yeah?
AUDIENCE: You can only use it once.
TADGE DRYJA: Yeah.
OK.
So plus.
This is hashes.
That's cool.
One-time use.
Other possible disadvantages of them relative to other systems,
if you're aware.
Another is they're kind of huge, kind of big, right?
You can deal with it.
But if you were looking in the forge,
or that file with the signatures,
it's like, what, 8K for a signature--
8 kilobytes-- kind of big.
Keys are 16 kilobytes--
kind of annoying.
Private keys are also 16 kilobytes.
So yes, sig 8K, 16K priv/pub key.
So that's some disadvantages.
So since I don't have slides, I'm
gonna make this more fun and interactive.
What are some solutions for these problems?
So we can actually mitigate/solve
both of these things to a pretty good extent.
So how about the first one, one-time use?
What would be a fairly obvious way
to mitigate the one-time use problem?
And don't think the answer is too stupid.
It may be a fairly stupid answer, and it might work.
So yeah?
AUDIENCE: Not actually revealing pieces of your private key?
Instead, reveal something else.
TADGE DRYJA: There's probably some clever way.
But that might be too clever.
Something really simple for, OK, I can only use a key once.
How can I use a "key," quote unquote, more than once?
Yeah?
AUDIENCE: Make another one.
TADGE DRYJA: Yeah.
You can make another key.
So you could say, well, I've got this 16 kilobyte public key.
Well, I'm going to make a 32 kilobyte public key.
And it's just two public keys stuck together.
And now, when I make a signature,
I just put an extra bit in the front.
And I say, well, this signature is using key 0
or this signature is using key 1,
and it's got the whole signature after.
And then you look through this 32 kilobyte public key,
and you say, OK, well, it starts with a zero,
so that means it's using the first key, the first subkey
in this 32 byte public key block.
And in this case, it's using one,
so that means it's using the latter subkey.
So that would work.
That would let you use your public key twice,
at the cost of doubling your public key size, which
is not really great, right?
And it's not very efficient.
But it does sort of work.
OK.
Any clever ways to do it more efficiently?
Or wait.
So OK, also, I'll give you sort of a hint.
In this case, let's say this is pub sub 0 and pub sub 1, right?
And then, your 32 byte pubkey is just them
concatenated together, right--
pub0, pub1.
What would happen to the private keys in this case, right?
How would private keys work here?
Same expansion of size, I guess.
Can anyone think of a way to mitigate the expansion of size
of private keys in this case?
So the private keys are the preimages here, right?
They lead into these public key blocks.
So you could just say, OK, well, I
have twice the size private key leading into twice the size
public key.
Could you do that more efficiently?
Yeah?
AUDIENCE: Could you just hash the private key
so that you have two hashes instead of one?
TADGE DRYJA: Yes.
So let's say you have this 16k block,
and you want this to turn into two public key.
So that's the basic good way to do it.
And it sort of turns in like that.
How exactly-- what's the way you do that?
AUDIENCE: You can keep the same private key as before
and just add something like zero or one to indicate--
TADGE DRYJA: Yeah.
So this is a hash function, right?
And so before, we just said, OK, hash of this is block 0,
this is block 1, this is block 2.
So the idea of pub--
let's see.
Is this visible?
This might be too small, right?
AUDIENCE: Yes.
TADGE DRYJA: Yes, OK.
Let me make this bigger.
Sorry.
OK.
So in these diagrams, you've got your private key right now.
And it's in these big blocks.
And there's 256 of them, but let's keep it small.
And the idea is these are 32 byte blocks with random numbers
in it.
And then you hash it to get your public key.
So we say, OK, pub2--
and this is public, this is private, and, let's say,
secret--
pub2 is just the hash of secret2, right?
But yeah.
What we could do is we could sort of have two different hash
functions.
And then a real simple way to make
a whole bunch of different hash functions
is we define, OK, well hash0 is defined
as the hash function of whatever your input x is concatenated
with the number 0.
And then hash1, we define as just x comma 1 and so on.
And this is actually secure.
You could do this.
Any questions or possible objections?
AUDIENCE: I was thinking that if someone knew the hash function
you're using, wouldn't they only define
x because they know that it won't help with [INAUDIBLE] x.
[INAUDIBLE]
TADGE DRYJA: Yes.
Yes.
So there's no real entropy or secrets in this 0 and 1.
But it's purely riding on x, right?
But the idea is, well, if I do this,
and I say, OK, well pub2 is the hash
of secret2 concatenated with 0, yeah, if you know secret2,
you can go back.
Because 0 is obvious.
But the idea is if you don't know secret2, the fact that you
know the last byte of the hash input doesn't really help you.
Because there's all this data that you don't know.
And so you're not going to be able to find a preimage.
You're like, OK, I know the preimage
to public2 ends with a 0 byte.
What are the other 32 bytes that come before that?
You still can't go back to make a preimage.
AUDIENCE: But it feels like there's some sort of--
what's the word-- you'd make a similar statement saying, oh,
if the last byte is not important,
then the second-to-last byte is not important, either, right?
TADGE DRYJA: So the attack is I've got pub2,
which is a 32 byte hash output.
I want to find what this thing is.
Now, I do know what the last byte is, right?
It's going to be, let's say, 33 bytes,
and the last byte is a 0.
And this is going to be some random numbers
all the way back.
So I have some insight into what the preimage is.
I know the last byte is a 0.
But that doesn't help because I don't
know all the bytes before that.
So I can't really do--
I can't sign because I need to be able to reveal
all these things.
And the way the hash functions, if it's a good hash
function, the fact that you know some part of the preimage
shouldn't give you insight into the other parts
of the preimage.
Because it looks really random.
And adding stuff to the end here shouldn't reveal anything.
There are hash functions where this can be dangerous,
and you can consider those like broken hash functions, where
if I concatenate a lot of known data at the end,
it might give me properties of the public key that I can find.
[INAUDIBLE],, as far as we know, it works fine.
Most of the hash functions work fine this way.
So yeah, you can do this.
And then you can say, OK, well, I'm
going to use hash function 0 to make pubkey 0, right?
This is pubkey 0.
And if I use hash function 1, I make pubkey 1.
Now I have a 16 kilobyte-- let me erase this--
so now what I can do is I can say, OK, well,
I have a 16 kilobyte secret key, private key,
but I have a 32 kilobyte public key
that I can use twice that has two subkeys
so I don't have to store as much of my private key data.
So that's cool.
That makes it quite a bit more efficient.
Does anyone have an idea of how to get it even more efficient
for private key storage?
You can actually get it very efficient
using this kind of idea.
Any ideas?
Yeah.
Well, you--
So some hints.
This is pretty useful, right?
The idea is, well, even if you know
what the sort of suffix of all these things
are, if you don't know x, you're not
going to be able to find the preimage, right?
So maybe add some more stuff here.
Any idea?
Oh, yeah?
AUDIENCE: Could you hash the index?
TADGE DRYJA: Right.
Right.
So instead of just saying 0 and 1,
we can say here, well, why not just have one chunk?
And there's one chunk, and it's got random number x.
Well, this is hash x comma 0.
This is hash x comma 1.
This is hash x comma 2.
OK.
So yeah.
So what you can do is you can--
this was before, right, where I said, OK, yeah,
but this is basically how to do a 32 byte private key where
your private key is actually quite small,
and you just derive all of the public keys
by adding numbers at the end, concatenating them
into your hash.
And now you're like, OK, cool.
I have a 32 byte private key, and I
can make enormous public keys from them,
and it's still secure, right?
Because I can add whatever number I want here.
And I can do this a million times,
and no one will be able to find the preimage, even though I'm
giving you all these related hashes, right?
It's x comma 0, x comma 1, x comma a million.
I can give you millions of them.
And each time I do it, it doesn't give you
any insight into how to find what x is--
if it's a good hash function.
So that's really powerful.
And now I can say, OK, the private key problem is solved.
I have 0 of 1 growth in my private key.
I can make as many public keys as I want, never
gets any bigger.
Cool.
Still have this problem with the public keys, though.
Let's say I want to sign four times.
OK.
Well, I make a 64 kilobyte pubkey.
Now I can sign four times.
Great.
But my public key size expands with the number of signatures
I ever want to do.
Any ideas on how you could mitigate that
or how you could still make a usable system,
despite this fact?
There's a couple different techniques.
Any ideas?
OK.
So one would be every time I sign,
I also sign my next pubkey.
You can do that.
It's got some downsides, right?
So the idea is I first publish one 16 kilobyte pubkey.
And then I say, OK, when I sign, I'm going to sign my message.
And concatenated to that message will be my next pubkey.
And so I'm signing something and also indicating
what my next public key will be that I sign with.
Then I can still maintain a small private key
and then keep signing.
The problem is, in order for someone to verify,
they're going to need to look at all of my previous signatures,
right?
So if I'm saying, hey, here's my 500th public key
that I'm signing with, here's the message,
you're going to have to go back and look at my entire history
of signing, which is not great.
OK.
So other ideas of how to deal with large pubkeys?
What's a way you could do this?
So hints.
Is there any kind of data function
we've seen here where it takes an arbitrary sized amount
of data and outputs a fixed size?
[LAUGHING]
In fact, it's the only thing we've
been talking about so far.
So what would you do to make your public keys smaller?
AUDIENCE: Hash.
TADGE DRYJA: Hash them.
Right.
OK.
So you say, well, yeah, 32 byte pubkey.
Well, you just commit to the pubkey
with the hash of the whole thing, right?
And I say, well, I've got my 16 kilobyte by pubkey.
I just hash it, right?
And now I have a 32 byte public key.
And I've committed.
That works, right?
I've committed to my public key.
However, when I sign, I need to provide the whole public key
in order for you to verify.
I need to provide this whole 16 original pubkey,
original version pubkey, after when I sign.
So now my signature goes from 8K to, what, 24K.
So I haven't really made it more efficient, right,
in that the total thing with public keys and signatures
stays the same size--
actually, slightly larger.
But this is kind of a game.
Because the public key, it might be
like I'm showing it to you beforehand,
and we're space-constrained there.
And then, when I'm doing the signature,
I have more space or time or something like that.
So this is actually useful.
And actually, this technique is used
in Bitcoin and all different cryptocurrencies,
where you can call this a pubkey hash--
pubkey hash, or they say PKH.
And the main benefit is size, right?
You could say, well, if I've got a really big pubkey,
I can commit to it by sending to the hash of the pubkey.
And then, when I later want to sign,
I reveal the actual pubkey.
You can do it that way.
OK.
So then this is pretty cool, right?
We've now taken our system, which had fairly large--
16 kilobyte-- private keys, turned them
into 32 byte private keys.
Great.
And also, this is 0 of 1-- never expands.
Awesome.
We've taken a system with large, potentially huge--
32 kilobyte, 64 kilobyte-- public keys.
We've got it down to 32 bytes.
Awesome.
OK.
We've now made this thing much more efficient, much more
useful.
The problem-- the signatures are still big.
They actually got a little bit bigger
because they include the full public key in the signature.
And let's say we had one of these things,
where we make a bunch of public keys
and commit to them at the outset.
So you say, OK, I've got a 64 kilobyte pubkey.
I can commit to the whole thing by hashing it.
However, then, when I reveal and want to give you a signature,
my signature is now whatever it'll be--
72, right?
The signature would be 72 kilobytes--
kind of huge.
So I can add a bunch of keys, but I'm just
sort of pushing the problem somewhere else--
into the signature, right?
There's a better way to commit to many public keys.
It's maybe not super obvious unless you've seen it before.
But if you have, do you know what
the better way to commit to many public keys is?
Or you could come up with it on the spot,
and it'd be like, wow, you should have gotten
this stuff named after you.
Any idea?
OK.
So what you can do--
it's a little non-obvious--
you can make a tree, right?
So the idea is instead of just saying,
oh, I'm going to take all four of my big pubkeys,
stick them all in a row, take the hash of that whole thing
and publish that as my public hash,
I'm going to have a little bit more complex design.
And that design is a binary tree.
OK.
So the idea is here are my four public keys that take up 64K
each.
I can make these intermediate hashes.
So I make a binary tree where I say, OK, this
is the hash of the first two keys stuck together.
This is the hash of the second two keys stuck together.
The top is the hash of these two intermediate hashes
stuck together.
And then, this top is my root, which is
equivalent to my pubkey hash.
And so I can publish this, and it commits me to all four
of my public keys.
So it might not be obvious.
Do you see any advantages that this
could have over just concatenating all four
of the keys and hashing it?
Yes?
AUDIENCE: I'm confused.
Where is your private key in this?
TADGE DRYJA: The private key is down here, right?
The private key is not shown.
But I have a private key.
It turns into these pubkeys.
And we can use the technique here,
where we just have, really, one 32 byte private key
and create all the different chunks of all the public keys
by concatenating different indices.
Yes?
AUDIENCE: You can do [INAUDIBLE] systems and log-in time instead
of linear time.
TADGE DRYJA: Yeah, exactly.
I can prove that, hey, this is public key 0, which
is in here, this root hash, and I don't have to reveal all four
in order to do it.
So yeah.
Did someone else have, probably, the same idea?
Yeah, OK.
OK.
So how are we going to do that?
You obviously know.
Anyone else?
Yeah.
AUDIENCE: So assuming you wanted the public key 0,
you would publish that and 1, and also hash of 2, 3.
TADGE DRYJA: Right, yeah.
And then you can prove up to the top.
So I have little colors.
So the idea is the verifier who's verifying the signature,
he already knows the root, right?
Because you've pre-published that as your public key,
as your public key root and your public key hash.
And then that verifier is provided
with public key 0 saying, hey, here's a signature.
Here's the message.
Here is public key 0.
So you've got both of these things.
Now, the prover needs to prove that public key 0 is
contained within the root.
And to do that, the only two things the prover needs to add
is pub1--
OK, so the prover says, here's public key 1.
That allows the verifier to compute this.
And then the prover says, hey, here's
hash 2, 3, which allows the verifier to compute the root
and then check that it's equal.
So yeah, you only need two extra hashes instead of all four.
So saves some space.
It doesn't save that much space when you only have four.
It saves a lot of space when you have 1,000, right?
So you can add 0 of n elements.
And the root stays the same size.
The root stays 32 bytes, regardless
of how many leaf nodes you have.
And you can prove an element with log n--
oh, those parentheses are wrong--
0 log n intermediate hashes.
So if you have 1,000 keys, your root stays 32 bytes,
and your proofs are going to be not too, too big, right--
320 bytes of overhead, which is a bunch smaller
than a signature, right?
So this is really cool.
And you can use this much more practically
than the raw lamport signatures that
were in the first problem set.
Because those, if you're just doing
one key and one signature, it's about the same size.
But this actually lets you use it multiple times,
lets you store a much smaller private key,
commit to much smaller public keys.
It's a more usable system.
The signatures, though, still are kind of huge, right?
Your signatures are going to be a little over 8
kilobytes, which is OK, right?
Computers can deal with 8 kilobytes.
That's called a Merkle tree made by Ralph Merkle
in '76 or something.
And it was originally made for exactly that purpose.
Anyway, so these are called Merkle trees.
Ralph Merkle.
He doesn't really work on crypto now.
He works on biotech stuff now, I think.
Anyway.
But he's cool.
And he gets, like, everything named after him.
So you will see the word Merkle everywhere.
And it's, you know, he invented something cool.
Awesome.
This is one of the kind of things,
though, where it seems like, yeah,
I might have been able to think of that, right?
It's sort of a binary tree and hash functions.
It's nothing that crazy.
But so there's all sorts of Merkle trees.
You can make a different thing.
It's used in Bitcoin.
It's used in a bunch of different cryptocurrencies.
Oh, cool.
It's a very powerful way to take a whole bunch
of different things and commit to them in one small thing.
OK.
So this is very cool.
But we can do better.
And so the next things are things that I definitely
would never have thought of because it's much more
complex than just, like, hey, let's use a binary tree
with hashes.
And I will mention RSA and then go
into ECDSA and elliptic curve [INAUDIBLE] signatures, which
do different things and are not based on just hash functions.
So I'll talk about RSA for a few minutes,
and then we'll have a little break, like intermission.
And then we'll go into elliptic curve stuff.
OK.
So RSA was invented by locals.
The R is Ron--
no, I think the R stands for Rivest,
but Ron Rivest is at CSAIL, and he's still
working on cool crypto stuff.
It's not used in Bitcoin or any currency, actually--
well, any currently operating cryptocurrency.
Because the signatures and keys are
a bit larger than the later systems we'll see--
smaller than the hash-based things, but still a bit larger.
Was used in chaumian blinded cash.
Because you can do blinded signatures, where you say,
OK, I'm going to have you sign something for me,
and then after you give me the signature,
I can pull apart the signature.
Because I'll have you sign something
that you don't know what you're signing, right?
I'm going to mask this message and blind it
so that you sign the message, and then I
can get your signature on the real message
without you knowing what you signed.
So basically, the idea is, well, I actually
have message prime equals message plus randomness.
And I say, hey, can you sign m prime?
And you say, OK.
I produce a signature on m prime with my private key.
And we'll call that s prime.
And then what I can do is I can s equals s prime
minus the randomness, or, really, some function
of that randomness.
And now I have a signature.
And this is a valid signature on m.
So that's kind of cool.
It's like, hey, can you sign this message for me?
And we're not going to tell you what you're signing.
It seems sort of useless, like why would I
sign something I have no idea what I'm signing?
But it can be used in that chaumian blinded cash.
Because I then have a proof that, well, this bank didn't
know what they were signing, but they signed this,
and this is their signature, and they gave it to me.
And I can use that as a thing to represent money.
There's all sorts of cool things you can do with this.
The basic setup of RSA is make two prime numbers, p and q.
And then compute n, which is the product p times q.
So this seems fairly straightforward, right?
It's not too hard.
Everyone knows what prime numbers are.
Now you have this composite number, p times q.
How do I make a leading question for the next slide?
Does this have any relation to things that we've looked at?
This doesn't look at all like a hash function, right?
But are there any properties similar to a hash function
here?
Yeah?
AUDIENCE: It's hard to route the image.
It's hard to [INAUDIBLE].
TADGE DRYJA: Yeah.
It's a one-way function in that if I know p and q,
I can compute n, right?
But given n, I can't necessarily find p and q, right?
If they're small, sure, I just check.
And there's all sorts of algorithms to find factors.
But if p and q are pretty big, like hundreds of bits long
or thousands of bits long, finding n
may be impractical, whereas if p and q are really long,
finding the product of both of them will be very practical.
Even if huge, multiplying two integers is really easy.
So this is a one-way function but not a hash function.
And so a lot of these signature systems use one-way functions,
but trickier one-way functions that aren't hash functions,
and still have cool things that you can do.
So I'm not going to go into RSA in super detail.
I'm going to run through it really quick,
and you'll be like, huh, how does that work?
But if you think it's interesting, you can look it up
and then maybe you go to CSAIL and ask Ron
how it works, although maybe try to look it up first.
He's probably explained it a lot of times.
But it has different properties.
So you could say, I'm going to use this p times
q equals n as a hash function and make a lamport signature
out of it where my private key is a whole bunch of p and q
pairs, and my public key is a whole bunch of n's, and I
reveal them and stuff.
You could do that.
That would work.
But you can do much more powerful things
because of the mathematical properties of these things.
So yeah, I'm just going to go through it really fast.
But if you know anything, this is kind of cool.
The idea is you can do a bunch of multiplication
and exponential modulo n.
And so because n is almost prime,
there's some interesting properties,
where if you know the factors of n,
you have these shortcuts you can do.
And someone who doesn't know the factors of n
can't do these shortcuts.
So you publish e, which is a constant-- this is sort
of a parameter system.
So RSA is used today all over the place.
If your computer is open and you've
been using the web in the last five seconds,
your computer has performed RSA calculations.
Every time you connect to a website, you're doing RSA.
So generally, it's 65537.
They just make this up.
You could also use three-- some small number.
And then d is a number that you can compute
if you know what p and q are.
So this is sort of the private key, right?
p and q itself can be seen as the private key.
But you can actually compute this d number
and then discard p and q.
And so this is how you compute d or whatever.
But the idea is n is your public key, and d is your private key.
e is always the same, and p and q, you can discard afterwards.
And then what you can do--
and not going to get into it-- but you can say, OK,
my signature s is message, right--
I can take the hash and make a message m--
message to the d power modulo n.
And if I verify, if I take that signature to the e power modulo
n, it should equal m again, which
is crazy and kind of awesome.
Because the idea is to sign, OK, I
just raised to the d power mod n.
It's going to be some number on the same size as n, right?
And then I give that to someone.
They take that signature to the 65537 power modulo
m again and see if m--
the message-- comes back out.
And if it does, they know I signed,
and I have the private key that corresponds to n.
Now, the cool thing is I can use this any number of times.
I can use the same thing a bazillion times
with different m's, and that doesn't give any help
on how to factor n or how to compute d.
So this was late 70s, also, RSA, and one of the first,
hey, we can do better than hash functions.
It was developed at around the same time
as hash-based signatures.
All of these things were sort of coming out at the same time.
You can do lots of cool stuff because there's
these properties where we can make m prime plus m
and take the exponentiation of all that, and it still works.
OK.
So that's really cool.
You can do lots of cool stuff.
But this is also not used in Bitcoin.
One of the things is the size.
So key sizes are smaller than with lamport signatures.
And they're often about 2 kilobits, or 256 bytes,
which is fine.
And that's the same size as signatures.
So if you use your web browser--
secure-- well, how secure?
Certificate, valid.
Oh, this browser might not show me.
OK, never mind.
AUDIENCE: In Firefox, it'll do it.
TADGE DRYJA: In Firefox, it tells you more.
OK.
Well, anyway, anyway, anyway.
But yeah, there's RSA keys in, basically, every certificate
for web browsers and websites.
And they're generally 2,048 bits long.
That's seen as pretty secure.
Even 1,024 bits has not been hacked.
But people think it probably could
be by a very motivated actor.
So this is decent size.
One thing about RSA is that it's a little tricky to implement,
right?
There's a whole bunch of weird math there.
It's not like the lamport signatures,
where it's like, OK, I have a hash function, use it, right?
It's kind of hard to shoot yourself
in the foot with the lamport signature scheme.
You can, right, but it's sort of obvious
if you do, so if you're doing the homework
and it just doesn't work at all, and you're like,
oh, I was hashing twice, or, oh, I was reversing the order,
or it just didn't work.
But there's no subtle ways where, oh, I was off by one,
and now it reveals all my private keys.
That's hard to do, whereas in RSA, it's actually quite easy.
If your p and q values, for example--
what's a really crazy gotcha--
if p or q minus 1-- so if p minus 1 or q minus 1--
is smooth-- essentially, a number that's highly composite
has many small factors--
you can find someone's private key.
You can factor it.
So it's not just that p and q have to be prime.
P minus 1 and q minus 1 have to be not too unprime.
It can't be a power of two or something.
Otherwise, you can lose your privacy.
So there's all sorts of crazy things that happen with RSA.
Bitcoin and other coins use elliptic curve signatures,
which I'll talk about next, that are sort of displacing RSA,
in many cases.
So it hasn't really taken off for web certificates
but a lot of other types of encryption, signing, things
like that.
Cryptocurrencies are also one of the big uses of elliptic curve
signatures because they're somewhat more
powerful and somewhat smaller and really cool.
And we will talk about those in three minutes.
So intermission, three minutes.
Stretch.
Ask random questions.
Talk to the neighbors.
OK.
So they're using these elliptic curves,
which are curves of the form y squared
plus x cubed plus some coefficient x squared
plus some constant.
And in the case of Bitcoin, it's very simple.
It's y squared equals x cubed plus 7.
And that's the curve Bitcoin uses.
And this is what it looks like if you plot it.
And that seems really simple.
We can totally understand that.
But it gets a little weird.
With elliptic curves, you have these points on the curves,
right?
And another property is if you draw a straight line that
intersects this curve--
you could also draw a line which doesn't
intersect at all, fine--
but if you draw a line that intersects the curve,
it will intersect in three places.
So if I draw a line here, it intersects these three dots.
If I draw a line here, it intersects here, here,
and then it'll intersect somewhere down here.
It'll intersect in three places unless I take a tangent.
If I draw, and I try to be like, oh, I'm
going only intersect in two, well, I
can take a tangent here, and it'll
intersect in one more place.
What we can do, though, is we can say, well,
if you intersect at a tangent, that's kind
of like intersecting twice.
We can deal with that.
AUDIENCE: What if you draw a vertical line?
TADGE DRYJA: If you draw a vertical-- oh, yeah, vertical
line, then there's a point at infinity,
which shouldn't happen.
AUDIENCE: But infinity is included--
TADGE DRYJA: Yeah.
AUDIENCE: --in the set of points on the curve?
So it's like there's three points, anyway.
TADGE DRYJA: Yeah.
There's a point up there--
sort of.
In the practice of signing and stuff,
you probably shouldn't hit it.
Yeah.
But so what we do is we say, OK, well, three points in a line
equals zero.
And so if we take this point plus this point
plus this point, that's zero.
Another way to say that is P plus Q minus R equals 0,
or P plus Q equals negative R. And another property is
the negation of a point is just that same point
with the y-coordinate flipped.
So if this is negative R, we just
go down to the bottom side of this,
and this is positive R. Similarly, minus P
would be here.
Minus Q would be here.
And if you had a point Z here, negative Z would be up here.
This is how it's constructed.
However, we don't actually use these nice curves.
We chop them up in modulo something.
Yeah?
AUDIENCE: You have P plus Q equals negative R. Should it
be positive R?
TADGE DRYJA: No.
In this case, R is up here.
Right.
In this case--
P. Wait, hold on.
[INTERPOSING VOICES]
TADGE DRYJA: Oh, shoot.
Sorry.
P plus Q equals R. Right.
Sorry.
OK.
Never mind.
Yeah.
The picture is correct.
But so basically, P plus Q equals R
because R is down here.
Sorry.
Yeah, that's why.
OK.
This is R. This is negative R. So yeah,
P plus Q equals R, not negative R. Sorry.
When you want to multiply a point, you can.
Because you can take a tangent, right?
So you can add these points, right?
You can say, OK, I want the sum of P and Q,
which will be down here--
R. I want 2P, for example.
Well, with 2P, I can take a tangent
and then find where it intersects
and then go down below the curve.
So for example, if this is G, I say OK, well, G
starts here, take the tangent, find where it intersects
and then drop down.
And this is 2G.
So it's a simple way to say, well,
I'm not really multiplying.
I'm just adding it to itself, which is
sort of what multiplication is.
And I can do this again and again.
So I say, oh, I want 4G.
I take the tangent at 2G, find the intersection here,
drop down.
OK.
Here's 4G.
Take the tangent here, it'll go--
I can keep doing that and get 2G, 4G, 8G.
And if I want intermediate things, like, well, I want 3G,
well, I draw a line between G and 2G, find the intersection,
and find the sum.
So yeah?
Yes?
AUDIENCE: When you find the intersection of that
[INAUDIBLE] the intersection of the modulo of something,
does it just--
TADGE DRYJA: Yeah, yeah.
So the pictures look nice, in this case, right?
It's like, hey, there's this curve.
And it looks so simple.
And we can graphically do it on graph paper.
But when you're actually doing it on the computer,
it looks more like this.
Everything is modulo some giant prime number.
You could still draw lines.
It's just that the curve is now all these crazy-looking dots.
And you still have that symmetry from the y-axis.
But the way the computer does it is all modulo a number.
So it doesn't actually map to a nice curve.
So the fun thing about when you're doing modulo
is all the properties stay the same, right?
With regular numbers, you can still
do addition, multiplication, division
all modulo some number, and it still all works, right?
So same with this.
You still have the same draw a line between P and Q,
and you get to here.
And then you flip over the y-axis, and now this is R.
Similarly, you can take a tangent--
although a tangent is harder to visually see because there's
no obvious curve here--
but the equations work the same.
OK.
So you don't need to know exactly why these things work
or curves.
But we're going to go through what you can do with these
and what properties you have.
OK.
So we've defined these points on a curve
and how this curve works.
And then we also have regular old numbers, right?
So what we'll do from now on is say, like a and b,
our lowercase numbers, these are scalars.
Scalars just means regular old numbers.
The uppercase A and B will refer to points on the curve, OK?
So from now on, lowercase is regular number,
uppercase is point on this curve,
or point on this mashed-up thing that we still call a curve.
OK.
So I'll go through what operations we can do,
and then we can see what use those have.
So first, scalars.
You can add, subtract, multiply, divide.
You can do whatever you want, right?
These are just regular numbers, like five or 17 or 22.
Everything's OK, right?
Easy.
Everything here is integers.
We're doing modulo some big prime number,
so there's no decimal points.
I think in this class, we'll probably never
have to use floating point, which is great,
because floating point always scares me.
It's also uint.
These are like unsigned integers.
You can do it with signed.
But since you're modulo some big number,
it's like unsigned numbers that loop around.
So it's really nice for computers.
Computers deal very well with these things.
Computers are good at floating point, as well.
But the thing is, in cryptography,
a lot of times floating point ends up with error, right,
where you're pretty close, but you've
got some fixed amount of decimal places you can store.
Your floating point goes out to some precision.
And then error can accumulate.
And in a lot of cryptography, you want zero error, right?
Like in a hash function, it's pretty exact.
So a lot of times, the cryptography,
you use integers instead of floating point or rational
numbers.
OK.
So with scalars, these are just regular integers,
natural numbers even.
You can do these.
All the operations, you're familiar with.
That's fairly straightforward.
With points.
So you can add and subtract the points, right?
We showed how you do that visually.
And then the equations for it, the computer
can do that pretty quick.
However, this is not defined.
Multiplication and division with two points is undefined.
It's just not clear how you do it.
It's not something in this system, right?
It's a group.
You've got one operation.
You don't have this other one.
So you can add and subtract points,
but you can't multiply them.
Any questions about this?
This is pretty important.
Make sense?
Yes?
AUDIENCE: In the previous case, when you were adding points,
what is defined as addition?
Are you adding only the y components?
TADGE DRYJA: You use both the x and y-coordinates
to add, right?
So when you're doing it in the computer,
you're saying, OK, well, what are
the x- and y-coordinates of P?
What are the x- and y-coordinates of Q?
Find the slope here and where it intersects the axis.
And then see where it'll intersect the curve
at another point.
AUDIENCE: Addition Is literally defined as compute that point.
It is not a function of adding the x- and y-coordinates.
AUDIENCE: Gotcha.
TADGE DRYJA: Yeah.
Well, I mean-- yeah.
But the equation would be--
since the computer doesn't actually look at the curve,
you just say, OK.
Find the Px- and y-coordinate.
Find the Qx- and y-coordinate.
Find the slope, and then find where it intersects,
and then compute for that for R.
OK.
So yeah.
You can add and subtract.
You cannot multiply and divide these points.
Sounds good?
Any other questions?
Yes?
AUDIENCE: How would you subtract?
TADGE DRYJA: Ah.
You just go down, right?
So if I want P minus Q, I should say P plus negative Q.
And negative Q is just right here, right down on the axis.
So I say, OK, I want P minus Q. I go here.
Find it.
It's going to be up there somewhere.
So yeah.
You just add the negative of the number.
I guess that's another thing.
Minus B-- negative B-- is also an operation I can do.
I can negate something, which lets me do subtraction.
AUDIENCE: For those of you with math backgrounds,
this is defining a group operation
on the field of natural numbers, modulo--
TADGE DRYJA: Which is a big one.
AUDIENCE: --and using this elliptic curve.
So this is A. And the only thing that's defined
is addition and subtraction and not multiplication.
TADGE DRYJA: Right.
You can do the same group operations
without using elliptic curves.
And so you can just do exponential modulo
big prime number.
I'm going to use curves because it's
what is used in all the actual systems.
But you can sort of forget about the curve after this, right?
You can say, well, look, we just have these uppercase variables
and these lowercase variables.
And the fact that it's on a curve, we can sort of abstract
away and just know that we have these different types,
essentially.
And when you're in the computer, you're just like,
yeah, that's a point.
That's a scalar.
Here are the operations I can do,
and I just run the functions.
OK.
So the next part, when you mix scalars and points,
so mixed operations.
You cannot add a point and an integer or subtract a point
and an integer.
That's not defined, right?
You say, OK, here's this point p minus 7.
Well, does that mean move the x-axis?
Does that mean move the y?
It's not defined.
However, you can do this, right?
You can multiply and divide.
So you can say, well, A times 2, well,
you take the tangent, right?
It's just A plus A. Or A times 7, is we just break it down as
A plus A plus A plus A 17 times.
We can also divide, which is a little weird
because we have to find the inverse of B
and then multiply by that--
the multiplicative inverse of B modulo the order.
But anyway, you can do these.
This is a little trickier because you
have to compute what, like, B to the negative 1 is
and multiply by that.
You can do it.
So you can do these things where you have points times scalars.
But you cannot add.
Yeah?
AUDIENCE: So 2A is basically joining
the tangent to A, [INAUDIBLE] beginning.
TADGE DRYJA: Yep.
So that was here.
If this is G, this is 2G.
Find the tangent, find the intersection, and then negate.
OK.
So you can do that.
That's cool.
You cannot do that.
OK.
So roster of operations we can do.
This is it.
And this is sort of, if you think of that,
you can do all these cool things.
It's sort of amazing, all the crazy things
you can do with just these operations, right?
With regular numbers, you can do whatever you want.
That's obvious.
And with these points, you can add and subtract them.
And when you mix, you can multiply and divide.
And that's it.
OK.
Any questions so far?
You don't have to worry too much about the curves and all
of the math and stuff.
But you just sort of say, OK, here's my toolbox.
Here's what I can do.
OK.
Any questions?
Good?
So now what we can do is we can define a one-way function
sort of like a hash function, sort of like multiplying
p times q to get n.
What we will need, in addition to this,
is everyone pick some point on the curve, and we call it G.
And it's random.
And we want it to be somewhat verifiably random.
We just all have to agree on a point.
And we'll call this the generator point.
Another property is that, since it's all modulo, this number,
if B is too big, you'll wrap around, right,
because it's all modulo, this n.
So it's called a generator point because--
and every point on the curve can be one--
if you keep, say, 2G, 3G, 4G, 5G,
eventually, you'll get back to G, right?
It's all finite.
OK so does anyone have an idea, OK,
how would we make some cool one-way function
given these operations and the fact
that we have an agreed-upon point?
It's not super obvious, but it's actually
fairly straightforward.
Once you see it, you're like, oh.
Any ideas?
Yeah.
AUDIENCE: So you take your point G,
and you multiply it by some scalar,
and you publish where the point is [INAUDIBLE] scalars.
TADGE DRYJA: Yep, yep.
OK.
So your private key--
let's say private key, lowercase a,
it's just some 256 bit random number, scalar,
exactly the same as 32 bytes like in a lamport signature.
And your public key is just a times G.
And your public key is now a point on a curve.
We're going to call it uppercase A.
And you have a 32 byte x-coordinate, right--
because the x dimension is 256 bits--
32 byte y-coordinate.
So it's 64 bytes--
pretty small.
Anyone have another quick optimization
on how you could reduce this?
So there's a lot of optimization going on
in all these different cryptocurrency systems.
Because they don't scale too well.
But any idea of how to get that down?
AUDIENCE: Can you hash it?
TADGE DRYJA: You could.
So you could hash it.
You'd make it into 32 bytes.
There's a more useful way.
But if you hash it, you still have to reveal it later, right?
So it helps, making a pubkey hash.
And that's what Bitcoin did initially
is you'd have the 64 byte public keys.
And then you'd send to the hash of them.
There's a nicer way.
Any ideas?
OK.
So the thing is it's symmetric, right?
So the curve is symmetric about the x-axis.
Any ideas?
yes.
AUDIENCE: Could you just code one of the x- or y-coordinates
and say which side?
TADGE DRYJA: Right.
So you take the x-coordinate and encode that.
And then just that one bit--
4-- it's on the top.
It's on the bottom.
And then let them figure out exactly where it is, right?
So you can encode the x-coordinate only and 1 bit
for y, and then you're down to 33 bytes.
It's a little annoying because the 33 bytes, well,
one of those bytes is going to be empty.
It's just going to have one bit in it for up or down.
But whatever.
So you can do that, and that's also really cool.
OK.
Any questions so far?
AUDIENCE: Could you encode a [INAUDIBLE] and same
up and down?
Could you just encode an [INAUDIBLE]??
TADGE DRYJA: Huh.
Probably.
But I don't think it would be smaller.
I think you'd have to probably have the same size
if you use some other coordinate system.
Hm.
Yeah, you could encode a slope--
AUDIENCE: Yeah, the [INAUDIBLE].
TADGE DRYJA: And you say it intersects,
and it's at the slope, and which side of the axis?
AUDIENCE: Yeah, and if it crosses multiple times,
which one of the--
TADGE DRYJA: You probably could.
I think you would end up the same size.
But yeah.
But it might be faster for some things.
There's all sorts of different encodings.
AUDIENCE: I guarantee that you have an integer in the end.
TADGE DRYJA: Yeah.
So you might not be on the curve.
But there's a similar problem here
where it might not be on the curve.
You can encode a point that's also off the curve.
And so when you actually get a public key,
like over the wire in these systems,
the first thing you usually do is
make sure this is a valid public key,
and it's actually on the curve.
Because there's a lot of points that aren't.
And sometimes you can screw around with the code that way.
Yeah?
AUDIENCE: Can you define the scalar multiplication again?
how does it go?
TADGE DRYJA: OK.
So scalar multiplication-- a times G is, basically,
break it out as G plus G plus G plus G plus G a times.
And G plus G is find the tangent of G
and then find the intersection, right?
So this is G. This is 2G, right-- tangent, intersect,
and then the negation.
And so you just do that a bazillion times.
However, since you can find 2G and then you
can double 2G to get 4G and double 4G to get 8G,
you can come up with powers of 2 of G
and then add those powers of 2 to do
a much more efficient addition so that you can practically
make--
because little a is going to be some huge decimal 50-digit
number or whatever.
So you're going to have to do this two, four, six,
eight binary expansion and then add them all up.
OK.
Yeah?
AUDIENCE: You're chucking everything into some grain size
to get these.
You're [INAUDIBLE] a continuous curve.
TADGE DRYJA: Yes.
AUDIENCE: It feels like if you have 306 bits in the original,
you shouldn't be able to get all the way down to 32 bits, right?
Because you just want to make sure that you--
TADGE DRYJA: This?
AUDIENCE: Yeah.
If you removed a byte of precision,
would you actually be losing any power?
TADGE DRYJA: What you can do a little-- what you can do
is you can just say, look, I'm only
going to allow public keys that are on the top, positive y
component.
And everything else is disallowed.
You could have that as a rule of your system,
and then you can get rid of the y bit.
But the thing is the y-coordinate
does affect some of the things like signatures, things
that you'll later use.
So you can just say, OK, look, implicitly y bit is positive.
And you lose 1 bit of security.
Because there is a difference between
the positive and negative thing if you're doing signing.
But yeah.
And so some systems do do that.
And it makes things easier.
There's some signing systems that do that.
OK.
I'm going to go to the next part.
ECDSA is used in bitcoin.
I'm actually not going to explain it.
It's a worse signature system.
So the reason why this was used was the better system
was patented.
And so there's a clearer, more obvious, cleaner, more powerful
signature system that was patented by this German guy,
Claus Schnorr.
And so no one used it, or maybe a few people used it.
But in general, with open source things and web standards,
patents are really hard to work with.
And so all the cryptographers said,
well, we'll make this other system
that's different enough that it doesn't infringe on the patent.
And it's kind of ugly.
It does work as a signature scheme.
But actually, I'm going to explain
the Schnorr signatures, which make a lot more sense,
are easier to understand and manipulate,
and the patent has expired, so we can all use it.
And it will probably be going into Bitcoin.
And some cryptocurrencies use Schnorr signatures.
I think Monero uses a Schnorr-like signature.
But Bitcoin will probably be putting it
into Bitcoin in the next year or so.
So all these things will be much more applicable,
and you can use them.
And there's code out.
You can use them now.
OK.
Yeah.
Sorry.
Patent has expired-- free to use the better
algo that must not be named.
So yeah.
People don't call it Schnorr signature as much.
Because it's like, well, this is the guy who prevented us
from using this for 20 years.
And there's modifications to it.
OK.
So the elliptic curve signature, for lack of a better name.
This is something of a simplification.
And there's reading about how you can really
do it the right way.
But this does give you the right idea.
And for a single signer, this is secure, and it works.
But later things, you might want to actually add other stuff.
So the idea is you've got your message m, right?
Same as in lamport, same as in RSA, you've
got a message m and a private key lowercase a.
And your public key is a times G, like we said.
And then, when you want to sign, you actually
make a one-time use key pair for the signature only.
And they usually call that k.
And k is a new random number.
It's the private key.
And you're going to multiply it by G.
And they call this R. I don't know why they
use these letters, but anyway.
K times G is R. R is the other public key you're using just
for this signature.
And then the signature itself is quite straightforward.
As the signer, you compute s, which is k--
this new private key you just made up--
minus the hash of your message concatenated with R,
this public key-- so you encode the public key in 33 bytes,
stick it in there--
multiplied by a, your normal private key.
And then your signature is the pair--
the R point and the s scalar.
This is a little bit of a bunch to process.
But it's actually not too bad, right?
This is a scalar--
32 bytes-- that you made up.
The hash, this is a message, also going to be however long.
This is 33 bytes.
You concatenate them together-- hash.
You end up with another 32 byte scalar.
You multiply those two scalars.
So actually, the only elliptic curve operation
here is calculating what R is.
When you're actually calculating s,
this is all just scalar, so it's very fast.
So in terms of practical computer stuff,
doing these operations is a little bit on the slow side.
Because you're doing those point additions quite a number
of times.
So a decent code, a decent CPU, can
do on the order of thousands of these types of operations
per second, which is decent.
You can do a decent amount, but it is kind of slow.
And it's considerably slower than hash functions.
You can do a million iterations of a hash function per second
on a core, or you can do, maybe, 1,000 or 2,000
of this kind of thing, so still pretty fast but slower.
And so this signing procedure, this takes a bit of time.
This takes, actually, very little time.
Because you're just subtracting and multiplying 32 byte scalar
integers, which is much faster.
OK.
So does this make sense?
We'll go a little bit of how you can't forge this.
OK.
Sorry.
So first verification.
Sorry.
So how would you, if anyone knows or can think,
how would you verify this, right?
I know their public key--
capital A, which is little a times G--
and then they're giving me a message m,
and they're giving me a signature, which is R and s.
And they've computed it this way.
I can't verify this equation myself
because I don't know what k is.
I don't know what little a is, right?
I'm just given R, s, m, and big A. So how would I verify this?
AUDIENCE: Can I, for example, take the hash of the message,
use R, and then try random a's and that--
on, sorry.
TADGE DRYJA: Yeah.
Do you have one?
AUDIENCE: So trying random is probably [INAUDIBLE] because--
TADGE DRYJA: There's a lot, yeah.
AUDIENCE: --there's too many.
The old one that you don't know it.
But I think one more thing you could do is take little s
and multiply it by big G.
TADGE DRYJA: Yes.
AUDIENCE: And then do some--
TADGE DRYJA: Yeah.
So the basic idea-- yeah, you're right.
Multiply this side by G. Then you also
have to multiply the other side by G, right?
So you say, OK, well, as the verifier--
I've got R, I've got s--
I multiply s by G, and now I have
to multiply this side by G. And what'll that look like?
Well, what's k times G?
AUDIENCE: R.
TADGE DRYJA: R. What's little a times G?
Big A. So yeah.
I want to verify.
I want to verify that this equation was used,
right-- s equals k minus this hash times little a.
Multiply both sides by G, I get s times G
equals k times G minus the hash thing times little a times
G. This is big A. This is R.
Hey, now I know everything here as the verifier, right?
I know what s time G is.
I know what R is.
I know what big A is.
I know what's going into this hash function.
And so I just say, OK.
Well, I rearrange it a little, and I say, is R equal to this?
And if it is, that's a good signature.
If it's not, it's not a good signature.
Any questions about this?
Does this makes sense, mostly?
So for the verification, it's a bit more CPU-intense, right?
You're given a scalar.
You multiply by G. You also have to multiply A--
this is actually more costly--
you have to multiply A by this hash.
The reason that's more costly-- it's
still a scalar times a point.
The thing is, since G is used for multiplying a lot,
you precompute all sorts of coefficients times G,
and you store them in RAM or in your CPU somewhere.
So multiplying by G is a little bit faster
because you do it a lot, you can precompute a lot,
whereas multiplying by A is different each time for all
the different signatures.
So this is a little bit slower, in general.
So it takes a little bit more CPU time
to verify a signature than to create one,
something like twice as much.
But still, you can do this pretty fast.
On a modern computer, you can do thousands of these a second.
OK.
Any questions so far?
I'll go a little bit into why this is not forgible, right?
So I want to forge a signature.
Well, I don't know little a.
But why don't I make up a k and compute s and R?
But I need a, right?
So I'm going to say, OK, this is the equation I want.
I'll just make up my own k and make up my own k, s,
that'll satisfy this.
Without a, I really can't make a valid s, right?
I can make something up here.
But I can't compute it because I don't know here, right?
I can try to grind through hash functions, but it won't work.
The basic problem is if I make up an s
and solve for R, right--
I can say, well, I know what R is--
but the thing is R is in the hash function, as well.
It's the one-wayness of the hash function that
actually breaks this, right?
So I could say, well, let me solve for R. Well,
R is the hash of m and R times a plus s times G.
If I can come up with a valid R here, I can do this.
The problem is R is defined by the hash of R in this case.
I can't compute this, and I can't cancel this out, right?
So you can see just like, OK, I need to solve for R. Well,
but it's already in here.
I'm stuck.
If I don't know k, I can't come up with a signature, either.
OK.
There is one footgun with EC signatures.
If you use the same R value for different signatures
with the same pubkey, you reveal your private key.
So k has to be random and new every time.
If you look through the equation,
you'll see how you can solve--
if you have two of the same k's with different s's--
you can solve for k.
And then, once you solve for k given
this, if you know what k is, you can
find what little a is, right, as a verifier.
You can say, well, you gave me s.
That's part of the signature.
I know this whole coefficient.
If I know k, I can solve for little a by dividing this out.
So always use a new k value.
There have been many cases where people don't.
Probably the most famous one is PlayStation 3 used ECDSA.
And they used a fixed k value for signing all their code.
And that allowed people to calculate their private keys
and then run pirated games.
That was actually the first time I
learned about elliptic curve signatures,
in 2010 or something.
So actually, there's interesting tutorials
written by the hackers who broke Playstation 3 about how
this works.
And that's when I first read about it.
OK.
We're almost out of time.
Any questions about this stuff so far?
OK.
So this is a signature algorithm.
It's like, OK, fine.
This works.
I can sign, same as lamport signatures, same as RSA.
What's really nice about the elliptic curve stuff
is you can do more than just signatures.
There's all sorts of fun things you can do.
And it seems like a fairly unexplored area
in that I've found fun things that you
can do that I guess are novel.
And I'm not an expert on this at all.
It's just that it's a fairly new area of research
in how to do cool bitcoiny cryptocurrency
things with these curves and points and stuff.
So I'll go through a couple fun things you can do.
Fun with points.
OK.
So I'm not using an asterisk.
Just b times G. So let's say Alice
has a public key A. Bob has a public key B, right,
which is little a times G or little b times G. Well,
this is weird.
Little a times big B--
Alice's private key multiplied by Bob's public key--
you can do that, right?--
is equal to Bob's private key times Alice's public
key, right?
It's commutative.
The multiplication is commutative.
That's weird, right?
It's just a times G times b, which is the same--
or it doesn't matter the order-- it's
the same as b times G times a, which is the same as a times
b times G. And let's call that C. C-- this
is called a Diffie-Hellman key exchange point.
Diffie and Hellman are two also late-'70s PhD students who came
up with all this cool stuff.
No, wait.
Diffie was a PhD student.
Hellman was his advisor?
I don't remember.
Anyway, they came up with this idea.
And this is a shared point.
What would this be useful for?
It's actually super useful.
Can you think of like, oh, OK.
We can do this.
We can compute C. Why would we want to do that?
Any applications you can think of?
Yeah?
AUDIENCE: Trying to share keys with people without both
of them knowing who they are and evaluating
the other set of keys.
TADGE DRYJA: Yeah.
It's really cool to share keys.
So this is also done if you have a web browser.
I think it's all elliptic curve Diffie-Hellman now, mostly.
But if you have a web browser and you go to a website,
this is happening, basically, for every HTTPS request.
Because you can use C as a sort of
shared key and encrypt things with it.
The idea is big A can be public.
Big B can be public.
Point C will not be public.
The only way to compute it is to know
either Alice's private key-- little a--
or Bob's private key, little b.
Even though you can see the public keys,
you can't compute C from just the public keys, right?
Because big A times big B, that's not defined.
You can't multiply the points.
You have to multiply the point by a scalar.
And since you don't know the scalars
from viewing this exchange, you can't
compute C. But Alice and Bob can both compute C.
So this is really useful.
You can compute C. You can use it for encryption.
You can use it to prove you are who you say you are.
You can use it as an interactive verification
where you say, OK, I'm Bob.
I'm going to make random point.
I'm going to give that to you.
Prove you know your private key-- little a--
by telling me what C is, right?
So I can make a random key here, give it to you.
You return C. I verify that that's the right C. And I say,
OK, well you must know little a.
You haven't shown me a little a, but you've
proven that you know it, so proof of knowledge
of the private key.
So you can use that for logins instead of passwords.
And it's much simpler than a signature.
So that's really cool.
All sorts of things you can do with that.
The last part, you can have fun with points.
And I use this in my software.
So you say, OK, well, there's a public key A.
There's a public key B. And we'll
define D as the sum of these two public keys.
The thing is if you're summing these two public keys,
it's commutative, and it is the same
as saying the sum of the two private keys times G, right?
a times G plus b times G is the same as a plus
b times G, which is another really cool property.
So what if I compute this public key D,
and I say I want a signature from public key D?
You can.
And you so the private key, little d,
which can sign for this, is just little a plus little b.
So you can make a combined key.
And then either party--
Alice has little a, Bob has a little b--
and they could reveal it to each other
and allow the other person to make a signature with D later.
I use this in the lightning network software I've had,
where, basically, you say, look, we'll compute D,
and I can give you the private key, right?
I'm Bob.
And I say, OK, here's point b.
Here's point a.
We add them together.
And then, Alice, if I give you my private key, now
you can sign, but I can't, right?
So I'll give you the private key to D
from some information I have.
Or you could give me the private key to D
by sharing little a with me.
So this is another useful thing where
you can have something where both parties know
that neither party can sign unless they give them
something.
And you can use that for bitcoin addresses, things like that.
Before you even get to signatures,
you can share keys and stuff.
There's all sorts of super fun things
you can do with these points and curves and systems.
Multisignatures-- I put a link to a PDF file
that's very recent and kind of overkill and over my head,
as well.
But you can do signatures where you aggregate the signatures.
But
There's ways to say, OK, well, I'm
going to add all these R points, and I'm
going to add all these s points.
And I can have signatures that are
from a bunch of different people that
collapse into the same size as a single signature.
And I can verify that they're all signing the same thing
or, possibly, that they're all signing different things.
So I have 10 different signatures
from different people signing different messages.
But I can add up all the s values.
I have to keep the R values.
But now these 10 signatures, instead of being 64 bytes
each are more like 32 bites each.
And there's one extra 32 byte value for all of them.
So there's a lot of really cool ways
you can combine things, make things smaller,
make things really versatile, where we can share keys,
things like that, which you can't
do with hash-based signatures, hash functions.
And so it's really fun.
One worry is that if quantum computers ever
become really a thing, all this stuff stops working.
All the elliptic curve stuff doesn't work anymore.
You can figure out what people's private keys are,
whereas hash stuff, hash things like lamport signatures,
still work.
So there's some talk of hey, we should
prepare for if that ever happens and make efficient, effective
hash-based signatures and use them
in something like Bitcoin or Ethereum.
People haven't yet because the overhead is really high.
It's really big.
The real thing is all these fun things
might disappear in the next 10, 20 years.
Who knows?
But there's lots of cool stuff.
What do we do with this?
So we're not going to do anything yet
in terms of problem sets.
It's a little hard to program it directly.
But there's libraries.
And the libraries have fairly easy use,
where it's multiply by G is a function,
and multiply by this point is a function.
So this is groundwork for cool stuff you can do.
And it's really nice because it's a new area.
And I'm not an expert on this, but I've come up with stuff
that people have cited me on.
And they're like, yeah, I guess no one thought of that before.
There's a lot of things.
There's one very recent one called
Taproot, where it's one line.
You're like, oh, yeah.
That works.
And it's just these sort of elliptic curve operations.
It's like, yeah, that works.
That's really useful.
How did no one think of that?
It's one line.
Wow.
So there's just not a ton of people
working on this kind of stuff.
And so it's kind of fun.