Introduction to Buffer Overflow

 

I tried to do the easiest I could, but still this tutorial isn't one

of those where you really don't know shit about nothing and when you end it

you know all this. This one takes some work to understand, hey it took huge

work to write!

A little inside note, like everyone that is reading this lines I like to

learn, so some weeks ago I said to myself "Hey what the heck, why not to start

reading some texts about buffer overflows, I know how everything work but

just superficially", so I just started learning and now I'm trying to pass the

knowledge that I gained, to everyone that is interested. So this won't be one

of those texts where you'll learn everything, this will be like a walkthrough,

like the title says an Introduction, (In the end I'll give you some nice texts).

If you have any questions concerning this tutorial post in our message board,

if you find any "bug" in this tutorial please email me and I'll correct it.

Enjoy.

Exploit?

--------

Well probably everyone knows what an exploit is. But you still got to see

that the ones that are entering the security world for the first time

probably don't have the idea of what that is, that's why I wrote this tinny

section.

So for the ones that don't know an exploit is a program, usually written in

C, that exploits some problem that another program have. The exploit will allow

you to run arbitrary code that will let you do something that you shouldn't be

able to do in your normal status on the system.

Nowadays, most of the exploits are what we call Buffer Overflow Exploits.

What's that you ask. Wait because we'll get there. After all, this is the

subject of this tutorial.

Another thing you should know is that everyone knows how to use them(how do

you think that most of the websites that are defaced?), the script kiddies

just go to sites like security focus, packetstorm or fyodor's exploit world,

download it and run it, and then got busted. But why doesn't everybody write

exploits? Well the problem is that many people doesn't know how to spot some

vulnerability in the source code, or even if they can they aren't able to

write a exploit. So now that you have an idea of what an exploit is, let's

go ahead to the

buffer overflow section.

Buffer Overflow after all what's that?

-------------------------------------

Like I said before most of the exploits are Buffer Overflow exploits.

You are probably now thinking "Bah..this guy is bullshiting around, but

still didn't said what buffer overflow is". So let's just talk about it.

A buffer overflow problem is based in the memory where the program stores

it's data. Why's that, you ask. Well because what buffer overflow do is

overwrite expecific memory places where should be something you want, that

will make the program do something that you want.

Well some of you right now are thinking "WOW, I know how buffer overflow

works", but you still don't know how to spot them.

Let's follow a program and try to find and fix the buffer overflow

------ Partial code below--------

main(int argc, char **argv) {

char *somevar;

char *important;

somevar = (char *)malloc(sizeof(char)*4);

important = (char *)malloc(sizeof(char)*14);

strcpy(important, "command");   /*This one is the important

variable*/

stcrpy(somevar, argv[1]);

..... Code here ....

}

....   Other functions here ....

------- End Of Partial Code ------

So let's say that important variable stores some system command like, let's

say "chmod o-r file", and since that file is owned by root the program is run

under root user too, this means that if you can send commands to it, you can

execute ANY system command. So you start thinking. How the hell can I put

something that I want in the important variable. Well the way is to overflow

the memory so we can reach it. But let's see variables memory addresses.

To do that you need to re-written the code. Check the following code.

--------- Partial Code ------------

main (int argc, char **argv) {

char *somevar;

char *important;

somevar=(char *)malloc(sizeof(char)*4);

important=(char *)malloc(sizeof(char)*14);

printf("%p\n%p", somevar, important);

exit(0);

rest of code here

}

--------- End of Partial Code --------

Well we added 2 lines in the source code and left the rest unchanged. Let's

see what does two lines do.

The printf("%p\n%p", somevar, important); line will print the memory

addresses for somevar and important variables. The exit(0); will just keep the

rest of the program running after all you don't want it for nothing, your goal

was to know where is the variables are stored.

After running the program you would get an output like, you will probably

not get the same memory addresses:

0x8049700       <----- This is the address of somevar

0x8049710        <----- This is the address of important

As we can see, the important variable is next somevar, this will let us use

our buffer overflow skills, since somevar is got from argv[1]. Now, we know

that one follow the other, but let's check each memory address so we can have

the precise notion of the data storage. To do this let's re-write the code

again.

-------- Partial code ---------

main(int argc, char **argv) {

char *somevar;

char *important;

char *temp; /* will need another variable */

somevar=(char *)malloc(sizeof(char)*4);

important=(char *)malloc(sizeof(char)*14);

strcpy(important, "command");   /*This one is the important

variable*/

stcrpy(str, argv[1]);

printf("%p\n%p\n", somevar, important);

printf("Starting To Print memory address:\n");

temp = somevar; /* this will put temp at the first memory address we want

*/

while(temp < important + 14) {

/* this loop will be broken when we get to the last memory address we

want, last memory address of important variable */

printf("%p: %c (0x%x)\n", temp, *temp, *(unsigned int*)temp);

temp++;

}

exit(0);

rest of code here

}

------ End Of partial Code ------

Now let's say that the argv[1] should be in normal use send. So you just type

in your prompt:

$ program_name send

You'll get an output like:

0x8049700

0x8049710

Starting To Print memory address:

0x8049700: s (0x616c62)

0x8049701: e (0x616c)

0x8049702: n (0x61)      <---- each of this lines represent a memory address

0x8049703: d (0x0)

0x8049704:   (0x0)

0x8049705:   (0x0)

0x8049706:   (0x0)

0x8049707:   (0x0)

0x8049708:   (0x0)

0x8049709:   (0x19000000)

0x804970a:   (0x190000)

0x804970b:   (0x1900)

0x804970c:   (0x19)

0x804970d:   (0x63000000)

0x804970e:   (0x6f630000)

0x804970f:   (0x6d6f6300)

0x8049710: c (0x6d6d6f63)

0x8049711: o (0x616d6d6f)

0x8049712: m (0x6e616d6d)

0x8049713: m (0x646e616d)

0x8049714: a (0x646e61)

0x8049715: n (0x646e)

0x8049716: d (0x64)

0x8049717:   (0x0)

0x8049718:   (0x0)

0x8049719:   (0x0)

0x804971a:   (0x0)

0x804971b:   (0x0)

0x804971c:   (0x0)

0x804971d:   (0x0)

$

Nice isn't it? You can now see that there exist 12 memory address empty

between somevar and important. So let's say that you run the program with a

command line like:

$ program_name send------------newcommand

You'll get an output like:

0x8049700

0x8049710

Starting To Print memory address:

0x8049700: s (0x646e6573)

0x8049701: e (0x2d646e65)

0x8049702: n (0x2d2d646e)

0x8049703: d (0x2d2d2d64)

0x8049704: - (0x2d2d2d2d)

0x8049705: - (0x2d2d2d2d)

0x8049706: - (0x2d2d2d2d)

0x8049707: - (0x2d2d2d2d)

0x8049708: - (0x2d2d2d2d)

0x8049709: - (0x2d2d2d2d)

0x804970a: - (0x2d2d2d2d)

0x804970b: - (0x2d2d2d2d)

0x804970c: - (0x2d2d2d2d)

0x804970d: - (0x6e2d2d2d)

0x804970e: - (0x656e2d2d)

0x804970f: - (0x77656e2d)

0x8049710: n (0x6377656e) <--- memory address where important variable starts

0x8049711: e (0x6f637765)

0x8049712: w (0x6d6f6377)

0x8049713: c (0x6d6d6f63)

0x8049714: o (0x616d6d6f)

0x8049715: m (0x6e616d6d)

0x8049716: m (0x646e616d)

0x8049717: a (0x646e61)

0x8049718: n (0x646e)

0x8049719: d (0x64)

0x804971a:   (0x0)

0x804971b:   (0x0)

0x804971c:   (0x0)

0x804971d:   (0x0)

Hey cool, newcommand got over command. Now it does something you want,

instead of something he was supposed to do.

NOTE: Remember sometimes those spaces between somevar and

important can have other variables instead of being empty, so check their

values and send them to the same address, or the program can crash before

getting to the variable that you modified.

Now let's think a little. Why does this happen? As you can see in the source

code somevar is declared before important, this will make, most of the times,

that somevar will be first in memory. Now, let's check how each one is got.

Somevar gets it's value from argv[1], and important gets it from strcpy()

function, but the real problem is that important value is assign first so when

you assign value to somevar that is before it important can be overwritten.

This program could be patched against this buffer overflow switching those two

lines, becoming :

strcpy(somevar, argv[1]);

strcpy(important, "command");

If this was the way that the program was done even if you give an argument

that would get into the memory address of important, it will be overwritten by

the true command, since after getting somevar, is assign the value command to

important.

This kind of buffer overflow, is a heap buffer overflow. Like you probably

has seen they are really easy to do in theory but, in the real world, it's not

really easy to do them, after all the example I gave was a really dumb

program right? It's a real pain in the ass to find those important

variables, and also to overflow that variable you need to be able to write to

one that is in a lower memory address, most of times all this conditions

 


GSO
Written on Saturday, 03 October 2009 19:19 by GSO

Viewed 279 times so far.
Like this? Tweet it to your followers!

Rate this article

Latest articles from GSO

Latest 'tweets' from GovernmentSecurity

  • Can I get a Hoot Hoot?! #HootSuite is my number one Twitter client. http://hootsuite.com Link Friday, 06 November 2009 06:03
  • @foadah Thas what Im talking about :) Link Friday, 06 November 2009 05:58
  • #security | Don't panic over the secret copyright treaty | latest-security-news | GSO - Network Security Resources http://bit.ly/1K63Sr Link Thursday, 05 November 2009 08:01
  • #security | Which country has the most bot-infected computers? | latest-security-news | GSO - Network Security Reso... http://bit.ly/HAeG9 Link Thursday, 05 November 2009 08:01
  • #security | Backdoor access for millions of Facebook and MySpace accounts | latest-security-news | GSO - Network S... http://bit.ly/3dwnmc Link Thursday, 05 November 2009 08:01
blog comments powered by Disqus

Site Search

Sponsor Advertisements

SwiftLayer Affiliate Web Hosting

Disqus Tools