Click here to Skip to main content
15,879,535 members
Articles / Programming Languages / C++

Write Your Own Unpacker

Rate me:
Please Sign up or sign in to vote.
4.67/5 (8 votes)
9 Aug 2010CPOL6 min read 44.9K   1.7K   31   15
This tutorial talks about writing a generic unpacker easily in 10 lines using pokas x86 Emulator

Introduction

The packers are just tools that try to encrypt a program to make it hard to be reversed. These tools can be used by many as anti-crackers and more in worms and viruses. Reversing the packer is a waste of time and not a good strategy, so there's the solution.
You can create an unpacker for these types of packers with only 10 lines, and maybe create a tool that unpacks most of the known packers. You can create this unpacker easily with Pokas x86 Emulator.

Pokas x86 Emulator

Many generic unpackers appeared, but none of them were easily usable and hence customizable. However, this tool is maybe the best tool in the unpacking field right now as it is very easy to use, very easy to be customized and open source. Now you will ask me, what's the meaning of Emulator?

Emulator is a program that simulates the processor and memory. Do what they do... diassembles an instruction and runs it in virtual registers and memory. These programs don't run without stopping, but they stop by a breakpoint you added in the debugger that is included in the emulator. This breakpoint should stop the emulator at the OEP (the original entry point).

So What's Pokas x86 Emulator?

Pokas x86 Emulator is an Application-Only emulator created for generic unpacking and testing the antivirus detection algorithms.

This Emulator has many features. Some of them are:

  1. Has an assembler and a disassembler from and to mnemonics.
  2. Support adding new APIs and adding the emulation function to them.
  3. Support a very powerful debugger that has a parser that parses the condition you give and creates a very fast native code that performs the check on this condition.
  4. Supports seh and supports tib, teb, peb and peb_ldr_data.
  5. It monitors all the memory writes and log up to 10 previous Eips and saves the last accessed and the last modified place in memory.
  6. It support 6 APIs: GetModuleHandleA, LoadLibrayA, GetProcAddress, VirtualAlloc, VirtualFree and VirtualProtect.
  7. With all of these, it's FREE and open source. It successfully emulates:
    1. UPX
    2. FSG
    3. MEW
    4. Aspack
    5. PECompact
    6. Morphine

How can I get Pokas x86 Emulator?

You can get the emulator from here. You will find a good documentation of everything in the emulator.

Why Should I Program an Unpacker?

Some people who love challenges will ask why I should program an unpacker even if it's only near 10 lines code with this emulator?

I want to say that coding the unpacker is the last step you should do. First, you should begin with reversing and finding the all anti-unpackers tricks. Then you should find the suitable breakpoint that bypasses all the packer defences and then begin converting all these into code.

The Emulator Design

I don't want to dig into the emulator, but I think it is like this:

ownunpacker/umlonly.jpg

Now I will describe the emulator components and classes that are in the UML figure.

System: This is the Emulated Operating System and CPU. This class is used for disassembling and emulating assembly instructions and APIs. It also has the ability to assemble and disassemble any instruction from or to mnemonics for easy to debug or fix any error and also it could be used in any application as a separate component.

Process: This class is the emulated application. This class is the real emulator as it calls the system functions to emulate, calls the debugger for checking the breakpoints, manage the threads and handle the exceptions (if SEH is enabled).
It also manages the virtual memory and creates the PEB structure.

Thread: This class contains all variables that we need to emulate with. It contains the registers, Eip and EFlags and fs segment. It doesn't have the debug registers (because there is no need for it) and all segments except fs. It also creates the TEB structure and handles the SEH.

Virtual Memory: This class is used to emulate the memory. It doesn't support pages, but it simply adds a pointer and the equivalent virtual pointer and the size of that buffer.
It also monitors the memory writes and detects the invalid pointers and the writing on a read only page.
It also supports VirtualProtect API.

Stack: It's a small class used to push or pop a value from or to the stack and saves the top and bottom of stack in two variables, but it doesn't support resizing.

Debugger: This class is used for adding the stop condition for the emulator. It takes a condition in a string format (ex: "eip==0x4001000 && ecx>=100") and this condition is checked with every instruction emulated by emulate or emulatecommand functions.
The feature of this debugger that it doesn't decrease the performance as it parses the string and converts it into assembly and then converts it by the assembler into a native code to be run in every check on this breakpoint.

PE: This is not a class. It's a library that contains separate functions for working with the PE executables. It contains two functions:

  1. PELoader: This function loads a PE file into memory
  2. PEDump: This function dumps a PE file from memory to the disk with a given entrypoint and sets the import directory to zero.

Using the Emulator

Don't worry if you don't understand many of the descriptions above. You will see that the usage of this emulator is very simple:

  1. Initialize your Emulated OS and CPU
  2. Manage the DLLs and Apis
  3. Emulate the undefined Apis
  4. Adding the Stop Condition
  5. Emulate your Process
  6. Handle the Unhandled Exceptions
  7. Wait for your Breakpoints to be triggered
  8. Dump your Process to a File

Let's see an example:

C++
#include <cstdlib>
#include <iostream>
#include "x86emu.h"
using namespace std;

int main() {
     //First we will create the Environment Variables 
     //the Environment Variables is just some parameters or setting 
     //will be passed to the System 

     EnviromentVariables* vars= (EnviromentVariables*)malloc
				(sizeof(EnviromentVariables)); 
     memset( vars,0,sizeof(EnviromentVariables));

     //this variable should be adjusted to make the system perform well. 
     //This path is the path to the folder that contain 
     //the important DLLs which are ("kernel32.dll","ntdll.dll","user32.dll")

     vars->dllspath="C:\\Windows\\System32\\"; 

     //here we will set the Imagebase of the kernel32.dll and user32.dll

     vars->kernel32=(dword)GetModuleHandleA("kernel32.dll");
     vars->user32=(dword)LoadLibraryA("user32.dll");

     //there's other variables but we can ignore the right now 
    
    //now we will create the new system 

     System* sys=new System(vars);
     define_dll("gdi32.dll",vars->dllspath,0x75DE0000);

		 //now We Will create a new Process
     Process* c;
      try{ 
         c=new Process(sys,"upx.exe");      	//process take two parameters system & 
					//the program filename
     }catch(int x){
            cout << "Error : File name not found\n";
     };
     //Adding new breakpoint is an easy task

     c->debugger->AddBp("__isdirty(eip)");
		 	//there's two commands to emulate c->emulate() & 
			//c->emulatecommand() but emulatecommand(int) 
     int x=c->emulate();//*/ 
     if (x!=EXP_BREAKPOINT){	//there are other exceptions like 
				//invalid pointer and so on return an error for them
        cout << "Error = " << x << "\n";
     }else{
     //Dump the PE file here 

     PEDump(c->GetThread(0)->Eip,c,"test.exe");
     }

The Debugger Breakpoints

Int3 Breakpoint or Hardware on Execution

C++
 "Eip ==0x00401000"  

Memory on Access or Write

C++
 "__lastaccessed()==0x00401000"  
 "__lastmodified()==0x00401000" 

Execution on Modified Data

C++
 "__isdirty(eip)"  

In .text section only

C++
 "__isdirty(eip) && eip>=0x401000 && eip <=0x405000"

Anti-unpackers trick : write "ret" on the real OEP and calls to it

C++
"__isdirty(eip) && ( __ read(eip) & 0xff ) !=0xC3)"

API Hooking

C++
 "__isapi()"  
 "__isapiequal('Getprocaddress')"  // not case sensitive in the api name 

Executing the Code

For compiling the program, I usually use Dev-cpp and gcc, but I think VC++ will not create any problem I think.

What are the files I need ?

You will need x86emu.dll and x86emu.a (You must add them to the linker from project options->parameters->add library or object) and some header files included in x86emu-bin.zip.

After Execution

After execution, you will see a program named test.exe that should be the unpacked application. If it doesn't run, change the compatibility from right click->properties->compatibility-> and change the Windows version and it should run correctly.

Still have a problem?

You can get more information on solving the problem by reading the emulator documentations in x86emu-doc.zip and email me with any problem at amr.thabet[at]student.alx.edu.eg.

To be Continued...

In the next article, we will expose the features of this emulator and we will also cover the entire steps to write an unpacker including reversing, analysing and programming.

This is an a screenshot of an example we will talk about in the next article and is included in the examples (attached with the article).

ownunpacker/scrshot.JPG

Hope you like this emulator and the whole article. See you soon in the next article.

Points of Interest

I love programming very much. I always wanted to know about all technologies, how they were created and how I can create the same. From this, I began digging into the processor and Windows internals. Everyone learns high level languages and I learn the lowest level. I try to know how these high level programming languages work and how Windows works. I begin digging and digging, and still I haven't reached the bottom, but I'll always try...

History

  • 9th August, 2010: Initial post

License

This article, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)


Written By
Software Developer (Senior)
Ireland Ireland
Amr Thabet (@Amr_Thabet) is a Malware Researcher with 5+ years experience in reversing malware, researching and programming. He is the Author of many open-source tools like Pokas Emulator and Security Research and Development Framework (SRDF).

Comments and Discussions

 
Questionhow could this emulator to help protect software products? Pin
Southmountain16-Nov-21 14:57
Southmountain16-Nov-21 14:57 
QuestionVery nice Pin
schizo.bunny14-Nov-12 2:56
schizo.bunny14-Nov-12 2:56 
AnswerRe: Very nice Pin
AmrThabet15-Jan-13 21:35
AmrThabet15-Jan-13 21:35 
GeneralMy vote of 5 Pin
gndnet21-Oct-12 23:19
gndnet21-Oct-12 23:19 
QuestionDownload Pin
ledtech32-Sep-12 7:33
ledtech32-Sep-12 7:33 
AnswerRe: Download Pin
AmrThabet26-Sep-12 14:12
AmrThabet26-Sep-12 14:12 
GeneralRe: Download Pin
ledtech326-Sep-12 15:09
ledtech326-Sep-12 15:09 
GeneralRe: Download Pin
AmrThabet15-Jan-13 21:42
AmrThabet15-Jan-13 21:42 
GeneralRe: Download Pin
ledtech316-Jan-13 4:15
ledtech316-Jan-13 4:15 
Questiongot my 5 Pin
marc ochsenmeier8-Feb-12 1:08
marc ochsenmeier8-Feb-12 1:08 
AnswerRe: got my 5 Pin
AmrThabet8-Feb-12 1:59
AmrThabet8-Feb-12 1:59 
GeneralMy vote of 3 Pin
S.H.Bouwhuis17-Aug-10 4:54
S.H.Bouwhuis17-Aug-10 4:54 
GeneralRe: My vote of 3 Pin
AmrThabet8-Oct-10 6:49
AmrThabet8-Oct-10 6:49 
Questionhow to compile it by vc Pin
xuefeng_work@hotmail.com10-Aug-10 0:36
xuefeng_work@hotmail.com10-Aug-10 0:36 
AnswerRe: how to compile it by vc Pin
AmrThabet12-Aug-10 9:18
AmrThabet12-Aug-10 9:18 

General General    News News    Suggestion Suggestion    Question Question    Bug Bug    Answer Answer    Joke Joke    Praise Praise    Rant Rant    Admin Admin   

Use Ctrl+Left/Right to switch messages, Ctrl+Up/Down to switch threads, Ctrl+Shift+Left/Right to switch pages.