Click here to Skip to main content
Click here to Skip to main content

Tagged as

ECG Feature Extraction with Wavelet Transform and ST Segment Detection using Matlab

, 5 Jan 2012
Rate this:
Please Sign up or sign in to vote.
Algorithm and Detailed Matlab Code for ECG Feature Extraction using Wavelet Transform

Introduction

Refer to http://en.wikipedia.org/wiki/Electrocardiography for an understanding of ECG signal and leads. Conventionally such ECG signals are acquired by ECG acquisition devices and those devices generate a printout of the lead outputs. A cardiologist analyzes the data for checking the abnormality or normalcy of the signal. But in recent times, automatic ECG processing has been of tremendous focus. http://www.codeproject.com/KB/cpp/ecg_dsp.aspx gives a fantastic overview of acquiring and filtering ECG signals through inexpensive hardware into your PC. You can download ECG signal samples of various diseases from http://www.physionet.org/physiobank/database/mitdb/. Now the main point of concern is how to develop a system for extracting the features from ECG signal so that these features can be used for Automatic Diseases Diagnosis. In this Article we shall discuss a technique for extracting features from ECG signal and further analyze for ST-Segment for elevation and depression which are symptoms of Ischemia. You can Learn more about Cardio Vascular Abnormalities and their correlation with ECG peaks fromhttp://circ.ahajournals.org/content/110/17/2721.full.

Background

We will discuss about the algorithm in detail which process the ECG signal Obtained from MIT-BIH database and are in .mat format. For the current analysis, we consider signal of both Normal Sinus Rhythm and ST-Elevated signals. Finally Using a threshold we check the normalcy of the signals.

Using the Code

First Select a filename in .mat format and load the file.
[fname path]=uigetfile('*.mat');
fname=strcat(path,fname);
load(fname );

fig-1-original_ECG.png

Append 100 zeros before and after the signal to remove the possibility of window crossing the signal boundaries while looking for peak locations.
z=zeros(100,1);

A=[z;A;z];
Perform wavelet decomposition. The process of wavelet decomposition down samples the signal. Which essentially means taking the samples at a much lower frequency than the orifinal signal. Therefore details are reduced and QRS complex is preserved.
[c,l]=wavedec(s,4,'db4');
Extract the Coefficients after the transform
ca1=appcoef(c,l,'db4',1);

ca2=appcoef(c,l,'db4',2);

ca3=appcoef(c,l,'db4',3);

ca4=appcoef(c,l,'db4',4);

fig2-ECG-wavelet-Decomposed_Signal.png

If you plot the coefficients you will observe that the frequency bands are separated and ca1,ca2,ca3 and ca4 are cleaner signal. But they will have less number of samples than the actual signal due to downsampling. You can see that first signal resembles to the actual signal but has exactly one forth number of samples because the signal was decomposed in 4 levels. 2nd level has exactly half number of samples that of 1st level, 3rd level has exactly half number of samples than the 2nd level. Because the number of samples is reduced, such signals are also called down-sampled signal.

It is clear that 2nd level decomposed data is noise free. Therefore we consider this signal as ideal ECG signal from which QRS must be detected. But the first R is located in 3rd level decomposition signal at approximately 40th sample whereas the same is located in the original signal at 260th location. Therefore once R peak is detected in 3rd level reconstructed signal, it must be cross validated in the actual signal

Detecting R peak in the down sampled Signal

First find the values which are greater than 60% of the max value of the actual signal. Invariably these are R peaks. As the decomposed signals are noise free signals, First R peak needs to be detected in the Noise free signal. But remember the ultimate goal is to detect the Peak in the original Signal. The sample values in Original Signal will be different than the decomposed signal. So Our strategy here will be to first detect the R peaks in the down sampled signal and than cross verify those points the actual signal.

Let y1 be the decomposed signal.

m1=max(y1)*.60;

P=find(y1>=m1);

So P is now set of points which satisfies the above criteria. If you observe the signal very closely, R-Peak is not a single Impulse peak, therefore there are chances of multiple points in the same peak satisfying the criteria. One thing to remember is in 500Hz sampled signal No to R-Location will be found below 350 samples. In 4th Level decomposition order this value is around 20. So first we will remove the R locations that are too close. 

P1=P; 

P2=[]; 

last=P1(1);

P2=[P2 last];

for(i=2:1:length(P1))

    if(P1(i)>(last+10))

% In this step we find R peaks which are atleast 10 samples apart

        last=P1(i);

        P2=[P2 last];

    end

end

Now Variable P2 represents the position of R-Peaks in the down sampled signal. 

Detecting R peak in the Original Signal 

Search for the position of all the location in signal y1 which are greater than this value m1. They are R locations. But before we proceed, you must know that A R Location in Rt is at least 1/4th of  the actual R location of the same point. Hence we will first map the detected positions to original signal by first multiplying with 4.  

P3=P2*4;

%Multiply the current location with 4 to get the actual scale.

Another important thing you must remember is that, R location in down sampled signal will never be on the original signal at a scale of 4. Down sampling process always deviate the signal positions. Hence we need to search for the maximum value in the Original Signal in a window of +-20 samples from the reference R point obtained as P3. 

Rloc=[];

for( i=1:1:length(P3))

  range= [P3(i)-20:P3(i)+20]

%Search within a window of +-20 samples in the original signal with reference to up scaled  R locations detected in downsampled signal.

    m=max(A(range));

    l=find(A(range)==m);

    pos=range(l);

    Rloc=[Rloc pos];

end

Ramp=A(Rloc);

It is clear now that Ramp and Rloc represents the R peak amplitude and location at the original scale. Let us see the marking of the same in the waveform.

fig-3-Detected_R-peak_in_actual_Signal.png 

Detecting Other Peaks With Reference to R-Peaks

From R-Peak Traverse Forth and Back and Search for Minima and Maxima, these are P,Q,T,S peaks respectively.  So loop in Rloc and search for the other peaks. 

Firstly, If you observe the waveform, it will be very clear that from R location if you select a window of Rloc-100 to Rloc-50 and find the maximum, than that maxima is P peak. 

X=Rloc;

y1=A;

for(i=1:1:1) % If you have a 12 lead data than, for(i=1:1:12)

for(j=1:1:length(X))

a=Rloc(i,j)-100:Rloc(i,j)-10;

    m=max(y1(a));

    b=find(y1(a)==m);

    b=b(1);

    b=a(b);

    Ploc(i,j)=b;

    Pamp(i,j)=m;

%The minima in the Window of   Rloc-100 to Rloc-10  is essentially the Q peak. 

    %% Q  Detection

    a=Rloc(i,j)-50:Rloc(i,j)-10;

    m=min(y1(a));

    b=find(y1(a)==m);

    b=b(1);

    b=a(b);

    Qloc(i,j)=b;

    Qamp(i,j)=m;

%With similar logic you  can detect the S and T peaks.

    %% S  Detection

    a=Rloc(i,j)+5:Rloc(i,j)+50;

    m=min(y1(a));

    b=find(y1(a)==m);

    b=b(1);

    b=a(b);

    Sloc(i,j)=b;

    Samp(i,j)=m;

    %% T Peak

    a=Rloc(i,j)+25:Rloc(i,j)+100;

    m=max(y1(a));

    b=find(y1(a)==m);

    b=b(1);

    b=a(b);

    Tloc(i,j)=b;

    Tamp(i,j)=m;

end

end

Once All the peaks are correctly detected, you can find the Onset and Offset as points of zero crossing for  each lead. ST Segment can be calculated from S-Offset and T-Onset. 

Points of Interest 

You might be interested to learn the feature extraction technique from 12-Lead data using PCA from this Link. 12-Lead ECG Feature extraction with PCA using Matlab. You could also consider cleaning the ECG signal before processing using Symlet or any other filtering technique. 

License

This article, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)

Share

About the Author

Grasshopper.iics
CEO Integrated Ideas
India India
gasshopper.iics is a group of like minded programmers and learners in codeproject. The basic objective is to keep in touch and be notified while a member contributes an article, to check out with technology and share what we know. We are the "students" of codeproject.
 
This group is managed by Rupam Das, an active author here. Other Notable members include Ranjan who extends his helping hands to invaluable number of authors in their articles and writes some great articles himself.
 
Rupam Das is mentor of Grasshopper Network,founder and CEO of Integrated Ideas Consultancy Services, a research consultancy firm in India. He has been part of projects in several technologies including Matlab, C#, Android, OpenCV, Drupal, Omnet++, legacy C, vb, gcc, NS-2, Arduino, Raspberry-PI. Off late he has made peace with the fact that he loves C# more than anything else but is still struck in legacy style of coding.
Rupam loves algorithm and prefers Image processing, Artificial Intelligence and Bio-medical Engineering over other technologies.
 
He is frustrated with his poor writing and "grammer" skills but happy that coding polishes these frustrations.
Group type: Organisation

82 members

Follow on   Twitter   Google+

Comments and Discussions

 
AnswerRe: about code: PingroupGrasshopper.iics23-Dec-12 11:59 
GeneralRe: about code: Pinmemberhusam a.24-Dec-12 5:18 
GeneralRe: about code: PingroupGrasshopper.iics24-Dec-12 21:24 
Answerabout ECG data PinmemberAmit kumar manocha23-Sep-12 7:48 
Questionneed help to get the ecg data set Pinmembernidhalhameed14-Jul-12 19:53 
Hi,
Good day.
I am working on " ecg recognition using intelligent technique" & i encountered the problem of how to collect the ecg arrhythmia cases with .dat format, for example : I need to collect many cases of "Atrail Premature beats" from the records (100, 118, 200, 201, 202) within the "MIT-BIH arrhythmia database" but I found it difficult.
I don’t know to collect them from that database; so, could you please let me know the proper “interval length” that I must cut, and how to do so? can u send me the data set u used in ur research ?
Kind regards,
Nidhal
AnswerRe: need help to get the ecg data set PingroupGrasshopper.iics15-Jul-12 14:28 
AnswerRe: need help to get the ecg data set PingroupGrasshopper.iics24-Sep-12 0:47 
QuestionNice but a Qusestion PinmemberIRSOG30-Mar-12 4:50 
AnswerRe: Nice but a Qusestion Pinmemberuaman rashed2-Sep-12 9:07 
GeneralRe: Nice but a Qusestion PingroupGrasshopper.iics2-Sep-12 14:18 
GeneralRe: Nice but a Qusestion Pinmemberuaman rashed2-Sep-12 22:19 
GeneralRe: Nice but a Qusestion PingroupGrasshopper.iics2-Sep-12 23:08 
GeneralRe: Nice but a Qusestion Pinmemberuaman rashed3-Sep-12 10:36 
GeneralRe: Nice but a Qusestion PingroupGrasshopper.iics3-Sep-12 14:47 

General General    News News    Suggestion Suggestion    Question Question    Bug Bug    Answer Answer    Joke Joke    Rant Rant    Admin Admin   

Use Ctrl+Left/Right to switch messages, Ctrl+Up/Down to switch threads, Ctrl+Shift+Left/Right to switch pages.

| Advertise | Privacy | Mobile
Web04 | 2.8.140827.1 | Last Updated 5 Jan 2012
Article Copyright 2012 by Grasshopper.iics
Everything else Copyright © CodeProject, 1999-2014
Terms of Service
Layout: fixed | fluid